聊天接口

聊天接口是 New API 的核心功能，兼容 OpenAI Chat Completions API。

请求

http

POST /v1/chat/completions

请求头

http

Authorization: Bearer your-api-key
Content-Type: application/json

请求体

字段	类型	必填	说明
model	string	是	模型名称
messages	array	是	消息列表
stream	boolean	否	是否流式输出，默认 false
temperature	number	否	采样温度，0-2，默认 1
max_tokens	integer	否	最大生成 Token 数
top_p	number	否	核采样，0-1
frequency_penalty	number	否	频率惩罚，-2 到 2
presence_penalty	number	否	存在惩罚，-2 到 2

Messages 格式

json

[
  {"role": "system", "content": "You are a helpful assistant."},
  {"role": "user", "content": "Hello!"},
  {"role": "assistant", "content": "Hi there! How can I help you today?"},
  {"role": "user", "content": "What's the weather like?"}
]

角色说明：

system：系统提示，设定助手行为
user：用户输入
assistant：助手回复

响应

非流式响应

json

{
  "id": "chatcmpl-xxxxxxxx",
  "object": "chat.completion",
  "created": 1677652288,
  "model": "gpt-4",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I assist you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 9,
    "completion_tokens": 12,
    "total_tokens": 21
  }
}

流式响应

设置 stream: true，返回 SSE 格式：

data: {"id":"chatcmpl-xxx","object":"chat.completion.chunk","created":1677652288,"model":"gpt-4","choices":[{"index":0,"delta":{"role":"assistant"},"finish_reason":null}]}

data: {"id":"chatcmpl-xxx","object":"chat.completion.chunk","created":1677652288,"model":"gpt-4","choices":[{"index":0,"delta":{"content":"Hello"},"finish_reason":null}]}

data: {"id":"chatcmpl-xxx","object":"chat.completion.chunk","created":1677652288,"model":"gpt-4","choices":[{"index":0,"delta":{"content":"!"},"finish_reason":null}]}

data: {"id":"chatcmpl-xxx","object":"chat.completion.chunk","created":1677652288,"model":"gpt-4","choices":[{"index":0,"delta":{},"finish_reason":"stop"}]}

data: [DONE]

示例代码

Python 流式

python

from openai import OpenAI

client = OpenAI(
    base_url="https://your-domain.com/v1",
    api_key="your-api-key"
)

stream = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True
)

for chunk in stream:
    if chunk.choices[0].delta.content is not None:
        print(chunk.choices[0].delta.content, end="")

JavaScript 流式

javascript

const response = await fetch('https://your-domain.com/v1/chat/completions', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer your-api-key',
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    model: 'gpt-4',
    messages: [{ role: 'user', content: 'Hello!' }],
    stream: true
  })
});

const reader = response.body.getReader();
const decoder = new TextDecoder();

while (true) {
  const { done, value } = await reader.read();
  if (done) break;
  
  const chunk = decoder.decode(value);
  const lines = chunk.split('\n');
  
  for (const line of lines) {
    if (line.startsWith('data: ')) {
      const data = line.slice(6);
      if (data === '[DONE]') continue;
      
      const parsed = JSON.parse(data);
      const content = parsed.choices[0]?.delta?.content;
      if (content) {
        process.stdout.write(content);
      }
    }
  }
}

聊天接口 ​

请求 ​

请求头 ​

请求体 ​

Messages 格式 ​

响应 ​

非流式响应 ​

流式响应 ​

示例代码 ​

Python 流式 ​

JavaScript 流式 ​

聊天接口

请求

请求头

请求体

Messages 格式

响应

非流式响应

流式响应

示例代码

Python 流式

JavaScript 流式