创建 Completions

POST

/v1/completions

基于给定提示词从大语言模型生成响应。

授权

发起 REST API 请求时，必须在请求头中包含 AccessToken 以及 Content-Type 头。您可以使用以下格式进行授权：

--header 'Authorization: Bearer <your_token_here>'
--header 'Content-Type: application/json'

注意：请将 your_token_here 替换为您的实际 AccessToken。它包含允许服务器验证您的身份和权限的信息。您可以在此处创建 API 密钥。

请求体

字段	类型	必填	描述
`model`	string	是	用于生成响应的模型名称
`prompt`	string, string[]	是	用于生成响应的提示词，可以是单个字符串或字符串数组
`stream`	boolean, null	否	是否在生成时以流式方式返回响应（默认：false）
`max_tokens`	integer, null	否	响应中生成的最大 Token 数
`temperature`	number, null	否	生成响应时使用的采样温度
`top_k`	integer, null	否	生成响应时保留的最高概率 Token 数量，用于 top-k 采样，有助于控制输出的随机性。
`top_p`	number, null	否	生成响应时使用的核采样概率，有助于在生成的响应中平衡随机性和连贯性。
`stop`	string, string[]	否	一个或多个序列，API 遇到这些序列时将停止生成更多 Token。返回的文本不包含停止序列。
`presence_penalty`	number, null	否	根据 Token 是否出现在对话历史中施加的惩罚，有助于减少重复。
`frequency_penalty`	number, null	否	根据 Token 在对话历史中的频率施加的惩罚，有助于减少常见 Token。
`response_format`	object	否	要生成的响应格式。
`n`	integer	否	为每个输入消息生成的响应选择数量。
`seed`	integer, null	否	用于生成响应的随机种子，确保可重复性。

响应格式配置

字段	类型	必填	描述
`type`	string	是	响应格式类型，`json_object`、`json_schema`、`grammar`、`text`
`schema`	string, object	否	schema 字段定义了模型输出的预期结构。它引导模型生成遵循指定 JSON 格式的响应。
`grammar`	string, null	否	grammar 字段定义了一组约束模型输出格式的规则。它确保生成的文本遵循指定的语法或结构化模式。
`json_schema`	string, object	否	`json_schema` 字段定义了模型输出必须遵循的严格 JSON Schema。它确保生成的响应匹配指定的结构和数据类型。

请求示例

{
  "model": "minimax/minimax-m2.5",
  "prompt": "Hello, how are you?"
  "stream": false,
  "max_tokens": 100,
  "temperature": 0.7,
  "top_k": 50,
  "top_p": 0.9,
  "n": 1,
  "stop": null,
  "frequency_penalty": 0,
  "presence_penalty": 0,
  "response_format": {
    "type": "json_schema",
    "json_schema": {
      "name": "response_schema",
      "schema": {
        "type": "object",
        "properties": {
          "response": {
            "type": "string",
            "description": "The assistant response to the user message."
          }
        },
        "required": ["response"],
        "additionalProperties": false
      }
    }
  },
  "seed": 42
}

响应

成功响应

字段	类型	描述
`id`	string	聊天补全响应的唯一标识符
`created`	integer	聊天补全创建时的时间戳（自 Unix 纪元以来的秒数）
`model`	string	用于生成响应的模型名称
`choices`	array	模型生成的响应选择数组。每个选择包括生成的消息、完成原因和选择的索引。
`object`	string	返回的对象类型，此端点为 "completion"
`usage`	object	包含请求和响应 Token 使用信息的对象，包括提示词中的 Token 数、补全中的 Token 数和使用的总 Token 数。
`metadata`	object	包含有关响应的附加元数据的对象，例如模型使用的权重版本。

创建 Completions

curl 'https://api.luchentech.com/inference/v1/completions'   -H 'content-type: application/json'   -H 'Authorization: Bearer <your_token_here>'   --data '   {                                      "model": "minimax/minimax-m2.5",    "prompt": "Explain the concept of a polymer in simple terms.",    "max_tokens": 100,    "temperature": 0.7,    "top_k": 50,    "top_p": 0.9,    "n": 1,    "stop": null,    "response_format": { "type": "json_object" },    "seed": 42  }'

{    "id": "e63095aef9bc4d7292b769edb2cb6583",    "object": "chat.completion",    "created": 1773651537,    "model": "minimax/minimax-m2.5",    "choices": [        {            "index": 0,            "message": {                "role": "assistant",                "content": "Hi there! I'm doing well, thank you for asking. How about you? How's your day going so far? Is there anything I can help you with today?",                "reasoning_content": null,                "tool_calls": null            },            "logprobs": null,            "finish_reason": "stop",            "matched_stop": 248046        }    ],    "usage": {        "prompt_tokens": 15,        "total_tokens": 692,        "completion_tokens": 677,        "prompt_tokens_details": null,        "reasoning_tokens": 0    },    "metadata": {        "weight_version": "default"    }}

授权​

请求体​

响应格式配置​

请求示例​

响应​

成功响应​

授权

请求体

响应格式配置

请求示例

响应

成功响应