Skip to main content
POST
/
models
/
v2
/
openai
/
v1
/
chat
/
completions
Chat Completions
curl --request POST \
  --url https://api.bytez.com/models/v2/openai/v1/chat/completions \
  --header 'Content-Type: application/json' \
  --data '{
  "model": "<string>",
  "messages": [
    {
      "role": "system",
      "content": "<string>"
    }
  ],
  "temperature": 0.7,
  "top_p": 1,
  "stream": false
}'
{
  "id": "<string>",
  "object": "<string>",
  "created": 123,
  "choices": [
    {
      "index": 123,
      "message": {
        "role": "<string>",
        "content": "<string>"
      },
      "finish_reason": "<string>"
    }
  ]
}

Body

application/json
model
string
required

The ID of the model to run (e.g., "Qwen/Qwen3-1.7B", "openai/gpt-4")

messages
object[]
required

Conversation messages (OpenAI chat format)

temperature
number
default:0.7

Sampling temperature

top_p
number
default:1

Nucleus sampling probability

stream
boolean
default:false

Whether to stream responses

Response

Successful model completion

id
string

Unique ID for this completion

object
string

Type of returned object (usually "chat.completion")

created
integer

Unix timestamp of completion

choices
object[]

Generated completions

I