Body
application/json
The ID of the model to run (e.g., "Qwen/Qwen3-1.7B", "openai/gpt-4")
Conversation messages (OpenAI chat format)
Sampling temperature
Nucleus sampling probability
Whether to stream responses
Sends messages to an OpenAI compatible model and returns a completion. Provides completions for all open source models that are ‘text-generation’, ‘chat’, ‘audio-text-to-text’, ‘image-text-to-text’, ‘video-text-to-text’, it also supports the closed source providers, “openai”, “anthropic”, “mistral”, “cohere”, and “google”.
The ID of the model to run (e.g., "Qwen/Qwen3-1.7B", "openai/gpt-4")
Conversation messages (OpenAI chat format)
Sampling temperature
Nucleus sampling probability
Whether to stream responses