Bytez home page
Search...
Support
Bytez-com/docs
Bytez-com/docs
Search...
Navigation
Multimodal
Chat + Audio
API Documentation
API Reference
Pricing
Company
Bytez Platform
Discord
API Reference
Overview
Endpoints
GET
Models
GET
Tasks
GET
Clusters
Closed Source
POST
OpenAI
POST
Google
POST
Cohere
POST
Anthropic
POST
Mistral
Multimodal
POST
Chat + Text
POST
Chat + Vision
POST
Chat + Audio
POST
Chat + Video
Text as Input
POST
Fill Mask
POST
Text-to-Speech
POST
Text-to-Audio
POST
Text-to-Image
POST
Translation
POST
Summarization
POST
Feature Extraction
POST
Text Classification
POST
Token Classification
POST
Text2Text Generation
POST
Text Generation
Image as Input
POST
Image to Text
POST
Image Classification
POST
Image Segmentation
POST
Depth Estimation
POST
Object Detection
POST
Mask Generation
POST
Image Feature Extraction
Multi-input
POST
Question Answering
POST
Document Question Answering
POST
Visual Question Answering
POST
Zero Shot Object Detection
POST
Zero Shot Image Classification
POST
Zero Shot Classification
Multimodal
Chat + Audio
Analyze audio using the Qwen2-Audio-7B-Instruct model.
POST
/
models
/
v2
/
Qwen
/
Qwen2-Audio-7B-Instruct
Try it
Body
application/json
messages
object[]
Show child attributes
messages.
content
object[]
Show child attributes
messages.content.
text
string
messages.content.
type
string
messages.content.
url
string
messages.
role
string
Response
200 - application/json
output
string[]
Chat + Vision
Chat + Video