New
Work in progress: Agents Directory has just launched. Stay tuned, more content is on the way.
Sign InGPT Audio
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...
Capabilities:
Input
- Text input
- Image input (vision)
- File input (PDF)
- Audio input
- Video input
Output
- Text output
- Image output
- Audio output
Pricing & availability:
OpenRouter
$2.5 / $10 per M
Sources:
Details:
Provider
OpenAIContext window
128KInput price
$2.5/MOutput price
$10/MOpen weights
NoReleased
Jan 2026