openai/gpt-audio
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| OpenAI | $2.50 | $10.00 | 128K | — |
GPT Audio costs $2.50 per million input tokens and $10.00 per million output tokens via OpenRouter, making it 253rd cheapest of 298 paid models.
GPT Audio generates around 44 tokens per second with 547ms time-to-first-token (p50), the 193rd fastest tracked model.
GPT Audio supports a 128K-token context window and can output up to 16K tokens. It accepts text, audio input.