modelgrep

OpenAI: GPT-3.5 Turbo 16k

openai/gpt-3.5-turbo-16k

Tools, JSON
Intelligence
Design Elo
Speed: 21 tokens/s (262nd fastest)
Latency: 314ms to first token
Input price: $3.00 per million tokens (271st cheapest)
Context: 16K (4K max output)

How it compares

Faster than 11% of all ranked models
Cheaper than 9% of all ranked models

Overview

This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up...

Providers & pricing (2)

Provider   In $/M   Out $/M   Uptime
OpenAI     $3.00    $4.00
Azure      $3.00    $4.00

Specifications

Context window: 16K
Max output: 4K
Knowledge cutoff: Sep 2021
Input modalities: text
Output modalities: text
Prompt caching
Cache read price
Moderated: Yes

GPT-3.5 Turbo 16k FAQ

How much does GPT-3.5 Turbo 16k cost?

GPT-3.5 Turbo 16k costs $3.00 per million input tokens and $4.00 per million output tokens via OpenRouter, making it the 271st cheapest of 298 paid models.
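
To make the per-million pricing concrete, here is a minimal sketch of a per-request cost estimate. The prices are the ones listed above; the function name `estimate_cost_usd` and the token counts are hypothetical, and real billing may differ (for example, due to provider-specific rounding).

```python
# Listed prices in USD per million tokens (taken from the figures above).
INPUT_PRICE_PER_M = 3.00
OUTPUT_PRICE_PER_M = 4.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 12,000-token prompt with a 1,000-token reply.
    print(f"${estimate_cost_usd(12_000, 1_000):.4f}")
```

Under these prices, a 12,000-token prompt with a 1,000-token reply comes to about $0.04.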

How fast is GPT-3.5 Turbo 16k?

GPT-3.5 Turbo 16k generates around 21 tokens per second with 314ms time-to-first-token (p50), the 262nd fastest tracked model.
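
As a rough illustration of what these two numbers mean together, the sketch below estimates the total time for a response as time-to-first-token plus generation time. It assumes the median figures above hold for a given request, which they may not; the function name `estimate_response_seconds` is hypothetical.

```python
# Median figures from the listing above; actual values vary per request.
TOKENS_PER_SECOND = 21.0
TIME_TO_FIRST_TOKEN_S = 0.314


def estimate_response_seconds(output_tokens: int) -> float:
    """Rough wall-clock estimate: first-token latency plus streaming time."""
    return TIME_TO_FIRST_TOKEN_S + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Example: a 500-token reply.
    print(f"{estimate_response_seconds(500):.1f} s")
```

At these rates, a 500-token reply would take roughly 24 seconds end to end.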

What is GPT-3.5 Turbo 16k's context window?

GPT-3.5 Turbo 16k supports a 16K-token context window and can output up to 4K tokens. It accepts text input.
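
A quick way to sanity-check a request against these limits is sketched below. It makes two assumptions that the listing does not state: that "16K" and "4K" mean 16,384 and 4,096 tokens, and that the requested output counts against the same context window as the prompt. The function name `fits_in_context` is hypothetical, and token counts would need to come from a real tokenizer.

```python
# Assumed exact limits; the listing only says "16K" and "4K".
CONTEXT_WINDOW = 16_384
MAX_OUTPUT = 4_096


def fits_in_context(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Check a request against the assumed limits (hypothetical helper).

    Assumes prompt and output share one context window.
    """
    if requested_output_tokens > MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    print(fits_in_context(12_000, 4_096))  # prompt leaves room for a full reply
    print(fits_in_context(13_000, 4_096))  # prompt plus reply exceeds the window
```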
