modelgrep
O

OpenAI: GPT-3.5 Turbo (older v0613)

openai/gpt-3.5-turbo-0613

ToolsJSON
Use via OpenRouter ↗
Intelligence
Design Elo
Speed
46
185th fastest
Latency
794ms
first token
Input price
$1.00
210th cheapest
Context
4K
4K max out

How it compares

Faster than37%
of all ranked models
Cheaper than30%
of all ranked models

Overview

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

Providers & pricing (1)

ProviderIn $/MOut $/MUptime
Azure$1.00$2.00

Specifications

Context window4K
Max output4K
Knowledge cutoffSep 2021
Input modalitiestext
Output modalitiestext
Prompt caching
Cache read price
ModeratedNo

GPT-3.5 Turbo (older v0613) FAQ

How much does GPT-3.5 Turbo (older v0613) cost?

GPT-3.5 Turbo (older v0613) costs $1.00 per million input tokens and $2.00 per million output tokens via OpenRouter, making it 210th cheapest of 298 paid models.

How fast is GPT-3.5 Turbo (older v0613)?

GPT-3.5 Turbo (older v0613) generates around 46 tokens per second with 794ms time-to-first-token (p50), the 185th fastest tracked model.

What is GPT-3.5 Turbo (older v0613)'s context window?

GPT-3.5 Turbo (older v0613) supports a 4K-token context window and can output up to 4K tokens. It accepts text input.

Compare head-to-head