OpenAI: GPT-5.1 Chat

openai/gpt-5.1-chat

ToolsJSONVision

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

140th fastest

Latency

1.1s

first token

Input price

$1.25

224th cheapest

Context

128K

32K max out

How it compares

Faster than55%

of all ranked models

Cheaper than25%

of all ranked models

Overview

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on...

Providers & pricing (2)

Provider	In $/M	Out $/M	Context	Uptime
Azurecache	$1.25	$10.00	128K	—
OpenAIcache	$1.25	$10.00	128K	100%

Specifications

Context window128K

Max output32K

Knowledge cutoff—

Input modalitiesfile, image, text

Output modalitiestext

Prompt cachingSupported

Cache read price$0.130/M

ModeratedNo

GPT-5.1 Chat FAQ

How much does GPT-5.1 Chat cost?

GPT-5.1 Chat costs $1.25 per million input tokens and $10.00 per million output tokens via OpenRouter, making it 224th cheapest of 298 paid models.

How fast is GPT-5.1 Chat?

GPT-5.1 Chat generates around 61 tokens per second with 1.1s time-to-first-token (p50), the 140th fastest tracked model.

What is GPT-5.1 Chat's context window?

GPT-5.1 Chat supports a 128K-token context window and can output up to 32K tokens. It accepts file, image, text input.

More from openai

All GPT-5.1 Chat alternatives →

openai/gpt-chat-latest

openai/gpt-5.4-image-2

Compare head-to-head

OpenAI: GPT-5.1 Chat vs OpenAI: GPT Chat Latest OpenAI: GPT-5.1 Chat vs OpenAI: GPT-5.5 Pro OpenAI: GPT-5.1 Chat vs OpenAI: GPT-5.5 OpenAI: GPT-5.1 Chat vs OpenAI: GPT-5.4 Image 2 OpenAI: GPT-5.1 Chat vs OpenAI: GPT-5.4 Nano OpenAI: GPT-5.1 Chat vs OpenAI: GPT-5.4 Mini