modelgrep
I

inclusionAI: Ling-2.6-flash

inclusionai/ling-2.6-flash

92nd smartest of 178Cheaper than 100% of paidToolsJSON
Use via OpenRouter ↗
Intelligence
26.2
92nd of 178
Design Elo
Speed
136
36th fastest
Latency
846ms
first token
Input price
$0.010
1st cheapest
Context
262K
33K max out

How it compares

Smarter than48%
of all ranked models
Faster than88%
of all ranked models
Cheaper than100%
of all ranked models

Overview

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

Benchmarks

independent · via OpenRouter
Artificial Analysis46th percentile
Intelligence Index
26.2
Coding Index
23.2
Agentic Index
38.1
GPQA Diamond
59%
Humanity's Last Exam
6%
SciCode
27%
Tau²-Bench (agentic)
86%

Providers & pricing (1)

ProviderIn $/MOut $/MUptime
Novita$0.010$0.030100%

Specifications

Context window262K
Max output33K
Knowledge cutoff
Input modalitiestext
Output modalitiestext
Prompt caching
Cache read price$0.0020/M
ModeratedNo

Ling-2.6-flash FAQ

How much does Ling-2.6-flash cost?

Ling-2.6-flash costs $0.010 per million input tokens and $0.030 per million output tokens via OpenRouter, making it 1st cheapest of 298 paid models.

How smart is Ling-2.6-flash?

Ling-2.6-flash scores 26.2 on the Artificial Analysis Intelligence Index, ranking 92nd of 178 benchmarked models, with a GPQA Diamond score of 59%.

How fast is Ling-2.6-flash?

Ling-2.6-flash generates around 136 tokens per second with 846ms time-to-first-token (p50), the 36th fastest tracked model.

What is Ling-2.6-flash's context window?

Ling-2.6-flash supports a 262K-token context window and can output up to 33K tokens. It accepts text input.

Compare head-to-head