inclusionai/ling-2.6-flash
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Novita | $0.010 | $0.030 | 262K | 100% |
Ling-2.6-flash costs $0.010 per million input tokens and $0.030 per million output tokens via OpenRouter, making it the cheapest of 298 paid models.
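As a rough illustration of what these rates mean per request, the sketch below multiplies token counts by the listed prices. The function name `estimate_cost_usd` and the example workload are hypothetical. The calculation assumes billing is strictly linear in tokens, with no minimum charge, caching discount, or extra fees.

```python
# Prices taken from the listing above (USD per million tokens).
INPUT_USD_PER_M = 0.010
OUTPUT_USD_PER_M = 0.030


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming purely linear per-token billing."""
    total_per_million = (
        input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    )
    return total_per_million / 1_000_000


# Hypothetical workload: 200,000 input tokens and 20,000 output tokens.
cost = estimate_cost_usd(200_000, 20_000)
print(f"${cost:.4f}")  # prints $0.0026
```

At these rates, even a prompt that fills most of the context window costs a fraction of a cent.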
Ling-2.6-flash scores 26.2 on the Artificial Analysis Intelligence Index, ranking 92nd of 178 benchmarked models, with a GPQA Diamond score of 59%.
Ling-2.6-flash generates around 136 tokens per second with a median (p50) time-to-first-token of 846 ms, making it the 36th-fastest tracked model.
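These two figures can be combined into a back-of-the-envelope response time: the time to first token plus the output length divided by throughput. The sketch below treats the approximate 136 tokens per second as exact and constant, and uses the median time-to-first-token, so real latencies will vary. The function name `estimate_latency_seconds` is hypothetical.

```python
# Figures taken from the listing above; both are treated as fixed values here.
TTFT_SECONDS = 0.846        # median (p50) time-to-first-token
TOKENS_PER_SECOND = 136.0   # approximate generation throughput


def estimate_latency_seconds(output_tokens: int) -> float:
    """Estimate total response time: time to first token plus generation time."""
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


# Hypothetical response of 680 output tokens: 0.846 s + 680 / 136 s.
print(f"{estimate_latency_seconds(680):.2f} s")  # prints 5.85 s
```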
Ling-2.6-flash supports a 262K-token context window and can output up to 33K tokens. It accepts text input.