google/gemma-3-12b-it
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| DeepInfrabf16 | $0.050 | $0.150 | 131K | 100% |
Gemma 3 12B costs $0.050 per million input tokens and $0.150 per million output tokens via OpenRouter, making it 20th cheapest of 298 paid models.
Gemma 3 12B scores 8.8 on the Artificial Analysis Intelligence Index, ranking 170th of 178 benchmarked models, with a GPQA Diamond score of 35%.
Gemma 3 12B generates around 37 tokens per second with 501ms time-to-first-token (p50), the 211th fastest tracked model.
Gemma 3 12B supports a 131K-token context window and can output up to 16K tokens. It accepts text, image input.