google/gemma-3-4b-it
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| DeepInfrabf16 | $0.050 | $0.100 | 131K | 100% |
Gemma 3 4B costs $0.050 per million input tokens and $0.100 per million output tokens via OpenRouter, making it 19th cheapest of 298 paid models.
Gemma 3 4B scores 6.3 on the Artificial Analysis Intelligence Index, ranking 177th of 178 benchmarked models, with a GPQA Diamond score of 29%.
Gemma 3 4B generates around 20 tokens per second with 566ms time-to-first-token (p50), the 263rd fastest tracked model.
Gemma 3 4B supports a 131K-token context window and can output up to 16K tokens. It accepts text, image input.