Google: Gemma 4 26B A4B

google/gemma-4-26b-a4b-it

Cheaper than 91% of paidReasoningToolsJSONVision

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.070

30th cheapest

Context

262K

16K max out

How it compares

Cheaper than91%

of all ranked models

Overview

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...

Providers & pricing (9)

Provider	In $/M	Out $/M	Context	Uptime
DeepInfrafp8	$0.070	$0.340	262K	99.7%
Cloudflare	$0.100	$0.300	256K	99.8%
SiliconFlowfp8	$0.120	$0.400	262K	100%
Novitabf16	$0.130	$0.400	262K	99.8%
Ionstreambf16	$0.130	$0.400	262K	97.8%
Parasailbf16	$0.130	$0.400	262K	99.4%
Venicebf16	$0.130	$0.400	256K	99.2%
Google	$0.150	$0.600	262K	97.4%
NextBitbf16	$0.180	$0.500	262K	99.9%

Specifications

Context window262K

Max output16K

Knowledge cutoff—

Input modalitiesimage, text, video

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsgoogle/gemma-4-26B-A4B-it ↗

Gemma 4 26B A4B FAQ

How much does Gemma 4 26B A4B cost?

Gemma 4 26B A4B costs $0.070 per million input tokens and $0.340 per million output tokens via OpenRouter, making it 30th cheapest of 332 paid models.