Google: Gemma 3n 4B

google/gemma-3n-e4b-it

Cheaper than 92% of paidJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.060

25th cheapest

Context

33K

How it compares

Cheaper than92%

of all ranked models

Overview

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...

Providers & pricing (1)

Provider	In $/M	Out $/M	Context	Uptime
Together	$0.060	$0.120	33K	100%

Specifications

Context window33K

Max output—

Knowledge cutoffAug 2024

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsgoogle/gemma-3n-E4B-it ↗

Gemma 3n 4B FAQ

How much does Gemma 3n 4B cost?

Gemma 3n 4B costs $0.060 per million input tokens and $0.120 per million output tokens via OpenRouter, making it 25th cheapest of 332 paid models.

What is Gemma 3n 4B's context window?

Gemma 3n 4B supports a 33K-token context window. It accepts text input.

More from google

All Gemma 3n 4B alternatives →

google/gemini-3.6-flash

Intel 50.1$1.50/M

google/gemini-3.6-flash:batch

Intel 50.1$0.750/M

google/gemini-3.5-flash-lite

Intel 36.5$0.300/M

google/gemini-3.5-flash-lite:batch

Intel 36.5$0.150/M

google/gemini-3.1-flash-lite-image

Intel —$0.250/M

google/gemini-3.1-flash-image

Intel —$0.500/M

Compare head-to-head

Google: Gemma 3n 4B vs Google: Gemini 3.6 Flash Google: Gemma 3n 4B vs Google: Gemini 3.6 Flash (batch)Google: Gemma 3n 4B vs Google: Gemini 3.5 Flash Lite Google: Gemma 3n 4B vs Google: Gemini 3.5 Flash Lite (batch)Google: Gemma 3n 4B vs Google: Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image)Google: Gemma 3n 4B vs Google: Nano Banana 2 (Gemini 3.1 Flash Image)