modelgrep

Google: Gemini 2.5 Flash Lite vs Qwen: Qwen3 VL 8B Thinking

Google: Gemini 2.5 Flash Lite wins on more metrics (6 of 8), but the right pick depends on what you optimize for — see the breakdown below.

MetricGoogle: Gemini 2.5 Flash LiteQwen: Qwen3 VL 8B Thinking
Intelligence Index17.616.7
Coding Index9.59.8
GPQA Diamond63%58%
Design Arena Elo
Speed (tokens/sec)103139
Latency399ms508ms
Input price /M$0.100$0.117
Output price /M$0.400$1.36
Context window1.0M256K
CapabilitiesReasoningToolsJSONVisionAudioReasoningToolsJSONVision