The fastest StepFun model is Step 3.7 Flash at 92 output tokens per second. Step 3.5 Flash (47 t/s) is next.
AI models ranked by output speed, measured as median (p50) output tokens per second. Use this ranking to find the fastest large language models for low-latency and high-throughput applications.
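As a rough illustration of what a p50 ranking means, the sketch below takes the median of several throughput measurements per model and sorts the models from fastest to slowest. The sample values and the `rank_by_p50` helper are hypothetical, chosen only so that the medians match the 92 t/s and 47 t/s figures quoted on this page; this is not modelgrep's actual measurement pipeline.

```python
from statistics import median

# Hypothetical per-request throughput samples in tokens/second.
# Illustrative only: picked so the medians equal the figures quoted on this page.
samples = {
    "Step 3.5 Flash": [44.0, 47.0, 53.0],
    "Step 3.7 Flash": [88.0, 92.0, 97.0],
}


def rank_by_p50(samples):
    """Return (model, p50) pairs sorted from fastest to slowest."""
    p50 = {model: median(values) for model, values in samples.items()}
    return sorted(p50.items(), key=lambda item: item[1], reverse=True)


ranking = rank_by_p50(samples)
for model, speed in ranking:
    print(f"{model}: {speed:.0f} t/s")
```

Using the median rather than the mean keeps a single unusually slow or fast request from shifting a model's position in the ranking.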
Step 3.5 Flash (47 t/s) is the closest alternative on this metric. See the full ranking above for the tradeoffs.
modelgrep tracks two StepFun models with live benchmarks, speed, latency, and per-provider pricing; Step 3.7 Flash leads on intelligence. Both qualify for this ranking.