Lowest-Latency Perceptron Models

Quick answer · Updated June 2026

Perceptron Mk1 has the lowest latency of any Perceptron model, responding in about 941ms to first token.

941msLatency

24 t/sSpeed

$0.150Input /M

33KContext

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

1P
perceptron-mk1
ReasoningJSONVision$0.150/M · 24 t/s · 33K ctx
941ms
Latency

Frequently asked

Which Perceptron model has the lowest latency?

Perceptron Mk1 has the lowest latency of any Perceptron model, responding in about 941ms to first token.

How many Perceptron models are there?

modelgrep tracks 1 Perceptron models with live benchmarks, speed, latency and per-provider pricing. 1 of them qualify for this ranking.

More Perceptron rankings

Perceptron: Smartest LLMs Perceptron: Best LLMs for Coding Perceptron: Best LLMs for Design & Frontend Perceptron: Fastest LLMs Perceptron: Cheapest LLMs Perceptron: Best Free LLMs Perceptron: Best Reasoning LLMs Perceptron: Best Vision LLMs Perceptron: Best LLMs for Agents Perceptron: Best Open-Source LLMs Perceptron: Longest-Context LLMs

All rankings

Small & Fast LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Fastest LLMs Cheapest LLMs Best Free LLMs Best Reasoning LLMs Best Vision LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs