modelgrep

Lowest-Latency Perceptron Models

Quick answer · Updated June 2026

Perceptron Mk1 has the lowest latency of any Perceptron model, with a p50 time to first token of about 941ms.

Latency: 941ms
Speed: 24 t/s
Input: $0.150/M
Context: 33K

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

  1. perceptron-mk1
    Reasoning · JSON · Vision
    $0.150/M · 24 t/s · 33K ctx
    Latency: 941ms
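
A p50 ranking like the one above can be reproduced from raw measurements by taking the median time-to-first-token for each model and sorting in ascending order. The sketch below is a minimal illustration under stated assumptions, not modelgrep's actual method: the function name `rank_by_ttft_p50`, the model names, and the sample values are all hypothetical.

```python
from statistics import median


def rank_by_ttft_p50(samples_ms):
    """Return (model, p50_ms) pairs sorted from fastest to slowest.

    samples_ms maps a model name to a list of time-to-first-token
    measurements in milliseconds. Models with no samples are skipped.
    """
    p50 = {
        model: median(values)
        for model, values in samples_ms.items()
        if values
    }
    return sorted(p50.items(), key=lambda item: item[1])


# Hypothetical measurements for illustration only, not real benchmark data.
samples = {
    "model-a": [900, 941, 1010],
    "model-b": [1200, 1500, 1350],
}

ranking = rank_by_ttft_p50(samples)
for position, (model, p50_ms) in enumerate(ranking, start=1):
    print(f"{position}. {model}: {p50_ms}ms")
```

The median is used rather than the mean so that a few slow outlier requests do not distort a model's position.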

Frequently asked

Which Perceptron model has the lowest latency?

Perceptron Mk1 has the lowest latency of any Perceptron model, with a p50 time to first token of about 941ms.

How many Perceptron models are there?

modelgrep tracks 1 Perceptron model with live benchmarks, speed, latency and per-provider pricing, and it qualifies for this ranking.
