modelgrep

Fastest Undi95 Models

Quick answer · Updated June 2026

The fastest Undi95 model is ReMM SLERP 13B at 23 output tokens per second.

23 t/sSpeed
$0.450Input /M
6KContext

AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.

  1. 1U
    remm-slerp-l2-13b
    JSON$0.450/M · 553ms ttft · 6K ctx
    23 t/s
    Speed

Frequently asked

What is the fastest Undi95 model?

The fastest Undi95 model is ReMM SLERP 13B at 23 output tokens per second.

How many Undi95 models are there?

modelgrep tracks 1 Undi95 models with live benchmarks, speed, latency and per-provider pricing. 1 of them qualify for this ranking.

More Undi95 rankings

All rankings