Fastest Undi95 Models

Quick answer · Updated June 2026

The fastest Undi95 model is ReMM SLERP 13B at 23 output tokens per second.

23 t/sSpeed

$0.450Input /M

6KContext

AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.

1U
remm-slerp-l2-13b
JSON$0.450/M · 553ms ttft · 6K ctx
23 t/s
Speed

Frequently asked

What is the fastest Undi95 model?

The fastest Undi95 model is ReMM SLERP 13B at 23 output tokens per second.

How many Undi95 models are there?

modelgrep tracks 1 Undi95 models with live benchmarks, speed, latency and per-provider pricing. 1 of them qualify for this ranking.

More Undi95 rankings

Undi95: Smartest LLMs Undi95: Best LLMs for Coding Undi95: Best LLMs for Design & Frontend Undi95: Lowest-Latency LLMs Undi95: Cheapest LLMs Undi95: Best Free LLMs Undi95: Best Reasoning LLMs Undi95: Best Vision LLMs Undi95: Best LLMs for Agents Undi95: Best Open-Source LLMs Undi95: Longest-Context LLMs

All rankings

Small & Fast LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Lowest-Latency LLMs Cheapest LLMs Best Free LLMs Best Reasoning LLMs Best Vision LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs