Lowest-Latency Tencent Models

Quick answer · Updated June 2026

Hunyuan A13B Instruct has the lowest latency of any Tencent model, responding in about 1.1s to first token. Hy3 preview (4.4s) is next.

1.1sLatency

8 t/sSpeed

$0.140Input /M

131KContext

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

Frequently asked

Which Tencent model has the lowest latency?

Hunyuan A13B Instruct has the lowest latency of any Tencent model, responding in about 1.1s to first token. Hy3 preview (4.4s) is next.

What's a good alternative to Hunyuan A13B Instruct?

Hy3 preview (4.4s) is the closest alternative on this metric. See the full ranking above for the tradeoffs.

How many Tencent models are there?

modelgrep tracks 2 Tencent models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Hy3 preview. 2 of them qualify for this ranking.

More Tencent rankings

Tencent: Smartest LLMs Tencent: Best LLMs for Coding Tencent: Best LLMs for Design & Frontend Tencent: Fastest LLMs Tencent: Cheapest LLMs Tencent: Best Free LLMs Tencent: Best Reasoning LLMs Tencent: Best Vision LLMs Tencent: Best LLMs for Agents Tencent: Best Open-Source LLMs Tencent: Longest-Context LLMs

All rankings

Small & Fast LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Fastest LLMs Cheapest LLMs Best Free LLMs Best Reasoning LLMs Best Vision LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs