Fastest Tencent Models

Quick answer · Updated June 2026

The fastest Tencent model is Hy3 preview at 48 output tokens per second. Hunyuan A13B Instruct (7 t/s) is next.

48 t/sSpeed

41.9Intelligence

$0.063Input /M

262KContext

AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.

Frequently asked

What is the fastest Tencent model?

The fastest Tencent model is Hy3 preview at 48 output tokens per second. Hunyuan A13B Instruct (7 t/s) is next.

What's a good alternative to Hy3 preview?

Hunyuan A13B Instruct (7 t/s) is the closest alternative on this metric. See the full ranking above for the tradeoffs.

How many Tencent models are there?

modelgrep tracks 2 Tencent models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Hy3 preview. 2 of them qualify for this ranking.

More Tencent rankings

Tencent: Smartest LLMs Tencent: Best LLMs for Coding Tencent: Best LLMs for Design & Frontend Tencent: Lowest-Latency LLMs Tencent: Cheapest LLMs Tencent: Best Free LLMs Tencent: Best Reasoning LLMs Tencent: Best Vision LLMs Tencent: Best LLMs for Agents Tencent: Best Open-Source LLMs Tencent: Longest-Context LLMs

All rankings

Small & Fast LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Lowest-Latency LLMs Cheapest LLMs Best Free LLMs Best Reasoning LLMs Best Vision LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs