The best text to speech model is Gemini 3.1 Flash Tts, rated 1215 Elo (#1 of 88) in the Speech Arena by human preference. 1 of the 4 models below carry arena rankings.
AI voice and text-to-speech models ranked by human-preference Elo from the Artificial Analysis Speech Arena.