
Newest audio model from Google introduces granular audio tags that give you precise control to direct AI speech for expressive audio generation.
Newest audio model from Google introduces granular audio tags that give you precise control to direct AI speech for expressive audio generation.
Gemini 3.1 Flash Tts ranks #1 of 88 in the Speech Arena with 1215 Elo, based on blind human-preference votes.
Gemini 3.1 Flash Tts runs as a hosted API endpoint on fal.ai (fal-ai/gemini-3.1-flash-tts), offered under a commercial license. No infrastructure needed — call it per generation.