← Back to feed
AI

Google ships Gemini 3.1 Flash TTS — 70+ languages, Elo 1211 on Artificial Analysis leaderboard

Google rolled out Gemini 3.1 Flash TTS starting April 15 across Gemini API, AI Studio, Vertex AI, and Google Workspace (Google Vids). It supports more than 70 languages, natural-language 'audio tags' for controlling vocal style, pace and delivery mid-sentence, native multi-speaker dialogue, and scene direction. Every generated clip is watermarked with SynthID. On the Artificial Analysis TTS leaderboard, Flash TTS landed an Elo score of 1211, placing it in the 'most attractive quadrant' for quality-per-dollar and directly challenging ElevenLabs' pricing premium. Google did not publish exact latency or pricing numbers.

googledeepmindgeminittselevenlabs

Why it matters

TTS has been ElevenLabs' moat for 18 months. An Elo of 1211 on Artificial Analysis plus 70-language coverage bundled into the Gemini API at Google's usual price aggression threatens that moat directly. For any product that ships conversational agents — support, accessibility, localization, content narration — the default TTS vendor conversation gets rewritten this quarter. Expect ElevenLabs to either cut API pricing by 40%+ or pivot toward voice-cloning/creator tooling.

Impact scorecard

7.6/10
Stakes
7.5
Novelty
7.5
Authority
8.5
Coverage
7.0
Concreteness
8.0
Social
7.0
FUD risk
2.0
Coverage22 outlets · 3 tier-1
The Verge, TechCrunch, Ars Technica, VentureBeat
X / Twitter6,000 mentions
@GoogleDeepMind · 8,000 likes
Reddit900 upvotes
r/MachineLearning
r/MachineLearning, r/singularity

Trust check

high

Primary-source Google announcement; Artificial Analysis leaderboard is independent and verifiable. Elo 1211 is corroborated live. No FUD flags.

Primary source ↗