Google has officially launched Gemini 3.1 Flash Live, a voice model it calls its "highest quality audio and voice model to date." The model brings major upgrades to the Gemini Live and Search Live services. It is currently available in preview in Google AI Studio through the Gemini Live API, marking a notable step forward in Google's real-time voice interaction technology.
Compared with the previous-generation 2.5 Flash Native Audio, Gemini 3.1 Flash Live recognizes acoustic details such as pitch and rhythm more effectively while reducing latency. It also significantly improves environmental noise filtering, better distinguishing speech from background sounds such as traffic or a TV. In the Gemini Live app on Android and iOS, users will get faster responses with "reduced awkward pauses," and the model can follow a conversation thread for twice as long, which helps it stay coherent during extended brainstorming. Additionally, Gemini Live can now dynamically adjust response length and tone to match the context.

