Google announced the launch of Gemini 3.1 Flash Live, describing it as its “highest-quality audio and speech model” so far, and positioning it as the engine behind the “biggest upgrade yet” to Gemini Live. According to official details, the new version will deliver faster responses and fewer pauses, improving the naturalness and coherence of real-time voice conversations.
Google says Gemini 3.1 Flash Live is better at speech understanding, including recognizing acoustic details such as pitch and speaking speed. It also offers lower latency compared with the previous 2.5 Flash Native Audio. This suggests that in scenarios like multi-turn conversations and instant Q&A, Gemini Live may move closer to a truly real-time interaction experience—closer to “thinking while speaking.”

