After GPT-4o launched, ChatGPT became “more all-around”
The core of this ChatGPT upgrade is GPT-4o, where “o” stands for omni (all-around). It’s no longer only good at text; it integrates text, image, and voice understanding into a single reasoning system, making ChatGPT conversations more natural and faster to respond.
For most users, the most obvious change is: within the same conversation, you can have ChatGPT look at images while listening to you, then reply in text or voice—the interaction cost is noticeably lower.
Real-time translation and smoother voice conversations: Cross-language communication feels more like a real person
In the past, ChatGPT could translate too, but GPT-4o places more emphasis on conversational, real-time switching: within the same conversation, you can have ChatGPT rapidly go back and forth between multiple languages, close to an interpreting experience—very effortless for hosting overseas clients or communicating while traveling.
At the same time, ChatGPT’s voice mode is also being continuously upgraded. Officially, more lifelike voice-reply capabilities are being rolled out gradually; if you notice voice quality varies by account, it’s likely because the feature is being released in batches.
Better on desktop: One-key summon and chat search are back
ChatGPT has released a macOS desktop app. The biggest highlight is summoning it with a keyboard shortcut (Option + Space), so you no longer need to open a browser and hunt for a tab. You can also drop photos and files directly into ChatGPT on desktop for processing—making summaries, extracting key points, or organizing to-dos becomes more convenient.


