In this update, ChatGPT doesn’t just become “better at chatting”—it also fits more naturally into everyday workflows: one-click launch on desktop, more natural voice conversations, and faster retrieval of past chats. Below, we’ll break down the most practical changes by usage scenario.
More human-like multimodal conversations: use text, images, and voice together
Built around GPT-4o, ChatGPT upgrades from being “good only at typing” to a multimodal assistant that can handle text, images, and voice at the same time. You can ask questions while sending screenshots, letting ChatGPT give steps or suggestions directly based on what’s on screen, instead of having to explain the context back and forth.
The voice conversation experience is also closer to everyday communication, making it suitable for speaking practice, on-the-spot Q&A, or quickly organizing your thoughts. For people who need accessibility support, ChatGPT’s image understanding and voice interaction are also more practically helpful.
ChatGPT for Mac: Option + Space, on call anytime
ChatGPT has launched a Mac desktop app. The most useful point is that you can quickly summon it with Option + Space, without switching to a browser to find the right tab. When you need to revise an email, review file contents, or ask a quick coding question, ChatGPT can plug into your workflow more smoothly on the desktop.
On desktop, you can also upload files and photos for ChatGPT to summarize, extract key points, or help organize materials. For people who like to “work and ask as they go,” this change in access point affects efficiency more than model parameters do.


