The focus of this ChatGPT update is clear: GPT-4o brings text, voice, and image capabilities together in a single conversation. For everyday users, ChatGPT now feels more like an “on-call assistant” than a tool limited to typed Q&A.
GPT-4o’s “all-in-one” conversations: use text, voice, and images together
The “o” in GPT-4o stands for omni (“all”), signaling that ChatGPT is no longer limited to text: audio, visuals, and text reasoning now live in the same workflow. Within a single conversation you can have ChatGPT look at images, read files, and then explain things to you in a natural way. Compared with older models, this multimodal integration cuts the friction of switching between tools and keeps the pace of communication smoother.
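If you want to reproduce this text-plus-image workflow outside the ChatGPT app, the same gpt-4o model is available through the OpenAI API. Below is a minimal sketch using the official openai Python SDK; the image URL and prompt are placeholders, and the call assumes an OPENAI_API_KEY is set in your environment.

```python
# Minimal sketch: sending text and an image in one GPT-4o request.
# Placeholder URL and prompt; requires OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": [
                # A text part and an image part travel in the same message,
                # which is what "multimodal in one conversation" means at the API level.
                {"type": "text", "text": "What does this chart show? Summarize it in two sentences."},
                {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},  # placeholder
            ],
        }
    ],
)
print(response.choices[0].message.content)
```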
Smoother voice interaction + real-time translation, making cross-language communication easier
ChatGPT’s voice conversations now feel closer to talking with a person: you can ask follow-up questions aloud, interrupt mid-answer, or add constraints on the fly, and ChatGPT keeps track of the context. Translation is no longer limited to “translating a passage of text”; it also supports quick switching between languages, which makes it work like real-time interpreting. For business trips, meetings, or online collaboration, ChatGPT’s real-time translation can noticeably cut down on back-and-forth clarification.
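The in-app feature is voice-driven, but the underlying translation behavior can be sketched with an ordinary text chat completion against gpt-4o. The language pair, system prompt, and sample utterances below are purely illustrative, not the app’s actual internals.

```python
# Simplified, text-only sketch of interpreting-style translation with GPT-4o.
# The language pair and sample utterances are illustrative assumptions.
from openai import OpenAI

client = OpenAI()

messages = [
    {
        "role": "system",
        "content": (
            "You are a live interpreter between English and Japanese. "
            "Translate each user message into the other language and "
            "output only the translation."
        ),
    }
]

for utterance in ["Could we move the meeting to Thursday?", "木曜日は大丈夫です。"]:
    # Append each turn so the model keeps the conversational context,
    # mirroring how follow-ups work in the voice interface.
    messages.append({"role": "user", "content": utterance})
    reply = client.chat.completions.create(model="gpt-4o", messages=messages)
    translation = reply.choices[0].message.content
    messages.append({"role": "assistant", "content": translation})
    print(f"{utterance} -> {translation}")
```

Keeping the running message list is the key design choice here: because earlier turns stay in context, the model can resolve follow-ups and interruptions the same way the voice feature does, rather than translating each utterance in isolation.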