The focus of this ChatGPT update is very clear: to make conversations feel more like “communicating with a person.” GPT-4o brings text, voice, and images into a single unified experience. Combined with ChatGPT’s memory feature and a controllable toggle, ChatGPT not only chats better—it also understands your preferences and context more deeply.
GPT-4o Takes the Stage: Text, Voice, and Visuals Finally Unified
GPT-4o is the core upgrade behind ChatGPT’s multimodal capabilities: text input, voice interaction, and image understanding can all happen within the same conversation. You can throw screenshots, photos, or charts to ChatGPT, and it can explain the content directly and continue by asking follow-up questions. For most everyday scenarios, ChatGPT’s “ways of taking in information” are closer to how humans work, and the cost of communication is noticeably lower.
Advanced Voice Mode: Smoother, and Better for Real-Time Situations
For many people, the change in their impression of ChatGPT comes from voice conversations becoming more natural—pauses and turn-taking feel more like real communication. With GPT-4o’s capabilities, ChatGPT is also better suited for tasks like real-time translation, spoken-language practice, and dictating and organizing meeting highlights—workflows where you “talk while it processes.” For mobile users, updates like this feel more tangible than a simple increase in model parameters.
ChatGPT Memory: It Remembers, But It’s Up to You
ChatGPT’s memory feature saves long-term preferences you explicitly express, such as your preferred tone, work background, or fixed formats, so you don’t have to explain everything from scratch in future conversations. According to OpenAI, ChatGPT will notify you when memory is updated and provides management and control options. You can view and delete specific memories, or turn off memory to prevent ChatGPT from continuously accumulating personal preferences.
Understanding Images and Files: Turning “Seeing” Into Actionable Steps
When ChatGPT can read images and files, the value isn’t in “recognition” itself, but in being able to convert content into the next action. For example, you can give ChatGPT a one-page report and have it extract conclusions, list risk points, and then generate a summary or an email draft in your preferred format. GPT-4o enables ChatGPT to complete “review content—ask key questions—produce output” within the same conversation, making the flow more coherent.
How to Use It More Comfortably: Two Settings Tips
If you want ChatGPT to better match your personal habits, start by telling it your fixed preferences in one or two sentences, then watch whether ChatGPT triggers a memory prompt and confirm the content. On the other hand, for privacy-sensitive or temporary projects, it’s recommended to turn off ChatGPT memory, or clear related memories after you’re done. This way, you can enjoy the natural interaction brought by GPT-4o while keeping ChatGPT’s “understanding of you” within an acceptable range.