Anthropic recently delivered a major upgrade to its Claude 3.5 Sonnet model, adding the ability to control a computer and achieving another leap in coding performance. The update also introduces the all-new Claude 3.5 Haiku model, further diversifying its product offerings. Here’s a look at the standout features.
Claude 3.5 Sonnet Gains Computer Control
The most eye-catching addition to the new Claude 3.5 Sonnet is its ability to operate a computer. Anthropic built a dedicated API that lets the model perceive a computer interface and interact with it much like a human would. Developers can integrate the API to have Claude perform tasks such as moving the cursor, clicking buttons, and filling out forms.
In the OSWorld benchmark, Claude 3.5 Sonnet scored 14.9% in screenshot-only mode, significantly outperforming other AI systems. While the model still faces challenges with scrolling or dragging, well-known companies like Asana and Replit are already testing the feature. This capability opens up new possibilities for automating repetitive workflows.
Significant Coding Improvements and Performance Optimization
The updated Claude 3.5 Sonnet delivers a major leap in coding performance. On the SWE-bench Verified test, its score jumped from the previous 33.4% to 49.0%, surpassing reasoning models including o1-preview. Early client feedback shows that GitLab saw a 10% improvement in reasoning for DevSecOps tasks without any increase in latency when using the model.

