Just 41 days after releasing Opus 4.7, Anthropic has officially launched Claude Opus 4.8. The new model outperforms GPT-5.5 by over 10 points on the SWE-Bench Pro benchmark and ranks first on the Artificial Analysis Intelligence Index with a score of 61.4, ahead of GPT-5.5 at 60.2. This release is considered a genuine architectural upgrade rather than a simple model iteration.
The standout feature of Opus 4.8 is the introduction of Dynamic Workflows, a tool that enables Claude to plan large tasks and distribute work across dozens to hundreds of parallel subagents, then verify outputs and return complete results. The model also achieves a 4x improvement in honesty, meaning it more accurately communicates its own uncertainty to users. In long-context tasks, Opus 4.8 significantly outperforms both GPT-5.5 and Gemini 3.1 Pro.

