If you’ve recently been using the Claude API to build apps or agents, the most noticeable changes are: it can talk longer, costs are easier to check, and updates are easier to track. Below, I’ll break down these new Claude API features clearly so you can apply them directly to your project.
Claude API Model Update: Sonnet 3.5 Is Faster and Stronger
The Claude API has made Claude Sonnet 3.5 an important new-generation choice. Across multiple evaluations, it surpasses same-tier competitors and the older Opus 3, while maintaining the speed and cost advantages of a mid-tier model. For API scenarios that need “fast responses, consistent quality,” this kind of upgrade is very noticeable. You can first validate the results in the Claude Console workspace, then migrate the same set of prompts into your Claude API calls.
Claude API Long Output Doubled: How to Enable 8192 from 4096
Many people run into Claude API replies getting cut off; fundamentally, it’s because the maximum output token limit isn’t high enough. Now, Claude Sonnet 3.5 in the Claude API supports increasing the maximum output from 4096 to 8192, but you need to explicitly enable the beta capability. To do so, include the request header:
"anthropic-beta": "max-tokens-3-5-sonnet-2024-07-15"


