If you’re using the Claude API for customer support, content generation, or code analysis, this recent wave of updates is very practical: Claude Sonnet 3.5’s maximum output has been expanded, and the Claude Console Workbench has filled in prompt generation and evaluation capabilities. Below, from a “ready to use right away” perspective, I’ll clearly walk through the key new features of the Claude API in one go.
Sonnet 3.5 Extended Output: from 4096 to 8192
In the Claude API, Claude Sonnet 3.5’s maximum output token limit has doubled from 4096 to 8192, which is especially friendly for long-form summarization, long code explanations, and drafting batches of emails. To enable extended output, you need to add a beta request header to your request, rather than only changing the max_tokens parameter.
The official approach is: when calling the Claude API, include the request header "anthropic-beta": "max-tokens-3-5-sonnet-2024-07-15". It’s recommended that you first validate response length and cost changes on non-critical traffic, then gradually ramp up, to avoid cost fluctuations caused by maxing out long outputs all at once.
Claude Console Workbench Upgrade: the Prompt Generator saves more time
The Claude Console Workbench has added a “Prompt Generator.” You simply describe the task objective (for example, “classify and handle inbound customer support requests”), and it will help you produce a more complete prompt structure. For teams without dedicated prompt engineering, this is like compressing drafting time from “hours” down to “minutes.”
More importantly, it makes Claude API adoption more standardized: for the same task you can quickly get a “reusable template,” and later you only need to replace variable fields. You can also migrate the generated prompt directly into code, reducing inconsistencies between offline and online setups.
Evaluation Mode Launch: side-by-side comparison of prompt outputs, reducing guesswork
The Workbench has added “Evaluation Mode,” which supports comparing the outputs of two or more prompts side by side and rating Claude’s output on a 5-point scale. This feature is critical for iterating on Claude API prompts because it turns “it feels better” into a comparable result that can be recorded and traced back.


