Recent Claude API updates are more focused on improvements “developers can use in everyday work”: longer outputs, a smoother console experience, and clearer usage and cost tracking. Below, we’ll quickly explain the most impactful changes—how to use the new Claude API features and who they’re for.
Extended output: Claude Sonnet 3.5 up to 8192 tokens
In the Claude API, the maximum output limit for Claude Sonnet 3.5 has been increased from 4096 to 8192 tokens, making it better suited for long-form generation, complex code completion, and multi-step reasoning summaries. For projects where “responses keep getting cut off,” this upgrade is very noticeable in practice.
Enabling it is straightforward: just include the specified beta request header in your request—for example, set anthropic-beta to max-tokens-3-5-sonnet-2024-07-15. If you’ve already set max_tokens very high in the Claude API but outputs are still short, the first thing to check is whether this header is missing.
A more usable Workbench: Prompt Generator and Evaluation Mode
The Workbench in the Claude Console now includes a “Prompt Generator.” You simply describe the task objective (such as classifying tickets, extracting fields, or generating marketing copy), and it will produce a more complete prompt structure. For people just starting to write system prompts or needing quick templates, this is much easier than building prompts from scratch.
Another practical addition is “Evaluation Mode”: you can run the same task side by side with multiple prompts, then rate the outputs on a 5-point scale. When doing A/B prompt testing for the Claude API, you no longer need to build your own comparison tool, and it’s easier for teams to align on a consistent standard.


