If you’ve recently been working on long-form generation, complex reasoning, or multi-step code output, this Claude API update is worth following up on immediately. The key changes include: doubling the maximum output of Claude Sonnet 3.5, adding prompt generation and evaluation modes to the Claude Console Workbench, and a more intuitive usage and cost dashboard.
Claude API: Sonnet 3.5 output limit doubled to 8,192
In the Claude API, the maximum output token limit for Claude Sonnet 3.5 has been increased from 4,096 to 8,192, making it better suited for producing complete solutions, long documents, or longer code blocks in one go. To enable extended output, you need to add a specific beta request header to your request, rather than simply increasing the max_tokens parameter.
The official approach is to add the request header: "anthropic-beta": "max-tokens-3-5-sonnet-2024-07-15". For teams frequently troubled by “truncated output,” this step in the Claude API can significantly reduce the cost of splitting conversations and stitching results together.
Claude Console Workbench: The prompt generator is better suited for production scenarios
The Claude Console Workbench adds a new “prompt generator.” You simply describe the task in natural language—for example, “classify and route incoming customer support requests”—and it will help structure a more complete prompt for you. For those who need to integrate the Claude API into business workflows, this saves time compared with writing prompts from scratch.
It’s recommended to include your input/output formats, boundary conditions, and failure fallback strategies in the task description as well, then have the generator produce a prompt template that can be used directly with the Claude API.


