If you regularly use Claude for API calls or tweak prompts in the console, the most noticeable parts of this update are: longer answers, easier parameter tuning, and more transparent billing. This article breaks down several key new features in the Claude Workbench and the Claude API so you can get started right away by following along.
Claude Sonnet 3.5 long output: the right way to go from 4096 to 8192
In the Claude API, the maximum output token limit for Claude Sonnet 3.5 has been increased from 4096 to 8192, making it more friendly for long-form summarization, code generation, and multi-step reasoning. To enable long output, you need to add a specific beta request header to your request, rather than only changing max_tokens.
The official method is to add the request header: "anthropic-beta": "max-tokens-3-5-sonnet-2024-07-15". It’s recommended to enable long output only when Claude needs to “finish an entire section,” to avoid unnecessary token consumption.
Workbench prompt generator: state the task first, then let Claude fill in the structure
The Claude console Workbench now includes a prompt generator, and the workflow is straightforward: you first describe the task in one sentence (for example, “Classify inbound customer support emails and provide handling recommendations”), and Claude will produce a more complete prompt framework. It will usually also add role, input/output formats, boundary conditions, and examples.
This feature is especially useful for team collaboration: treat the prompt Claude generates as a “first-draft template,” then fine-tune it with your business terminology—it’s more reliable than writing prompts from scratch.


