Claude API New Features Quick Overview: Sonnet Extended Output, Evaluation Mode, and Usage Dashboard

If you’ve recently been building chat applications or automation workflows, several Claude API updates will directly affect “how long it can write, how to tune prompts, and how to control costs.” This article clarifies three areas—extended output, Workbench tools, and usage statistics—so you can apply them to your existing projects right away.

Sonnet Extended Output: Generate Longer Content in One Go

Claude API has increased the maximum output of Claude Sonnet 3.5 from 4096 to 8192 tokens, making long reports, long emails, or multi-part code generation much smoother. To enable extended output, you need to add a specific beta header to your request: anthropic-beta: max-tokens-3-5-sonnet-2024-07-15.

A practical approach is: first set max_tokens to match your target length, then constrain the output with a segmented structure (subheadings + bullet points) to avoid “writing a lot but drifting off-topic.” When you use the Claude API for chained tasks like summarization + rewriting + polishing, extended output can reduce the number of multi-round requests.

Workbench Enhancements: A More Useful Prompt Generator and Evaluation Mode

In the Claude Console Workbench, the new “Prompt Generator” is great for quickly drafting task templates: you describe the goal (e.g., “categorize incoming customer support requests”), and it produces a prompt structure you can paste directly into the Claude API. For new projects, this saves time compared with starting from a blank prompt.

“Evaluation Mode” is suited for A/B testing: run two prompts side by side on the same batch of inputs, then compare output quality using a 5-point rating scale. You can turn evaluation results into team standards, so all subsequent Claude API calls use the same baseline prompt.

Usage and Cost Dashboard: More Intuitive Tracking by Dollars and Tokens

The developer console now includes “Usage” and “Cost” tabs that let you view consumption by USD amount, token count, and API key. For teams sharing the Claude API across multiple environments (staging/production) or multiple business lines, this view helps you quickly pinpoint “which key is spending fast.”

I recommend using it for two things: first, set budget thresholds (cap peaks first, then optimize prompts); second, condense prompts for high-frequency requests into shorter versions to reduce unnecessary repeated context.

Release Notes and Developer Resources: No More “Guessing” About Updates

Claude API documentation now includes more comprehensive release notes, making it possible to trace changes across the API, console, and Claude apps end to end. When production behavior changes, checking the release notes is usually faster than blindly modifying code.

In addition, the official team provides courses on Claude API fundamentals, tool usage, and more, and has expanded the Cookbook (covering citations, RAG, classification, and other topics). If your Claude API project is moving into a maintainable phase, these materials can help you go from “it works” to “stable and iteratively improvable.”

Sonnet Extended Output: Generate Longer Content in One Go

Workbench Enhancements: A More Useful Prompt Generator and Evaluation Mode

Usage and Cost Dashboard: More Intuitive Tracking by Dollars and Tokens

Release Notes and Developer Resources: No More “Guessing” About Updates

Search articles

ChatGPT Pro Subscription | 30% Off | Credited in 1 Minute | Renewal Supported

Spotify Premium 3-Month Subscription | $10 Top-Up | For Your Own Account | Ad-Free Offline Listening

Popular Articles

Some of the best ChatGPT prompts—methods that can truly boost efficiency by 10x

Claude Code Installation Keeps Failing? A Step-by-Step Guide to Fix the Setup in 3 Steps

ChatGPT, Claude, Gemini, and Midjourney output fail-safe troubleshooting checklist and KISS prompt tips

An efficient ChatGPT + Claude + Gemini + Midjourney workflow to solve inconsistent outputs and rewrite meltdowns

ChatGPT and Claude always miss the point: three questioning techniques to make AI instantly understand your needs