If you’ve recently been using the Claude API to build apps or agents, the most noticeable changes are in the console and documentation: the Claude API now has new usage and cost dashboards, release notes have been filled out, and Claude Sonnet 3.5 in the Claude API supports longer outputs. Below, I’ll explain the new features around what developers care about most: “see clearly, write longer, and track.”
Claude API output limit doubled: better delivery for long-form text and code
In the Claude API, Claude Sonnet 3.5’s maximum output tokens increased from 4096 to 8192, making it suitable for returning longer summaries, reports, code generation, and multi-step reasoning results in a single response. To enable extended output in the Claude API, you need to add the specified beta request header in your request: "anthropic-beta": "max-tokens-3-5-sonnet-2024-07-15".
In practice, it’s recommended to reserve “long output” in the Claude API for scenarios that truly need it—for example: generating a complete PRD, outputting a runnable script, or delivering an annotated refactoring plan in one go. For requests that only need a conclusion, it’s still better in the Claude API to control max_tokens to avoid burning your budget on verbose wording.
Claude API Console adds usage and cost dashboards: more transparent billing
With the new “Usage” and “Cost” tabs in the developer console, Claude API consumption can finally be viewed by USD amount, token count, and API key. For teams that share a Claude API key across multiple environments and projects, this makes it faster to pinpoint anomalous calls and the source of sudden cost spikes.
Even more practical is that you can track experimental calls and production calls separately in the Claude API: split API keys across services, then compare token and cost curves in the dashboard. This way, Claude API budget discussions no longer rely on “gut feeling”—you can speak directly with data.


