
TL;DR
Claude Sonnet 5 lands near Opus 4.8 on some tasks for a fraction of the price - but a new tokenizer runs about 30 percent more tokens. Here is the upgrade decision for builders, with the numbers.
Direct answer
Claude Sonnet 5 lands near Opus 4.8 on some tasks for a fraction of the price - but a new tokenizer runs about 30 percent more tokens. Here is the upgrade decision for builders, with the numbers.
Best for
Developers comparing real tool tradeoffs before choosing a stack.
Covers
Verdict, tradeoffs, pricing signals, workflow fit, and related alternatives.
Anthropic shipped Claude Sonnet 5 on June 30, 2026, and made it the default model for Free and Pro. The pitch is simple: performance close to Opus 4.8 on agentic and coding work, at Sonnet prices. It is a strong upgrade, but there is one catch in the fine print that changes the cost math. Here is the builder decision.
Sonnet 5 (claude-sonnet-5) is available same-day across the Claude API, Claude Code, Amazon Bedrock, Google Vertex, and Microsoft Foundry. It is the default on Free and Pro and available on Max, Team, and Enterprise. Anthropic calls it "the most agentic Sonnet yet" - built to plan, use browsers and terminals, and run autonomously.
Key specs:
effort (defaults to high on the API and in Claude Code)From Anthropic's official system card (Sonnet 5 at adaptive thinking, max effort, 5-trial average):
| Benchmark | Sonnet 5 | Sonnet 4.6 | GPT-5.5 | Gemini 3.5 Flash |
|---|---|---|---|---|
| SWE-bench Verified | 85.2% | - | - | - |
| SWE-bench Pro | 63.2 | 58.1 | 58.6 | 55.1 |
| Terminal-Bench 2.1 | 80.4 | 67.0 | 83.4 | 76.2 |
| BrowseComp | 84.7 | 76.2 | 84.4 | - |
| Humanity's Last Exam (with tools) | 57.4 | 46.8 | 52.2 | - |
| OSWorld-Verified | 81.2 | 78.5 | 78.7 | 78.4 |
| FrontierCode v1 | 38.8 | 15.1 | 25.5 | - |
| GDPval-AA v2 (Elo) | 1618 | 1395 | 1509 | 1357 |
The story is coding and agents. FrontierCode more than doubled over Sonnet 4.6 (15.1 to 38.8), SWE-bench Pro and BrowseComp both jumped, and it leads GPT-5.5 and Gemini 3.5 Flash on most of the agentic and knowledge benchmarks. Two spots where a competitor leads: Terminal-Bench (GPT-5.5 via Codex CLI) and AutomationBench (Gemini 3.5 Flash).
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Jul 1, 2026 • 8 min read
Jul 1, 2026 • 8 min read
Jul 1, 2026 • 8 min read
Jul 1, 2026 • 6 min read
Sonnet 5 introductory pricing is $2 per 1M input and $10 per 1M output through August 31, 2026, then $3 / $15 after (the same per-token rate as Sonnet 4.6). For reference, Opus 4.8 is $5 / $25. So on tasks where Sonnet 5 lands close to Opus 4.8, you get comparable results for roughly half the output price.
Sonnet 5 uses a new tokenizer that produces about 30 percent more tokens for the same text (Anthropic's own footnote gives a 1.0 to 1.35x range by content type). The per-token price is unchanged, but that means an equivalent request can cost slightly more than it did on Sonnet 4.6, and your max_tokens budgets may need re-checking. "Same per-token price" is not the same as "same per-task cost." Model this before you migrate a high-volume workload.
Anthropic's system card is refreshingly direct: overall performance is "comparable to Sonnet 4.6" and Sonnet 5 "does not advance our capability frontier" against Opus and Mythos-class models. The real gains are concentrated in agentic and coding tasks, plus it is the first Sonnet-tier model with real-time cyber safeguards on by default (and it is deliberately weak at cyber-offense by design).
Upgrade now if you run coding agents, autonomous workflows, or browser and terminal tasks. The FrontierCode and SWE-bench gains are real, and near-Opus quality at Sonnet prices is a genuine cost win for agent-heavy products.
Hold or test first if your workload is high-volume and cost-sensitive - the tokenizer inflation can quietly raise per-task cost, so measure on your own traffic before flipping the default.
Migration itself is close to a drop-in: swap the model ID, remove manual thinking budgets and non-default sampling params (they now 400), and re-verify your max_tokens because of the tokenizer change.
Yes on agentic and coding tasks - it beats Sonnet 4.6 across Anthropic's benchmark suite, with FrontierCode more than doubling (15.1 to 38.8). Anthropic notes overall quality is otherwise comparable, so the biggest wins are concentrated in coding and agents rather than a blanket jump.
Introductory pricing is $2 per million input tokens and $10 per million output through August 31, 2026, then $3 / $15. That is the same per-token rate as Sonnet 4.6 and cheaper than Opus 4.8 ($5 / $25).
Sonnet 5 uses a new tokenizer that generates roughly 30 percent more tokens for the same text. The per-token price is unchanged, so an equivalent request can cost a bit more per task than on Sonnet 4.6.
No. It is close to a drop-in: change the model ID, drop manual extended-thinking budgets and non-default sampling parameters (both now return 400), and re-check your max output token budgets because of the tokenizer change.
Read next
The orchestrator is the most important model choice in an agent fleet. A fair head-to-head between Fable 5 and Opus 4.8 for that role, with a decision matrix by run length, budget, compliance, and refusal-handling tolerance.
8 min readAnthropic releases Claude Sonnet 5 with improved agentic capabilities, better tool use, and an introductory pricing deal. Here's what developers need to know.
6 min readAnthropic's Claude Haiku 4.5 delivers Sonnet 4-level coding performance at one-third the cost and twice the speed. Here is what developers need to know.
5 min readTechnical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's AI. Opus 4.6 for hard problems, Sonnet 4.6 for speed, Haiku 4.5 for cost. 200K context window. Best coding m...
View ToolAnthropic's smallest Claude 4.5 model. Near-frontier coding performance at one-third the cost of Sonnet 4 and up to 4-5x...
View ToolAnthropic's flagship reasoning model. Best-in-class for coding, long-context analysis, and agentic workflows. 1M token c...
View ToolAnthropic's first generally available Mythos-class model, released June 9, 2026. 1M context, 128K max output, $10/$50 pe...
View ToolEvery coding agent in one window. Stop alt-tabbing between Claude, Codex, and Cursor.
View AppTurn a one-liner into a working Claude Code skill. From idea to installed in a minute.
View AppUnlock pro skills and share private collections with your team.
View AppUse opus, sonnet, haiku, and best to switch models easily.
Claude CodeHybrid mode: Opus for planning, Sonnet for execution.
Claude CodeExtended context window for Opus and Sonnet on supported plans.
Claude Code
Open Design: Open-Source n8n App That Turns Any Website into a Brand Kit, Design System, HTML + Images The video introduces Open Design, an MIT-licensed full-stack template that combines AI and n8n a...

Anthropic Suspends Fable 5 & Mythos 5 After US Export Control Directive (Jailbreak Concerns) Anthropic announced that the US government issued export control directives requiring it to suspend Fable ...

Claude Fable 5 Released: Benchmarks, Pricing, Availability, and Real-World Examples Anthropic has released Claude Fable 5, the first general-use “Mythos class” model, and the video reviews the announ...

Anthropic's most capable model launched, got suspended by a US export-control order, and returned today. Here is what Fa...

Anthropic releases Claude Sonnet 5 with improved agentic capabilities, better tool use, and an introductory pricing deal...

Anthropic shipped two names for one architecture on June 9, 2026. Here is what separates Fable 5 from Mythos 5, who can...

Anthropic's Claude Haiku 4.5 delivers Sonnet 4-level coding performance at one-third the cost and twice the speed. Here...

Standing up a fleet of Fable 5 agents is the easy part. This is the operations layer - data retention rules, refusal-rat...

The orchestrator is the most important model choice in an agent fleet. A fair head-to-head between Fable 5 and Opus 4.8...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.