TL;DR
Anthropic's Claude Haiku 4.5 delivers Sonnet 4-level coding performance at one-third the cost and twice the speed. Here is what developers need to know.
Read next
Anthropic has released Claude Opus 4.5, positioning it as their most capable model yet for coding agents and computer use. The release brings significant price cuts, efficiency gains, and enough au...
5 min readAnthropic's Sonnet 4.6 narrows the gap to Opus on agentic tasks, leads computer use benchmarks, and ships with a beta million-token context window. Here's what actually changed.
6 min readAnthropic's Claude Code now supports sub agents - specialized AI workers you can deploy for specific development tasks. Instead of cramming every instruction into a single system prompt, you build a ...
6 min readFive months ago, Claude Sonnet 4 was state-of-the-art. Now Claude Haiku 4.5 matches its coding performance at one-third the cost and more than twice the speed.
For model-selection context, compare this with Claude Code Agent Teams, Subagents, and MCP: The 2026 Playbook and Why Skills Beat Prompts for Coding Agents in 2026; model quality matters most when it is tied to a concrete coding workflow.
That is not marketing spin. On SWE-bench Verified - the benchmark that measures performance on real-world GitHub issues - Haiku 4.5 sits right alongside models that were considered frontier just months earlier. Anthropic released it on October 15, 2025, and it immediately changed the math on which model to use for what.
| Metric | Haiku 4.5 | Sonnet 4 | Delta |
|---|---|---|---|
| SWE-bench Verified | Near-Sonnet 4 | Frontier (at release) | Comparable |
| Speed | 2x+ faster | Baseline | Major improvement |
| Cost (input) | $1/M tokens | $3/M tokens | 3x cheaper |
| Cost (output) | $5/M tokens | $15/M tokens | 3x cheaper |
| Computer use | Surpasses Sonnet 4 | Strong | Haiku wins |
The pricing is $1 per million input tokens and $5 per million output tokens. For context, that means a typical coding session with 50K input tokens and 10K output tokens costs about $0.10. Run that same session on Sonnet 4.5 and you are paying significantly more.
Sub-agent orchestration. This is the killer use case. Sonnet 4.5 breaks down a complex problem into a multi-step plan, then dispatches a team of Haiku 4.5 instances to execute subtasks in parallel. You get frontier-level planning with fast, cheap execution. Claude Code uses this pattern heavily - Haiku 4.5 runs as the sub-agent model by default.
# In Claude Code, Haiku 4.5 powers sub-agents automatically
# The main agent (Sonnet/Opus) orchestrates, Haiku executes
claude "Refactor the auth module and update all tests"
# -> Opus plans the refactor
# -> Multiple Haiku 4.5 sub-agents execute file changes in parallel
Real-time applications. Chat assistants, customer service agents, pair programming tools - anything where latency matters. Haiku 4.5 responds fast enough that the AI feels instant rather than sluggish.
Computer use. Surprisingly, Haiku 4.5 surpasses Sonnet 4 on computer use tasks. If you are building desktop automation, the small model is actually the better choice.
High-volume batch processing. At 3x cheaper than Sonnet, running Haiku 4.5 on thousands of files, PRs, or code reviews becomes economically viable in ways that frontier models are not.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Through the API, just swap the model name:
import anthropic
client = anthropic.Anthropic()
response = client.messages.create(
model="claude-haiku-4-5",
max_tokens=1024,
messages=[
{"role": "user", "content": "Review this function for bugs..."}
]
)
In Claude Code, Haiku 4.5 is already integrated as the default sub-agent model. You do not need to configure anything - it handles the fast, parallel execution tasks while the primary model (Opus or Sonnet) handles planning and complex reasoning.
On claude.ai, Haiku 4.5 is available in the model selector for all users, including free tier.
The model lineup has a clear hierarchy now:
The practical pattern most teams settle on: Opus or Sonnet as the orchestrator, Haiku 4.5 as the executor. Planning happens once at the top. Execution happens many times in parallel at the bottom. This gives you the best of both worlds.
Augment reported that Haiku 4.5 achieves 90% of Sonnet 4.5's performance in their agentic coding evaluation. Warp called it "a leap forward for agentic coding, particularly for sub-agent orchestration." Vercel noted that "just six months ago, this level of performance would have been state-of-the-art on our internal benchmarks."
The consensus is the same from every direction: the speed-intelligence tradeoff that used to define small models is disappearing. Haiku 4.5 is not a compromise. It is a genuinely capable model that happens to be fast and cheap.
Haiku 4.5 represents a pattern in AI development. Today's frontier becomes tomorrow's commodity. The model that was cutting-edge in May is the small, cheap option by October. This compression benefits developers enormously - the capabilities you are building against keep getting cheaper to run.
For teams building on Claude, the practical takeaway is straightforward: audit your model usage. Anything running on Sonnet 4 that does not require frontier reasoning can likely drop to Haiku 4.5 with no quality loss and 3x cost savings.
Yes. It matches Sonnet 4 on SWE-bench Verified, which tests real-world GitHub issue resolution. For most coding tasks - code review, bug fixes, test generation, refactoring - Haiku 4.5 delivers results that are indistinguishable from what the larger models produce.
Haiku 4.5 outperforms both on coding benchmarks while maintaining competitive pricing. Its particular strength is agentic workflows - multi-step tasks where the model needs to use tools, navigate codebases, and make sequential decisions.
You can, but you will hit its limits on complex architectural reasoning and novel problem-solving. The recommended pattern is to use it alongside a larger model - let Sonnet or Opus handle the hard thinking, and Haiku handles the execution.
Haiku 4.5 supports a 200K token context window, same as the larger Claude models. This means it can process entire codebases, long documents, and extended conversation histories without truncation.
Yes. Full tool use, function calling, computer use, and all Claude API features are supported. There are no capability restrictions compared to larger models - only differences in reasoning depth on complex tasks.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's smallest Claude 4.5 model. Near-frontier coding performance at one-third the cost of Sonnet 4 and up to 4-5x...
View ToolAnthropic's AI. Opus 4.6 for hard problems, Sonnet 4.6 for speed, Haiku 4.5 for cost. 200K context window. Best coding m...
View ToolAnthropic's flagship reasoning model. Best-in-class for coding, long-context analysis, and agentic workflows. 1M token c...
View ToolAnthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolEvery coding agent in one window. Stop alt-tabbing between Claude, Codex, and Cursor.
View AppTurn a one-liner into a working Claude Code skill. From idea to installed in a minute.
View AppUnlock pro skills and share private collections with your team.
View AppA complete, citation-backed Claude Code course with setup, prompting systems, MCP, CI, security, cost controls, and capstone workflows.
ai-developmentAutomatic reuse of cached context for substantial cost reduction.
Claude CodeUse opus, sonnet, haiku, and best to switch models easily.
Claude Code
Nimbalyst Demo: A Visual Workspace for Codex + Claude Code with Kanban, Plans, and AI Commits Try it: https://nimbalyst.com/ Star Repo Here: https://github.com/Nimbalyst/nimbalyst This video demos N...

Claude Design by Anthropic: Generate a Design System From Your Repo + Build High-Fidelity UI Fast The video reviews Claude Design by Anthropic, calling it a highly differentiated product, and demonst...

Anthropic Releases Claude Opus 4.7: Benchmarks, Vision Upgrades, Memory, Pricing & New Claude Code Features Anthropic has released Opus 4.7, and the video covers the announcement, benchmark results, ...

Anthropic has released Claude Opus 4.5, positioning it as their most capable model yet for coding agents and computer us...

Anthropic's Sonnet 4.6 narrows the gap to Opus on agentic tasks, leads computer use benchmarks, and ships with a beta mi...

Anthropic's Claude Code now supports sub agents - specialized AI workers you can deploy for specific development tasks....

Claude Code is Anthropic's terminal-based AI agent that ships code autonomously. Complete guide: install, CLAUDE.md memo...

Two platforms, two philosophies. Here is how Anthropic and OpenAI compare on APIs, SDKs, documentation, pricing, and the...

A practical operational guide to Claude Code usage limits in 2026: plan behavior, API key pitfalls, routing choices, and...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.