Cheap subagents are better when their work is visible

Official Sources

Resource	Description
DeepSeek pricing	Current DeepSeek V4 Flash and V4 Pro per-token rates
Claude Code subagents	Subagents run in their own context window with restricted tools
Codex CLI subagents	Codex subagent workflows for parallelizing larger tasks
AgentCanvas	The board cheap subagents write to

The economics of subagents flipped in 2026. DeepSeek V4 Flash costs $0.14 per million input tokens and $0.28 per million output tokens. GLM-5.2 is open-weights, so if you host it yourself there is no per-token fee, only the cost of your own hardware. Kimi is priced in the same band. At those prices you can afford to spin up a dozen sidecar subagents to draft, explore, and sketch - work you would never pay frontier-model prices for.
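
To make that arithmetic concrete, here is a rough sketch using the V4 Flash rates quoted above. The per-subagent token counts are illustrative assumptions, not measurements, so treat the totals as an order-of-magnitude check.

```python
# Rates quoted above for DeepSeek V4 Flash, in USD per million tokens.
INPUT_USD_PER_MTOK = 0.14
OUTPUT_USD_PER_MTOK = 0.28


def run_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Cost of one subagent run at the rates above."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Assumed workload (hypothetical): each sidecar reads 20k tokens and writes 5k.
per_run = run_cost_usd(input_tokens=20_000, output_tokens=5_000)
dozen = 12 * per_run

print(f"one subagent: ${per_run:.4f}")  # one subagent: $0.0042
print(f"a dozen:      ${dozen:.4f}")    # a dozen:      $0.0504
```

Under those assumptions a dozen exploratory runs cost about five cents in total, which is why the per-run price stops being the deciding factor.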

The reason most people do not do this is not cost. It is that the output of a cheap subagent is usually invisible. It lives in a transcript nobody opens, in a context window that closes when the subagent returns, and the only thing that survives is a one-line summary. Cheap work you cannot inspect is just expensive noise.

The visibility problem

Subagents are designed to isolate context. Each one runs in its own fresh conversation, does its work, and returns a single text result to the parent. The intermediate tool calls and outputs stay inside the subagent. That is the feature: the parent's context stays clean.

It is also the trap. When the subagent is cheap and exploratory, the interesting part is the exploration - the drafts it tried, the options it sketched, the dead ends it hit. All of that gets thrown away by design. You paid $0.004 for a subagent to explore five approaches and you get back "approach 3 looks best" with no evidence.

This is the same dynamic covered in the agent teams playbook: specialization is good, but specialization without a shared surface means every handoff is lossy. The fix for cheap subagents is the same as the fix for expensive ones: give them a place to put the work where a human or another agent can look at it.
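
The lossy handoff can be shown with a toy model. This is a sketch, not how any real agent runtime is implemented: `Board`, `explore`, and the scores are all hypothetical stand-ins, and `Board` is not the AgentCanvas API.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Board:
    """Stand-in for any shared surface. Hypothetical, not the AgentCanvas API."""

    notes: list[str] = field(default_factory=list)

    def pin(self, note: str) -> None:
        self.notes.append(note)


def explore(scores: dict[str, float], board: Optional[Board] = None) -> str:
    """Toy subagent: weigh each approach, return one line to the parent."""
    scratch: list[str] = []  # lives only inside the subagent's context
    for name, score in scores.items():
        note = f"{name}: scored {score:.1f}"
        scratch.append(note)
        if board is not None:
            board.pin(note)  # a copy survives outside the subagent
    best = max(scores, key=lambda name: scores[name])
    return f"{best} looks best"  # scratch is dropped on return


scores = {"approach 1": 0.4, "approach 2": 0.6, "approach 3": 0.9}

summary = explore(scores)  # parent gets the verdict, no evidence
board = Board()
explore(scores, board)     # same verdict, plus three pinned notes

print(summary)
print(board.notes)
```

The return value is identical in both calls. The only difference is whether the intermediate notes exist anywhere once the subagent has finished.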

Make the cheap lane inspectable

The move is to point every cheap subagent at the same AgentCanvas board. Instead of returning a summary, the subagent calls create_html_asset to pin its drafts, create_image_asset to attach sketches, and append_html to stream its reasoning as it goes.
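
For a sense of what those calls look like on the wire, here is a minimal sketch of the MCP `tools/call` requests a client would send on the subagent's behalf. The JSON-RPC envelope follows the MCP specification and the tool names come from the paragraph above, but the argument names (`title`, `html`, `asset_id`) are assumptions for illustration. Check the actual AgentCanvas tool schemas before relying on them.

```python
import json
from itertools import count

_ids = count(1)  # JSON-RPC request ids


def tool_call(name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request (JSON-RPC 2.0)."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# Argument names below are hypothetical, not taken from AgentCanvas docs.
pin_draft = tool_call(
    "create_html_asset",
    {"title": "Landing page, variant A", "html": "<h1>Ship faster</h1>"},
)
stream_note = tool_call(
    "append_html",
    {"asset_id": "notes", "html": "<p>Variant A leads with speed.</p>"},
)

print(json.dumps(pin_draft, indent=2))
print(json.dumps(stream_note, indent=2))
```

In practice the agent runtime builds these requests when the model emits a tool call, so the subagent's prompt only needs to tell it which tools to use and when.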

Now the economics work the way they are supposed to:

A DeepSeek subagent drafts three landing-page variants and pins each as an HTML asset. You see all three. Cost: a few cents.
A GLM subagent sketches an architecture and attaches the diagram. You see the diagram, not a description of it. Cost: effectively free.
A Kimi subagent explores a refactor and streams its notes live. You watch it think. Cost: negligible.

The subagent still runs in its own context window, so your main agent's context stays clean. The difference is that the output is on a board instead of trapped in a transcript. When the work is visible, cheap subagents stop being a gamble and start being a pipeline.

When to use the cheap lane

Not every task belongs on a cheap model. The pattern that works:

Drafts and exploration - cheap. Spin up three DeepSeek subagents, each exploring a different direction, all writing to the same board. Pick the winner.
Final implementation and review - expensive. Use Claude Code or Codex for the work that ships. The cost-quality tradeoff for frontier coding is covered in the Fable 5 vs DeepSeek V4 cost-quality breakdown.
Sketched artifacts - cheap. Let a cheap model produce the first pass of a doc, a diagram, or a slide. Promote it to a frontier model only if the first pass is not good enough.

The decision is not really about which model is best. It is about which model is cheap enough that you can run it speculatively without flinching. For the budget end, the DeepSeek V4 budget coding agents guide and the GLM-5.2 cost math walk through the numbers.
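
The pattern above can be written down as a small routing rule. This is one possible encoding under stated assumptions: the task labels, the `ships_directly` flag, and the frontier-by-default fallback are all illustrative choices, not part of any tool.

```python
from enum import Enum


class Lane(Enum):
    CHEAP = "cheap"
    FRONTIER = "frontier"


# Task kinds taken from the pattern above; the exact labels are illustrative.
CHEAP_KINDS = {"draft", "exploration", "sketch"}
FRONTIER_KINDS = {"implementation", "review"}


def pick_lane(kind: str, ships_directly: bool = False) -> Lane:
    """Route speculative work to the cheap lane and shipping work to frontier."""
    if ships_directly or kind in FRONTIER_KINDS:
        return Lane.FRONTIER
    if kind in CHEAP_KINDS:
        return Lane.CHEAP
    # Unknown kinds fall back to the frontier lane: an assumption that
    # errs toward quality rather than savings.
    return Lane.FRONTIER


for kind in ("draft", "sketch", "review"):
    print(kind, "->", pick_lane(kind).value)
```

Promoting a weak first pass is then just a second call with the same task routed to the frontier lane.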

Why a board beats a folder

You could argue the same thing is achievable by having subagents write files to a directory, and for storage it is. The difference is layout: a directory is a flat list and a canvas is a spatial arrangement. When three subagents each produce two drafts, a directory gives you six files with no visible relationship. A canvas gives you three columns, each with its drafts stacked, and you can see at a glance which lane is winning.

That spatial structure is the whole point of AgentCanvas. It is what turns cheap speculative subagents from a pile of files into a reviewable workspace.
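
The contrast is easy to see in data-structure terms. The sketch below models the same six drafts as a flat listing and as per-lane columns. The lane and file names are made up, and this says nothing about how AgentCanvas stores a board internally.

```python
from collections import defaultdict

# Three lanes, two drafts each. Lane and file names are made up.
drafts = [
    ("lane-1", "draft-a.html"),
    ("lane-1", "draft-b.html"),
    ("lane-2", "draft-a.html"),
    ("lane-2", "draft-b.html"),
    ("lane-3", "draft-a.html"),
    ("lane-3", "draft-b.html"),
]


def as_folder(items: list[tuple[str, str]]) -> list[str]:
    """Flat listing: every draft is just another file name."""
    return sorted(f"{lane}_{name}" for lane, name in items)


def as_columns(items: list[tuple[str, str]]) -> dict[str, list[str]]:
    """Board-style layout: one column per lane, drafts stacked in order."""
    columns: dict[str, list[str]] = defaultdict(list)
    for lane, name in items:
        columns[lane].append(name)
    return dict(columns)


print(as_folder(drafts))
for lane, stack in as_columns(drafts).items():
    print(lane, "|", " / ".join(stack))
```

A naming convention can recover the grouping from the flat list, but the reviewer has to do that reconstruction by eye. The column form carries it explicitly.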

FAQ

What is a cheap subagent?

A subagent running on a low-cost model like DeepSeek V4 Flash, GLM-5.2, or Kimi, used for drafts, exploration, and speculative work where the cost is low enough to run several in parallel.

Why do cheap subagents need visibility?

Because their value is in the exploration, not the summary. Subagents return only a single text result to the parent, so the drafts and sketches they produced are lost unless they are written somewhere persistent.

How does AgentCanvas help?

It gives subagents MCP tools to pin HTML docs, images, and video to a shared board. The subagent's full output stays visible to humans and to other agents instead of being discarded with the subagent's context window.

Does this work with Claude Code subagents?

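Yes. A Claude Code subagent inherits the parent's tools, MCP tools included, when its tools field is omitted, so it can call the AgentCanvas tools to write its work to the board. If you restrict a subagent to an explicit tool list, include the AgentCanvas tools in that list.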

When should I not use a cheap subagent?

For final implementation, security review, and anything that ships directly. Use cheap subagents for the speculative first passes and frontier models for the work that has to be right.
