Prompt Caching - Claude Code
Automatic reuse of cached context for substantial cost reduction.
Prompt caching reuses the large, stable parts of your prompt across turns so you don't pay full price to process them again every time.
What it does
Claude Code marks static context - system prompts, CLAUDE.md, loaded files - as cacheable. Subsequent turns that reuse the same prefix pay a fraction of the normal per-token cost. This is why per-turn cost in long sessions doesn't grow linearly as context accumulates.
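To make the arithmetic concrete, here is a rough cost sketch. The multipliers are assumptions based on typical published caching ratios (cache writes around 1.25x base input price, cache reads around 0.1x), not Claude Code's exact numbers - check the model config doc for real pricing:

```python
# Hypothetical cost comparison for a session that re-sends a large,
# stable prefix every turn. All prices and multipliers are assumed
# for illustration, not actual Claude Code pricing.

BASE = 3.00 / 1_000_000   # assumed $ per input token
WRITE_MULT = 1.25         # first turn: writing the prefix into the cache
READ_MULT = 0.10          # later turns: reading the cached prefix

def session_cost(prefix_tokens: int, turns: int, cached: bool) -> float:
    """Cost of re-sending the same prefix across `turns` turns."""
    if not cached:
        return turns * prefix_tokens * BASE
    first = prefix_tokens * BASE * WRITE_MULT          # cache write
    rest = (turns - 1) * prefix_tokens * BASE * READ_MULT  # cache hits
    return first + rest

uncached = session_cost(200_000, 10, cached=False)
cached = session_cost(200_000, 10, cached=True)
print(f"uncached: ${uncached:.2f}, cached: ${cached:.2f}")
```

Under these assumed ratios, the cached session costs roughly a fifth of the uncached one, and the gap widens with every additional turn.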
When to use it
- Any session with meaningful CLAUDE.md or rule files - caching is already on by default.
- Heavy repos where large file reads recur turn after turn.
- Long debugging sessions where you want predictable costs.
- API-integrated workflows where per-turn cost matters.
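For the API-integrated case, this is roughly what marking a cacheable prefix looks like at the request level, assuming the Anthropic Messages API's `cache_control` content-block field. Claude Code handles this for you; the sketch only shows the shape of the payload, and the model name is a placeholder:

```python
# Sketch: constructing a Messages API payload with a cacheable system
# prefix. Assumes the `cache_control` / "ephemeral" content-block field
# from the Anthropic API; no request is actually sent here.

def build_request(system_text: str, user_text: str) -> dict:
    return {
        "model": "claude-sonnet-4-5",  # placeholder model name
        "max_tokens": 1024,
        "system": [
            {
                "type": "text",
                "text": system_text,  # stable prefix: rules, CLAUDE.md, etc.
                # Marks the end of the cacheable prefix:
                "cache_control": {"type": "ephemeral"},
            }
        ],
        "messages": [{"role": "user", "content": user_text}],
    }

req = build_request("You are a careful coding agent.", "Explain this stack trace.")
```

Everything up to and including the block carrying `cache_control` is treated as the reusable prefix; the per-turn user message stays outside it.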
Gotchas
- Cache hits require the prefix to be byte-identical. Small CLAUDE.md edits invalidate the cache.
- Cached entries expire - after a long enough gap between turns, the next turn pays full price again.
- Caching is configured per model. Check the model config doc if your numbers look off.
Official docs: https://code.claude.com/docs/en/model-config.md#prompt-caching-configuration