
TL;DR
The andrej-karpathy-skills repo exploded because every coding agent needs behavioral rails. The useful move is not copying it blindly, but turning the rules into repo-specific operating constraints.
The most interesting developer-tool signal this week is not a new model. It is a plain instruction file.
The GitHub repo forrestchang/andrej-karpathy-skills packages a CLAUDE.md, Cursor rule, and Claude Code plugin around four coding-agent principles inspired by Andrej Karpathy's public comments on LLM coding failure modes. As of this run, GitHub shows about 108k stars and 10.7k forks.
That is wild for a repo whose core artifact is basically a behavioral checklist.
It is also the right kind of wild. The repo went viral because teams have discovered the same thing at the same time: coding agents do not only need better models. They need better operating constraints.
If you are new to this layer, start with how to write a CLAUDE.md file and why skills beat prompts for coding agents. This post is the next step: how to interpret a viral rules file without letting it become another bloated prompt dump.
The useful part is short. The CLAUDE.md file centers on four principles:

- Think before coding: surface assumptions instead of silently picking an interpretation.
- Stay simple: no abstractions the current task does not need.
- Make surgical changes: touch only what the task requires.
- Verify your work: prove the behavior, do not just produce a diff.
The repo's README maps those principles to common agent failures: hidden assumptions, overbuilt abstractions, unrelated edits, and vague "make it work" loops. The full file is only about 65 lines, which is part of why it spread. Developers can understand it, copy it, and argue with it in one sitting.
That last part matters. Good agent instructions are not sacred text. They are editable work rules.
Most agent failures are not dramatic model failures. They are small workflow failures repeated quickly.
The agent silently picks one interpretation of an ambiguous task. It writes a flexible abstraction for a one-off requirement. It "cleans up" adjacent code and creates a regression. It says something is done because the diff exists, not because the behavior was verified.
That is why a repo like this can become a trending event. It names the boring failure modes that show up in real diffs. The same issue shows up in the agent reliability cliff: the demo looks fine, then the production loop collapses because assumptions, tests, and ownership were never made explicit.
The opposing view is worth taking seriously too. A Reddit thread around the repo had a good skeptical read: the star count may say more about copy-pasteability and Karpathy name value than measured capability. Another commenter framed it as a menu rather than a template, which is the right mental model.
Stars prove demand. They do not prove effectiveness in your repo.
The fastest way to misuse this repo is to append the whole thing to every project and call it done.
Generic rules are helpful until they conflict with local reality. "Surgical changes" means something different in a package migration, a design-system cleanup, a schema refactor, and a one-line bug fix. "Ask when uncertain" is right for product ambiguity, but it is wasteful when the codebase already has a clear pattern the agent can inspect.
This is where Claude Code skills and CLAUDE.md should work together:
CLAUDE.md should hold the global rules every session needs. Skills should hold the procedures that only some tasks need. For the hook layer, see Claude Code hooks explained. The short version: if a rule can be checked automatically, do not leave it as vibes in a markdown file.
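As a sketch of that hook layer: Claude Code hooks live in `.claude/settings.json` and can run a command after the agent edits a file. The exact schema is in Anthropic's hooks docs; the matcher and command below are illustrative placeholders for your own checks:

```json
{
  "hooks": {
    "PostToolUse": [
      {
        "matcher": "Edit|Write",
        "hooks": [
          {
            "type": "command",
            "command": "npm run lint && npm test"
          }
        ]
      }
    ]
  }
}
```

A failing command surfaces back to the agent, which turns "verify your work" from a sentence in a markdown file into a gate the agent cannot skip.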
Here is the practical translation.
Do not write: "Be simple."

Write: "Do not add a new abstraction unless it removes duplication in at least two call sites or matches an existing pattern in this repo."

Do not write: "Make surgical changes."

Write: "When editing an existing route, only touch the files required for that route unless a failing test proves shared code must change."

Do not write: "Verify your work."

Write: "For UI changes, run the app locally, capture desktop and mobile screenshots, and mention any viewport you did not verify."
That is the difference between a motivational instruction and an operating constraint. The first one sounds correct. The second one changes behavior.
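Pulled together, a repo-specific section of a CLAUDE.md might read like this. The paths and commands are placeholders for your own project, not the viral repo's content:

```markdown
## Operating constraints

- If the task is ambiguous about data shape or UX, ask before coding.
- Do not add a new abstraction unless it removes duplication in at
  least two call sites or matches an existing pattern in this repo.
- Route edits touch only files under src/routes/ unless a failing
  test proves shared code must change.
- Before claiming done: run `npm test` and report the summary line.
```

Every line is checkable against a diff, which is the test a rule has to pass to earn its place in the file.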
The lesson from this repo is not that every project needs a bigger CLAUDE.md.
It is the opposite. The best instruction files get shorter at the top and more specific at the leaves.
The global file should contain durable judgment: the few constraints that apply to every task in the repo, not step-by-step procedures.
Then task-specific skills should take over. A blog-writing skill, migration skill, review skill, release skill, or browser-QA skill can include the exact workflow for that slice without forcing every session to carry every rule.
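A minimal skill is just a directory with a SKILL.md file. The frontmatter fields below follow Anthropic's Agent Skills format; the workflow content is an illustrative sketch, not a prescription:

```markdown
---
name: release
description: Cut a release - bump version, update changelog, tag, verify CI
---

1. Run `npm test` and stop if anything fails.
2. Bump the version in package.json following semver.
3. Update CHANGELOG.md from merged PR titles.
4. Tag the release and push; confirm the release workflow goes green.
```

The main session never carries these steps; they load only when the release skill is invoked, which keeps the global file short.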
That is also why agent teams and subagents are becoming more important. The main agent should not need every procedure in its context. It should know when to delegate to a specialist with the right local instructions.
andrej-karpathy-skills is valuable because it is small, legible, and pointed at real failure modes.
It is not valuable because 108k people starred it. It is not valuable because a famous name is adjacent to the idea. It is valuable because it gives developers a shared vocabulary for the behavior they already wanted from coding agents: think first, stay simple, touch less, verify more.
The best move is to steal the shape, not the file.
Copy the four categories into your own repo. Delete anything that does not apply. Add concrete commands, file paths, test gates, and design constraints. Split repeated procedures into skills. Put mechanical checks into hooks. Then review the agent's diff and ask the only question that matters:
Did these instructions make the work smaller, clearer, and easier to verify?
If yes, keep them. If not, rewrite them. Agent instructions are code-adjacent infrastructure now. Treat them like something that has to earn its place in the repo.