TL;DR
A four-principle CLAUDE.md distilling Andrej Karpathy's observations on LLM coding mistakes gained 44k stars in a single week - and is now installable as a Claude Code plugin in two commands.
Read next
Matt Pocock's skills repository gained nearly 35,000 GitHub stars in a single week by addressing the four concrete failure modes that make AI-assisted coding frustrating in practice.
6 min readA single CLAUDE.md file inspired by Andrej Karpathy's observations about LLM coding mistakes gained 24,000 GitHub stars in one week. Here is what it teaches and why it resonated.
5 min readAnthropic just shipped an official curated plugin directory for Claude Code. It earned 2,500+ stars in a single day and changes how you extend your AI coding workflow.
5 min readAt 70,300 stars and growing, forrestchang/andrej-karpathy-skills is the most-starred repository on GitHub trending this week, gaining 44,394 stars in seven days. That number is striking for a project containing a single instruction file and a few supporting docs. No flashy framework. No new model. Just four precisely-worded principles that tell Claude exactly what it keeps getting wrong.
The repo distills a specific observation from Andrej Karpathy - the former Tesla and OpenAI researcher whose commentary on LLM behavior carries weight in the AI developer community. His core critique: models make unfounded assumptions, over-engineer solutions, and silently modify code they do not fully understand. This repo attempts to encode the fix in a format Claude Code can act on directly.
The heart of the project is a CLAUDE.md file built around four behavioral guidelines.
Think Before Coding - Claude is instructed to surface assumptions explicitly before writing a line of code. If a request is ambiguous, the model asks for clarification rather than guessing. A request like "add a user export feature" gets met with questions about scope (all users or filtered?), format, fields, and expected volume before any implementation begins.
Simplicity First - The guideline is direct: "Minimum code that solves the problem. Nothing speculative." This targets a well-known failure mode where models add caching layers, abstract base classes, and validation logic that nobody asked for. If the implementation grows unreasonably long, the instruction says to stop and rewrite.
Surgical Changes - When modifying existing code, Claude should touch only what the task requires. The rule is explicit about pre-existing dead code: do not remove it unless your own changes orphaned it. Match the existing code style and do not refactor adjacent functions as a bonus.
Goal-Driven Execution - Rather than working from vague instructions, Claude transforms each request into verifiable success criteria with clear checkpoints. "Review and improve this function" becomes "write a test reproducing the bug, implement the fix, verify the test passes, check for regressions." Each step can be independently verified.
The project also ships a CURSOR.md and a .cursor/rules/karpathy-guidelines.mdc for Cursor users, plus an EXAMPLES.md demonstrating how each principle changes behavior in concrete coding scenarios. The license is MIT. There are 28 commits on main, 6,400 forks, and 374 watchers as of this writing.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Apr 20, 2026 • 6 min read
Apr 19, 2026 • 13 min read
Apr 19, 2026 • 8 min read
Apr 19, 2026 • 9 min read
There are two paths to using these guidelines.
Claude Code Plugin (recommended):
/plugin marketplace add forrestchang/andrej-karpathy-skills
/plugin install andrej-karpathy-skills@karpathy-skills
That adds the guidelines as a plugin that applies across your Claude Code sessions without requiring a project-level file.
Per-project CLAUDE.md:
Download the file and append it to your existing CLAUDE.md:
curl -o karpathy-guidelines.md \
https://raw.githubusercontent.com/forrestchang/andrej-karpathy-skills/main/CLAUDE.md
cat karpathy-guidelines.md >> CLAUDE.md
This approach works well if you want to combine the guidelines with project-specific context - which is the more common setup for teams working on production codebases.
Cursor users can copy CURSOR.md to their project root, or place .cursor/rules/karpathy-guidelines.mdc in the repo for rules-based activation.
If you have spent meaningful time with Claude Code, you have almost certainly hit all four failure modes this file addresses. The assumption problem is especially costly - an incorrect silent assumption at the start of a session can send a long agentic task down the wrong path entirely, and debugging that is expensive in both time and token budget.
The simplicity guideline matters most for developers working on production codebases rather than greenfield projects. When Claude touches existing code, speculative additions create noise in diffs, inflate pull request reviews, and sometimes introduce bugs in code paths that were previously stable.
Surgical changes is the rule most likely to reduce friction when doing partial updates - fixing a single function in a large file, patching a query, or adjusting an API endpoint response shape. Without this constraint, Claude has a tendency to clean up surrounding code as a side effect, which is not always welcome.
Goal-driven execution is the most useful rule for agentic workflows where Claude runs multiple steps without interruption. Success criteria give the model a natural stopping condition and a way to verify its own work before reporting back.
These guidelines are worth the two-command install for any developer using Claude Code regularly. The cost is essentially zero.
This project sits squarely in the territory that skills.developersdigest.tech tracks - Claude Code skills, plugins, and behavioral configurations that extend what the tool can do. The karpathy-skills plugin is a lightweight, high-leverage configuration in the same category as community-built plugins for code review, deployment automation, and session context management.
The four principles map well to the workflow patterns covered throughout the DevDigest Claude Code content. The surgical changes principle connects directly to the agent loop patterns discussed in the Claude Code sub-agents and worktrees posts - agentic runs where Claude modifies multiple files in a single session are exactly where uncontrolled edits cause the most trouble.
The CLAUDE.md format itself is central to how many teams manage Claude Code behavior across sessions. These karpathy guidelines are a good complement to project-level context files because they address model-level behavioral patterns rather than project-specific rules. Combining them with automation configured at hooks.developersdigest.tech gives you behavioral tuning at the instruction level and automated guardrails at the execution level - two different layers working together.
For developers building with Claude Code extensively, this is the kind of low-effort, high-signal configuration that compounds over time.
The four principles are well-observed and the CLAUDE.md format is effective. The project earned its star count.
The limitations are worth noting. These guidelines work best as a starting point, not a finished solution. Teams with complex projects will likely need to extend them - the simplicity rule in particular requires calibration. Some abstractions are genuinely necessary, and a blanket instruction to avoid them can create friction when Claude correctly identifies a pattern that warrants one.
The "think before coding" principle adds overhead for trivial tasks. The CLAUDE.md itself acknowledges this, noting that judgment should be applied and caution weighted toward complex work.
The Claude Code plugin ecosystem is also still maturing. Plugin installation commands and behavior may shift as the platform evolves. Test the install in a clean session before rolling it out to a team.
None of these limitations undercut the core value. For developers who have been burned by silent assumptions or spec-creep from an agentic run, this file is a practical and immediate fix.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolAnthropic's flagship reasoning model. Best-in-class for coding, long-context analysis, and agentic workflows. 1M token c...
View ToolHigh-performance code editor built in Rust with native AI integration. Sub-millisecond input latency. Built-in assistant...
View ToolInteractive TUI dashboard that shows exactly where your Claude Code and Cursor tokens are going, in real time.
View ToolEvery coding agent in one window. Stop alt-tabbing between Claude, Codex, and Cursor.
View AppTurn a one-liner into a working Claude Code skill. From idea to installed in a minute.
View AppUnlock pro skills and share private collections with your team.
View AppClickable PR link in the footer with review state color coding.
Claude CodeA concrete step-by-step guide to moving your development workflow from Cursor to Claude Code - settings, rules, keybindings, and the habits that transfer.
Getting StartedPath-specific rules that only load for matching files.
Claude Code
Nimbalyst Demo: A Visual Workspace for Codex + Claude Code with Kanban, Plans, and AI Commits Try it: https://nimbalyst.com/ Star Repo Here: https://github.com/Nimbalyst/nimbalyst This video demos N...

Composio: Connect AI Agents to 1,000+ Apps via CLI (Gmail, Google Docs/Sheets, Hacker News Workflows) Check out Composio here: http://dashboard.composio.dev/?utm_source=Youtube&utm_channel=0426&utm_...

Anthropic has released Channels for Claude Code, enabling external events (CI alerts, production errors, PR comments, Discord/Telegram messages, webhooks, cron jobs, logs, and monitoring signals) to b...
Anthropic just shipped an official curated plugin directory for Claude Code. It earned 2,500+ stars in a single day and...
CodeGraph builds a local SQLite index of your codebase so Claude Code, Cursor, and Codex CLI spend far fewer tokens expl...
CodeGraph hit 7,800+ stars with 1,900 added in a single day - a local MCP knowledge graph that lets Claude Code explore...
agentmemory is a self-hosted MCP server that gives Claude Code, Cursor, and Gemini CLI searchable long-term memory acros...
agentmemory gives AI coding agents a persistent brain - capturing session context automatically via 12 Claude Code hooks...
Ruflo is an open-source multi-agent orchestration platform built specifically for Claude, shipping 100+ specialized agen...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.