TL;DR
zilliztech/claude-context landed on GitHub's daily trending list with 873 new stars today - here's what this Claude Code MCP actually does and whether it's worth the setup.
Read next
zilliztech/claude-context is an MCP server that indexes your entire codebase with hybrid vector search, letting Claude Code find relevant code without loading whole directories. It hit 8.8k stars and is trending on both daily and weekly GitHub charts.
6 min readCodeGraph builds a local SQLite index of your codebase so Claude Code, Cursor, and Codex CLI spend far fewer tokens exploring files - trending on GitHub with 12k stars and real benchmark numbers.
6 min readRuflo crossed 37,700 GitHub stars this week, adding nearly 1,900 in a single day. It turns Claude Code into a coordinated swarm of 100+ specialized agents with MCP integration, distributed vector memory, and zero-trust agent federation.
7 min readzilliztech/claude-context picked up 873 stars in a single day, landing it at the top of GitHub's daily trending list with a total of over 7,100 stars. For a tool with a narrow, specific purpose - semantic code search inside Claude Code - that kind of velocity signals genuine developer demand, not just hype.
The timing makes sense. Claude Code has become the go-to AI coding environment for a lot of engineers, and one of its persistent frustrations is context: large repos either overwhelm the context window or require constant manual file-opening to point the model at the right code. claude-context is a direct answer to that friction. It ships as an MCP server that lets Claude Code search your codebase semantically rather than loading it wholesale.
claude-context is an MCP (Model Context Protocol) plugin for Claude Code. It indexes your repository using hybrid search - combining BM25 (keyword-based) and dense vector retrieval - and exposes four MCP tools the model can call directly:
index_codebase - crawls a directory and builds the search indexsearch_code - runs natural language queries against that indexget_indexing_status - checks progress on large reposclear_index - removes an index when you no longer need itThe indexing pipeline uses Abstract Syntax Trees to chunk code intelligently rather than splitting by line count. That matters because naive chunking can cut a function in half, making retrieved snippets useless. By understanding code structure - functions, classes, methods - the chunker preserves semantic units.
For vector storage, the project integrates with Milvus (self-hosted) or Zilliz Cloud (their hosted product). Embeddings can come from OpenAI, VoyageAI, Ollama, or Gemini, giving you some flexibility on that dependency. Supported languages cover the common bases: TypeScript, JavaScript, Python, Go, Rust, Java, C++, C#, Ruby, Swift, Kotlin, Scala, and Markdown.
The headline metric from the README: roughly 40% token reduction compared to loading full directories into context, at equivalent retrieval quality. That's a meaningful number if accurate - it translates directly to lower API costs and faster responses on large repos.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Apr 22, 2026 • 8 min read
Apr 22, 2026 • 8 min read
Apr 22, 2026 • 7 min read
Apr 22, 2026 • 10 min read
You will need Node.js 20 or later (the README notes it is not compatible with Node 24+), an OpenAI API key for embeddings, and a Zilliz Cloud account with an API key and public endpoint.
Once those are in place, add the MCP server to Claude Code with a single command:
claude mcp add claude-context \
-e OPENAI_API_KEY=sk-your-openai-api-key \
-e MILVUS_ADDRESS=your-zilliz-cloud-public-endpoint \
-e MILVUS_TOKEN=your-zilliz-cloud-api-key \
-- npx @zilliz/claude-context-mcp@latest
After that, open Claude Code in your project directory:
cd your-project-directory && claude
Then in the conversation, tell it to index:
Index this codebase
Once indexing completes, you can run natural language queries:
Find functions that handle user authentication
Claude Code will call search_code, retrieve relevant code chunks, and use them as context rather than asking you to open files manually. The incremental indexing backed by Merkle trees means subsequent runs only re-process changed files, so the upfront cost is paid once.
The primary target is engineers working on large, unfamiliar codebases - open-source contributors jumping into a new project, developers onboarding to a large enterprise repo, or anyone maintaining a multi-service monorepo where finding the right file is itself a task.
It also helps with the opposite problem: developers who know their codebase well but are tired of the ritual of manually surfacing files to Claude before asking questions. With claude-context, the retrieval step is automatic.
Teams that already pay for OpenAI embeddings and have a Zilliz Cloud tier will see the least friction. The "install and go" experience requires both external services to be configured, so the true zero-cost self-hosted path - Ollama for embeddings plus local Milvus - takes more work but is fully supported.
If you are running a small project under a few thousand files, the value proposition is weaker. Claude Code's built-in file handling is often sufficient at that scale. This tool earns its complexity at medium-to-large repo sizes where context window pressure is a real constraint.
MCP servers are a core part of what DevDigest covers, and claude-context is a concrete example of the MCP pattern maturing beyond simple tool integrations. It is not exposing an API or wrapping a web service - it is augmenting how the model perceives a local environment, which is the more interesting class of MCP use case.
If you are exploring the MCP landscape, the curated list at mcp.developersdigest.tech is a good companion to this post. It tracks production-quality MCP servers across categories, and semantic code search is a category that is filling in fast.
There is also a natural overlap with the hooks and skills patterns covered here. An index-on-save hook - triggering index_codebase automatically when files change - is a reasonable extension of this tool's value. The manual "index this codebase" step is the current friction point, and a Claude Code hook or a file-watcher script could eliminate it.
The broader trend claude-context represents: context is becoming a first-class engineering problem in AI-native development. Tools that manage, compress, or intelligently retrieve context are going to be a persistent category.
The 40% token reduction claim is compelling but comes with a caveat: it holds under equivalent retrieval quality. Retrieval quality is not guaranteed - it depends on query phrasing, index freshness, and whether the embedding model understands your codebase's domain vocabulary. You will get mileage out of it proportional to how well you write queries.
The dependency chain is real. You need OpenAI (or an alternative) for embeddings and Zilliz/Milvus for storage. That is two external services before you see any value. For individual developers, the Zilliz Cloud free tier may cover light use, but teams should budget for it.
Node version compatibility is a genuine rough edge - requiring Node 20-23 and explicitly breaking on 24 is the kind of constraint that creates silent failures on developer machines. Check your Node version before troubleshooting.
On the upside: MIT license, active maintenance signals from the trending velocity, and a VS Code extension alongside the MCP server suggest this is a project with real investment behind it. The architecture - separating the core indexing engine, the VS Code integration, and the MCP server into distinct packages - is clean and suggests the internals are worth building on.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolInteractive TUI dashboard that shows exactly where your Claude Code and Cursor tokens are going, in real time.
View ToolAI coding assistant with deep codebase context. Indexes your entire repo graph for accurate answers. VS Code and JetBrai...
View ToolAI-powered context manager that remembers your code, snippets, links, and project context across IDEs, browsers, and ter...
View ToolUnlock pro skills and share private collections with your team.
View AppCatch broken SKILL.md files in CI before they hit your team.
View AppEvery coding agent in one window. Stop alt-tabbing between Claude, Codex, and Cursor.
View AppA concrete step-by-step guide to moving your development workflow from Cursor to Claude Code - settings, rules, keybindings, and the habits that transfer.
Getting StartedAuto-memory that persists across multiple subagent invocations.
Claude CodeDeferred tool loading reduces context overhead for large MCP suites.
Claude Code
Nimbalyst Demo: A Visual Workspace for Codex + Claude Code with Kanban, Plans, and AI Commits Try it: https://nimbalyst.com/ Star Repo Here: https://github.com/Nimbalyst/nimbalyst This video demos N...

Composio: Connect AI Agents to 1,000+ Apps via CLI (Gmail, Google Docs/Sheets, Hacker News Workflows) Check out Composio here: http://dashboard.composio.dev/?utm_source=Youtube&utm_channel=0426&utm_...

Anthropic has released Channels for Claude Code, enabling external events (CI alerts, production errors, PR comments, Discord/Telegram messages, webhooks, cron jobs, logs, and monitoring signals) to b...
CodeGraph builds a local SQLite index of your codebase so Claude Code, Cursor, and Codex CLI spend far fewer tokens expl...
CodeGraph hit 7,800+ stars with 1,900 added in a single day - a local MCP knowledge graph that lets Claude Code explore...
agentmemory is a self-hosted MCP server that gives Claude Code, Cursor, and Gemini CLI searchable long-term memory acros...
agentmemory gives AI coding agents a persistent brain - capturing session context automatically via 12 Claude Code hooks...
Ruflo crossed 37,700 GitHub stars this week, adding nearly 1,900 in a single day. It turns Claude Code into a coordinate...
zilliztech/claude-context is an MCP server that indexes your entire codebase with hybrid vector search, letting Claude C...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.