deepseek-chat is deprecated and disappears July 24, 2026 - here is how to migrate to V4 Flash or Pro, with verified pricing, thinking-mode mapping, and a step-by-step checklist.
DeepSeek's open-weights frontier family, previewed April 24, 2026. V4-Pro is 1.6T total / 49B active params; V4-Flash is 284B / 13B. 1M context standard. Weights on Hugging Face.
Open-source AI coding agent for terminal, desktop, and IDE. Works with 75+ LLM providers including Claude, GPT, Gemini, and local models. Runs parallel sessions on one project.
Open-source AI agent built in Rust, now governed by the Agentic AI Foundation at the Linux Foundation. Desktop app, CLI, and API. 15+ model providers, 70+ MCP extensions.
Open-source AI gateway: call 100+ LLM providers in the OpenAI format via a Python SDK or proxy. Spend tracking, budgets, rate limiting, load balancing, and fallbacks built in.
Open-source LLM engineering platform: tracing, evals, prompt management, and datasets. Self-hostable, OpenTelemetry-native, with 50+ framework integrations.
Open-source cloud sandboxes for AI agents. Isolated environments that start in under 200ms, run code in Python, JavaScript, and more, and persist sessions up to 24 hours.
Google released DiffusionGemma today, a 26B MoE open model that generates entire 256-token blocks in parallel instead of one token at a time. Here is what that means for latency, local inference, and the post-autoregressive landscape.
A hands-on look at Mastra, the open-source TypeScript framework for building production-ready AI agents and workflows, with verified setup commands, honest tradeoffs, and current pricing.
The rsync Claude debate shows why teams need reproducible defect forensics before AI attribution becomes a public blame machine.
Forge hit the Hacker News front page with a strong claim: small local models can become much more useful at tool-calling when the harness catches structural failures, retries intelligently, and controls context.
Cline is a free, open-source VS Code extension that brings autonomous AI coding to your editor. It works with local models or cloud APIs, handles multi-file changes, and runs terminal commands without proprietary lock-in.
The trending Free Claude Code repo is not just about avoiding API bills. It points at a bigger developer-tool pattern: model gateways for AI coding agents.
jcode is trending because it competes on a less glamorous but important agent metric: how cheap it is to keep many coding sessions alive.
Microsoft's lib0xc landed on Hacker News with a practical message: safer systems code often means better C APIs, warnings, bounds checks, and incremental adoption, not a heroic rewrite.
Hugging Face's ml-intern is trending because it narrows the agent loop around one domain: papers, datasets, model training, Hub traces, and ML shipping workflows.
Warp going open source is not just a terminal story. It is a signal that AI coding tools are shifting from chat UX toward agent operations, where planning, execution, review, and feedback loops live close to the shell.
DeepSeek V4 splits into Flash and Pro, ships a 1M context window, and undercuts every closed model on price. Here's how to wire it up with the OpenAI SDK, when to pick it over Claude or GPT, and what changed since V3 and R1.
A practical walkthrough of Nemotron 3 Super: latent mixture of experts, hybrid Mamba transformer architecture, 1M context, reasoning modes, and the code you actually need to run it on NVIDIA hardware.
The MCP ecosystem crossed 22,000 servers in early 2026. Most are noise. Here are the open-source servers that have earned a permanent slot in our config, with copy-paste setup for Claude Code, Cursor, and Codex.
