149 items
76 posts, 62 tools, 11 guides
Cloudflare Flagship is feature flags built for AI: model swaps, agent gates, and prompt rollouts as first-class primitives. Here is how to use it without rebuilding your control plane.
A hands-on developer guide to Mercury 2 from Inception Labs. OpenAI-compatible API, reasoning levels, tool use, structured outputs, and when a diffusion LLM beats an autoregressive one in real apps.
SAM 3.1 finally hits the latency budget for realtime video. Here is how to wire Meta's new segmentation model into a production pipeline without melting your GPU.
Vercel just declared the agent stack: AI Gateway, Sandbox, Flags, and Microfrontends. Here is how the four primitives compose, with code, and where each one actually fits in a real product.
Python's de facto data validation library. Type-hint-driven models, fast Rust-based core (v2), and the foundation of FastAPI, LangChain, and most Python AI tooling.
A repo-root DESIGN.md gives Claude Code, Codex, and other agents the design rules they need to honor so generated UI does not drift into generic territory.
Anthropic's smallest Claude 4.5 model. Near-frontier coding performance at one-third the cost of Sonnet 4 and up to 4-5x faster than Sonnet 4.5. $1/$5 per million tokens.
Alibaba's flagship open-weight coding model. 480B total parameters, 35B active (MoE). Native 256K context, scales to 1M. Apache 2.0 license. State-of-the-art agentic coding.
DeepSeek's reasoning-first model built for agents. First model to integrate thinking directly into tool use. Ships alongside V3.2-Speciale, which rivals GPT-5 and Gemini 3.0 Pro.
AI meeting notes that augment your own typing. Records the call, captures what you type, and fills in the gaps with context from the transcript.
Ask quick side questions without derailing the main task.
Prevent bloating the main conversation with research or exploration.
Fires on tool execution errors for logging, alerting, and retry.
Qwen3.6-27B keeps pulling developers back because it sits in the awkward, useful middle: strong enough for real local coding tasks, small enough for serious workstation testing, and cheap enough to benchmark honestly.
Most MCP servers are noise. After shipping 24 apps with Claude Code, these are the five I reach for every time.
From Claude Code to Gladia, the ten CLIs every AI-native developer should know. Install commands, trade-offs, and when to reach for each.
One dev, one CLI, 24 subdomains, and a lot of parallel agents. The playbook for shipping an AI app portfolio.

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.