130 items
66 posts, 53 tools, 11 guides
Opus 4.7 is here. Sharper coding, longer agentic runs, better tool use, and a price that finally makes Opus livable for production. Here's everything devs need to know.
Cloudflare Flagship is feature flags built for AI: model swaps, agent gates, and prompt rollouts as first-class primitives. Here is how to use it without rebuilding your control plane.
A hands-on developer guide to Mercury 2 from Inception Labs. OpenAI-compatible API, reasoning levels, tool use, structured outputs, and when a diffusion LLM beats an autoregressive one in real apps.
SAM 3.1 finally hits the latency budget for realtime video. Here is how to wire Meta's new segmentation model into a production pipeline without melting your GPU.
Vercel just declared the agent stack: AI Gateway, Sandbox, Flags, and Microfrontends. Here is how the four primitives compose, with code, and where each one actually fits in a real product.
Python's de facto data validation library. Type-hint-driven models, fast Rust-based core (v2), and the foundation of FastAPI, LangChain, and most Python AI tooling.
A repo-root DESIGN.md gives Claude Code, Codex, and other agents the design rules they need to honor so generated UI does not drift into generic territory.
Anthropic's smallest Claude 4.5 model. Near-frontier coding performance at one-third the cost of Sonnet 4 and up to 4-5x faster than Sonnet 4.5. $1/$5 per million tokens.
Alibaba's flagship open-weight coding model. 480B total parameters, 35B active (MoE). Native 256K context, scales to 1M. Apache 2.0 license. State-of-the-art agentic coding.
DeepSeek's reasoning-first model built for agents. First model to integrate thinking directly into tool use. Ships alongside V3.2-Speciale, which rivals GPT-5 and Gemini 3.0 Pro.
AI meeting notes that augment your own typing. Records the call, captures what you type, and fills in the gaps with context from the transcript.
Ask quick side questions without derailing the main task.
Prevent bloating the main conversation with research or exploration.
Fires on tool execution errors for logging, alerting, and retry.
Alibaba's newest Qwen release claims flagship-level coding in a 27B dense model. Here is why dense matters, where it fits against the 480B MoE coder, and what it unlocks for local inference.
From Claude Code to Gladia, the ten CLIs every AI-native developer should know. Install commands, trade-offs, and when to reach for each.

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.