11 items: 1 post, 9 tools, 1 guide
Anthropic's first generally available Mythos-class model, released June 9, 2026. 1M context, 128K max output, $10/$50 per million tokens. Built for long-horizon agentic work.
A production guide to Claude's extended thinking mode. Real cost math, TypeScript SDK code, and the tasks where reasoning tokens are worth 3x the spend.
Anthropic's flagship reasoning model. Best-in-class for coding, long-context analysis, and agentic workflows. 1M-token context window. Available via API and in Claude Code.
DeepSeek's reasoning-first model built for agents. First model to integrate thinking directly into tool use. Ships alongside V3.2-Speciale, which rivals GPT-5 and Gemini 3.0 Pro.
Google's frontier model family. Gemini 2.5 Pro has a 1M-token context window and top-tier coding benchmark scores. Gemini 3 Pro pushes reasoning further. Free tier via AI Studio.
Open-source reasoning models from China. DeepSeek-R1 rivals o1 on math and code benchmarks. V3 for general use. Fully open weights. Extremely cost-effective API.
