AI MODELS

47 items

46 posts, 1 guide

BlogJun 11, 2026

Handling Long-Running Fable 5 Requests: Timeouts, Streaming, and Background Patterns

Fable 5 long-running requests can run for many minutes per turn and hours per autonomous run. Here is how to configure client timeouts, streaming keepalive, batch polling, and background patterns so they actually finish.

Anthropic AI Models Developer Tools

BlogJun 11, 2026

The Fable 5 Orchestrator Playbook: One Smart Model Managing Cheap Workers

A practical playbook for running Claude Fable 5 as the orchestrator over Sonnet and Haiku workers, with verified cost math on when the premium pays off.

AI Agents Anthropic AI Models LLMs

BlogJun 11, 2026

The Frontier Model Landscape, June 2026 Edition

A verified directory of the frontier AI models in June 2026 - Claude Fable 5, GPT-5.5, GPT-5.4, Gemini 3.1 Pro, and DeepSeek V4 - with pricing checked against official docs.

AI Models LLMs Pricing Developer Tools

BlogJun 11, 2026

How to Use Claude Fable 5: Every Access Path Explained

How to use Claude Fable 5 across every access path: claude.ai plans through June 22, the Claude API, Amazon Bedrock, Vertex AI, and Microsoft Foundry, with setup effort and first-prompt tips.

AI Models Anthropic Developer Tools

BlogJun 11, 2026

Is Claude Fable 5 Slow? Latency in Practice, and When It Matters

Claude Fable 5 latency measured: 109 seconds to first token at max effort vs 1.4s for Sonnet 4.6. When slow is fine, when it hurts, and how to route around it.

AI Models Anthropic LLMs Performance

BlogJun 11, 2026

Migrating Off Retired GPT Models in 2026: A Working Checklist

Migrating off retired GPT models in 2026: the live retirement table, what maps to what, an eval-before-switch day plan, and when to jump providers.

OpenAI AI Models LLMs Developer Tools

BlogJun 11, 2026

Qwen 3.7 Max Developer Guide: 1M Context, $1.25/MTok, and Agent-First Architecture

Alibaba shipped Qwen 3.7 Max on May 19, 2026 with a 1M token context window, Anthropic-compatible API, and agent-first architecture. Here is what developers need to know about pricing, performance, and when to use it.

Qwen Alibaba AI Models API Coding

BlogJun 11, 2026

12 Ways Developers Are Actually Leveraging Claude Fable 5

Twelve documented Claude Fable 5 use patterns - agent orchestration, overnight runs, 1M-context refactors, effort tuning - each with a how-to seed and doc link.

AI Models Anthropic AI Agents Developer Tools

BlogJun 10, 2026

Decoding Anthropic's Model Names: Fable, Mythos, and What the Naming Shift Signals

Anthropic broke its own naming ladder when it introduced the Mythos class and Claude Fable 5. Here is what the shift means, how to map each tier to a real workload, and what questions it leaves open.

anthropic claude ai-models fable-5 model-selection pricing

BlogJun 10, 2026

Apple's LanguageModel Protocol: Xcode 27 Just Made Model Lock-In Optional

Apple shipped a LanguageModel protocol at WWDC 2026 that lets iOS and macOS developers swap between Claude, Gemini, and local models with a single dependency change. Here is what OS-level provider abstraction actually means for switching costs, moats, and your architecture decisions.

apple developer-tools ai-models xcode model-abstraction wwdc

BlogJun 10, 2026

Fable 5 vs Opus 4.8: A Data-Driven Decision Guide for Engineering Teams

Fable 5 posts an 80.3% SWE-Bench Pro score and costs 2x Opus 4.8 - here is the task-profile scoring guide that tells you when the premium pays off.

AI Models Anthropic Code Review AI Agents Developer Tools LLMs

BlogJun 10, 2026

Claude Mythos 5 Explained: What It Is, Who Can Access It, and Why It's Gated

Anthropic shipped two names for one architecture on June 9, 2026. Here is what separates Fable 5 from Mythos 5, who can actually get unrestricted access, and what developers should do right now.

Anthropic Claude AI Models Cybersecurity News Analysis

BlogMay 30, 2026

The Model, IDE, CLI, and Agent Framework Changes That Actually Matter

The AI coding market is noisy. The changes that matter are easier to spot when you separate model capability, editor loops, terminal agents, background agents, agent frameworks, UI layers, context, security, and cost.

AI Coding Developer Tools AI Models AI Agents Agent Frameworks Codex Claude Code

BlogMay 23, 2026

Models.dev Makes Model Routing Feel Like Infrastructure

The models.dev project is trending because AI teams need one boring source of truth for model specs, pricing, context windows, modalities, and tool support.

AI Models Developer Tools Pricing AI SDK Infrastructure

BlogMay 2, 2026

DeepSeek V4 Changes the Coding Agent Cost Equation

DeepSeek V4 is trending because it is close enough to frontier coding models at a much lower token price. The real question for developers is where cheap reasoning belongs in an agent stack.

DeepSeek AI Coding AI Models Agents Cost Optimization

BlogApr 29, 2026

DeepSeek V4: The Developer's Guide to Flash and Pro

DeepSeek V4 splits into Flash and Pro, ships a 1M context window, and undercuts every closed model on price. Here's how to wire it up with the OpenAI SDK, when to pick it over Claude or GPT, and what changed since V3 and R1.

DeepSeek Open Source AI Models API

BlogApr 29, 2026

NVIDIA Nemotron 3 Super: A Developer's Guide to the 120B Hybrid MoE

A practical walkthrough of Nemotron 3 Super: latent mixture of experts, hybrid Mamba transformer architecture, 1M context, reasoning modes, and the code you actually need to run it on NVIDIA hardware.

NVIDIA Nemotron MoE Mamba Open Source AI Models Triton Transformers

GuideApr 9, 2026

Run AI Models Locally with Ollama and LM Studio

Install Ollama and LM Studio, pull your first model, and run AI locally for coding, chat, and automation - with zero cloud dependency.

BlogApr 2, 2026

Claude Haiku 4.5: Near-Frontier Intelligence at a Fraction of the Cost

Anthropic's Claude Haiku 4.5 delivers Sonnet 4-level coding performance at one-third the cost and twice the speed. Here is what developers need to know.

Claude Anthropic AI Models

BlogMar 26, 2026

DeepSeek R1 and V3: The Developer's Guide to Open-Source AI

DeepSeek's R1 and V3 models deliver frontier-level performance under an MIT license. Here's how to use them through the API, run them locally with Ollama, and decide when they beat closed-source alternatives.

DeepSeek Open Source AI Models Local AI

PreviousPage 2 of 3Next

Get Smarter About AI Dev

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.

One email per weekReal code, not theoryFree forever

Browse All Tags