API

18 items

12 posts, 5 tools, 1 guide

Blog · Jun 30, 2026

Gemini 3.5 Pro Developer Guide: 2M Context Window and Deep Think Mode

Google's Gemini 3.5 Pro arrives with a 2-million-token context window and Deep Think reasoning mode. Here is how to access it, what it costs, and when the massive context actually helps.

Gemini Google AI API Context Window Developer Guide

Blog · Jun 28, 2026

OpenAI's June API Updates Are Really a Control-Plane Upgrade

OpenAI's June 2026 API changelog looks like scattered platform plumbing. Read together, moderation scores, workload identity, Admin APIs, prompt-cache retention, container billing, and Secure MCP Tunnel are the pieces teams need to run agents with real controls.

OpenAI AI Agents API Security Developer Tools

Blog · Jun 11, 2026

Qwen 3.7 Max Developer Guide: 1M Context, $1.25/MTok, and Agent-First Architecture

Alibaba shipped Qwen 3.7 Max on May 19, 2026 with a 1M token context window, Anthropic-compatible API, and agent-first architecture. Here is what developers need to know about pricing, performance, and when to use it.

Qwen Alibaba AI Models API Coding

Blog · Jun 10, 2026

Handling Fable 5 Refusals: A Working Guide to the Fallback API

Fable 5 ships with safety classifiers that route flagged requests away from the model. In production you need to handle this, and Anthropic shipped three ways to do it. Here's how each one works, with code, plus the billing rules nobody has written up.

Claude Fable 5 Anthropic API Agents

Blog · Jun 10, 2026

Fable 5 Broke Enterprise ZDR Agreements: What Dev Teams Must Do Now

Anthropic's Claude Fable 5 mandates 30-day data retention on every platform, overriding existing Zero Data Retention contracts for enterprise API customers. Here is what compliance teams and developers need to audit before their next deployment.

anthropic enterprise-compliance data-retention api security claude

Blog · Jun 10, 2026

Fable 5 Before June 22: The Decision Checklist for Every Plan Tier

12 days out from the Fable 5 promotional window closing on claude.ai, here is the practical checklist for Pro users, Max subscribers, teams, and API developers - what to decide, what to test, and what not to worry about.

claude fable-5 billing checklist api anthropic

Blog · Jun 10, 2026

Why Fable 5 Refuses Your Cybersecurity Queries (And How the Fallback Works)

Claude Fable 5 routes blocked queries to Opus 4.8 rather than refusing outright - but the fallback is not automatic for API users and requires explicit configuration. Here is the complete developer guide to the refusal architecture.

Claude AI Safety API Developer Tools Anthropic

Blog · Jun 10, 2026

Migrating to Claude Fable 5: The Practical Guide

Fable 5 is mostly a drop-in replacement for Opus 4.8, but 'mostly' is doing real work in that sentence. Here's every breaking change, what to delete from your code, and the prompt audit you should run before flipping the model ID.

Claude Fable 5 Anthropic API Migration

Blog · Jun 10, 2026

OpenRouter in 2026: Review, Setup, and When Model Routing Pays

OpenRouter gives you one API key for 300+ models, automatic fallbacks, and intelligent provider routing. Here is what it actually costs, how to set it up in five minutes, and when you should skip it entirely.

ai-tools api model-routing developer-tools llm

Blog · Apr 29, 2026

DeepSeek V4: The Developer's Guide to Flash and Pro

DeepSeek V4 splits into Flash and Pro, ships a 1M context window, and undercuts every closed model on price. Here's how to wire it up with the OpenAI SDK, when to pick it over Claude or GPT, and what changed since V3 and R1.

DeepSeek Open Source AI Models API

Blog · Apr 29, 2026

Mercury 2 Developer Guide: Building With a Diffusion LLM in Production

A hands-on developer guide to Mercury 2 from Inception Labs. OpenAI-compatible API, reasoning levels, tool use, structured outputs, and when a diffusion LLM beats an autoregressive one in real apps.

AI LLM Mercury Diffusion Inception Labs API Tutorial

Blog · Apr 29, 2026

Assistants to Responses API: A Migration Field Guide

OpenAI is sunsetting the Assistants API in 2026. Here is a tested migration plan to the Responses API - code, state, threads, tools, every cliff I hit, in order.

OpenAI Responses API Assistants API Migration API

Guide · Apr 23, 2026

Routines (Web) - Claude Code

Managed scheduling on Anthropic infrastructure with API and GitHub triggers.

Tool · Apr 9, 2026

Replicate

Run 50,000+ ML models with a simple API. No infrastructure management. Pay-per-second billing. Deploy custom models with Cog. Popular for image generation and audio.

infrastructure api models gpu inference image-generation

Tool · Apr 9, 2026

Together AI

Fastest inference for open-source models. 200+ models via unified API. Ranks #1 on speed benchmarks for DeepSeek, Qwen, Kimi, and Llama. Serverless pay-per-token pricing.

infrastructure inference api open-source-models gpu fast

Tool · Apr 9, 2026

Groq

LPU-powered inference delivering 500-1,000+ tokens/sec. Purpose-built chip with on-chip SRAM instead of HBM. 5-10x faster than GPU providers. Free tier available.

infrastructure inference lpu fast hardware api

Tool · Apr 9, 2026

Cerebras

Wafer-scale AI inference at 3,000+ tokens/sec. The WSE-3 chip has 4 trillion transistors and 900K AI cores. 20x faster than GPU providers. OpenAI partnership for inference.

infrastructure inference wafer-scale hardware fast api

Tool · Mar 22, 2026

OpenRouter

Unified API for 200+ models. One API key, one billing dashboard. OpenAI, Anthropic, Google, Meta, Mistral, and more. Automatic fallbacks and load balancing.

ai model api gateway multi-model routing
