201 items
195 posts, 2 tools, 4 guides
Skills gave an agent what to know. The missing half is what role to play. Agent Studio lets you author subagents next to your skills in one place, serve both over the same MCP endpoint with the same progressive disclosure, browse them over REST and the dd CLI, and publish them to the community under a moderation loop. Here is the design and why the two belong in one studio.
Describe an app in plain language and get a working single-file build back with a live sandboxed preview. Revise it by talking to it, share it with a link, or download the file. Here is what single-file buys you, how revisions work, the honest limits, and what it costs.
Skills, files, memory, and generation do not need four integrations. They need one MCP endpoint with tiered disclosure, one API key that scopes everything to its owner, and one credit balance. The same tools answer to an MCP client, an in-product chat, and a CLI. Here is the whole architecture, and why it is the shape that makes a fleet of agents coherent.
A fair, sourced comparison of the memory layers developers reach for in 2026: Mem0's extract-and-retrieve, Zep's temporal knowledge graph, Letta's self-editing agent memory, and Cloudflare's Durable Objects primitive. Architecture, pricing, the benchmark disputes, and which to pick for your agent.
A decision framework for 2026: MCP servers give an agent access to a live system, Agent Skills teach it how to do a task. Here is when to build each, when to build both, and the criteria that actually decide it, grounded in the MCP spec and Anthropic's skills docs.
OpenAI's workplace agent data points to a practical shift: non-developers are starting to use agents for real work, so engineering teams need paved paths, policy, and receipts.
The first version of skills-over-MCP served a fixed first-party catalog. Skill Studio extends it two ways: anyone can author skills that ride the same progressive-disclosure endpoint scoped to their own API key, and a skill file can be a link instead of a copy - a URL whose bytes are only fetched at the moment an agent decides it needs them. Progressive disclosure stops at the skill boundary no longer. It runs out to the open web.
One expensive orchestrator plus many cheap workers beats an all-frontier fleet for most workloads. Here is the decision-intent cost math with verified Fable 5, Sonnet 5, and Opus 4.8 prices, plus the Sonnet 5 tokenizer caveat that changes worker cost.
A builder's guide to picking a code-execution sandbox for AI agents - E2B, Daytona, Modal, Cloudflare Sandbox, and Vercel Sandbox compared on isolation, latency, state, and pricing model.
Cloudflare announces native support for the x402 HTTP payment protocol, letting developers charge for API calls and web resources with stablecoin micropayments - no accounts or API keys required.
We rebuilt and replatformed this site in a day by running a fleet of AI agents in parallel. Here is the honest operating model - the ownership rules, the verification gate on every handoff, and the failure modes we hit, with the guardrail each one produced.
We retired the playful cream-and-pill design system for a hard-edged neutral, Vercel-inspired contract, and rebuilt the whole site in a day by coordinating parallel AI agents. Here is the design direction, the constraints we picked, how it was built, and what is next.
Fable 5 changes multi-agent orchestration because the orchestrator can now hold the whole project in one head. Here is the manager-model pattern: a 1M-context frontier model leading, delegating scoped work to cheaper workers, and verifying results.
Standing up a fleet of Fable 5 agents is the easy part. This is the operations layer - data retention rules, refusal-rate alerting, effort tuning, observability, and availability planning - that keeps the fleet running.
Vercel's eve gives you the agent plumbing - durable sessions, sandboxed code execution, approvals, subagents - as a folder of files. Fable 5 gives you a long-horizon reasoning model. Here is how to wire them together, what it costs, and who the stack fits.
The orchestrator is the most important model choice in an agent fleet. A fair head-to-head between Fable 5 and Opus 4.8 for that role, with a decision matrix by run length, budget, compliance, and refusal-handling tolerance.
Fable 5 refusals come back as a 200 response, not an error. At fleet scale, that quietly corrupts entire runs. Here is how to detect, fall back, and treat refusal rate as a health metric.
1M context, 128K output, a memory tool, compaction, and task budgets change what a single agent run can cover. Here is what is verified, what is plausible, and six projects builders can try now.
A companion guide to the Loop Engineering video: the shift from repeatedly prompting an LLM to building long-running loops, goals, and automations. Here is the core idea and where to go deeper.
The 2026-07-28 Model Context Protocol spec is the largest revision since launch: a stateless core, deprecated Roots/Sampling/Logging, MCP Apps, Tasks, and tougher OAuth. Here is what breaks, what to adopt, and a migration checklist for server authors and client integrators before the July 28 deadline.

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.