62 items
56 posts, 6 tools
OpenAI's workplace agent data points to a practical shift: non-developers are starting to use agents for real work, so engineering teams need paved paths, policy, and receipts.
A companion guide to the Codex Record & Replay video: OpenAI Codex can now record a recurring computer task and replay it as a reusable automation skill. Here is what the feature is and where it fits.
A companion guide to the GPT-5.5 video: OpenAI's newly released model rolling out to ChatGPT and Codex, reviewed through benchmarks, agent capabilities, context window, and pricing. Here is what the video covers and where to go deeper.
A companion guide to the OpenAI Codex video: a tour of the Codex desktop app, its plan and goal modes, plugins, multi-agent workflows, and UI annotation. Here is what the video shows and where to go deeper.
OpenAI's June 2026 API changelog looks like scattered platform plumbing. Read together, moderation scores, workload identity, Admin APIs, prompt-cache retention, container billing, and Secure MCP Tunnel are the pieces teams need to run agents with real controls.
Codex-Maxxing should mean bounded autonomy: AGENTS.md, small worktrees, explicit stop conditions, subagents only when work is separable, and review checkpoints that keep humans in control.
OpenAI's June deprecations put Agent Builder, hosted Evals, and reusable prompts on a November 30 shutdown path. Here is the practical migration plan: Agents SDK, repo-owned prompts, and eval receipts.
A Codex CLI SQLite logging bug showed how global TRACE logs can burn SSD write endurance. OpenAI has now merged fixes, but the incident is a useful local-agent operations lesson.
The Transformer co-creator leaves Google DeepMind for OpenAI just two years after Google paid $2.7 billion to bring him back from Character.AI.
OpenAI's mid-June 2026 Codex drop brings Computer Use to the EEA, UK, and Switzerland and adds selective Claude Code imports plus managed Bedrock auth to the CLI. Here is what actually shipped, verified against the changelog.
Same-day-verified llm api pricing june 2026: Claude Fable 5, GPT-5.5, Gemini 3.1 Pro, and DeepSeek V4 compared per million tokens, plus the three caveats that change the math.
GPT-5.4 vs Gemini 3.1 Pro vs DeepSeek V4: pricing, benchmarks, context behavior, and license terms for the mid-tier models that carry most production traffic.
GPT-5.5 vs Claude Opus 4.8: both cost $5 per million input tokens, so the workhorse-tier decision comes down to output pricing, benchmarks, and tooling.
Migrating off retired GPT models in 2026: the live retirement table, what maps to what, an eval-before-switch day plan, and when to jump providers.
The Codex changelog from April through June 2026 covers GPT-5.5, Goal mode going stable, Sites, a Chrome extension, Amazon Bedrock support, and mobile access from iOS. Here is what actually shipped and what it means in practice.
codex exec is OpenAI's non-interactive mode for running Codex agents from scripts, CI pipelines, and GitHub Actions - here is how to set it up safely with real flags and working YAML.
A practical comparison of OpenAI's Agents SDK and Anthropic's Claude Agent SDK - orchestration models, tool ecosystems, sandboxing, and how to choose the right platform for your team.
Four mature, production-ready TypeScript frameworks have made building agents genuinely enjoyable. Here is how to pick the right one - and how they fit together.
The ChatGPT for Google Sheets exfiltration report is not just a spreadsheet bug. It is a warning about agentic office tools: permissions need to be action-scoped, logged, revocable, and visible.

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.