58 items
57 posts, 1 guide
A companion guide to the Nimbalyst video: an open-source visual workspace that runs Codex and Claude Code from your existing subscriptions, with a Kanban board, a planning workflow, and AI commits. Here is what it does and where it fits.
A companion guide to the Codex Record & Replay video: OpenAI Codex can now record a recurring computer task and replay it as a reusable automation skill. Here is what the feature is and where it fits.
A companion guide to the GPT-5.5 video: OpenAI's newly released model rolling out to ChatGPT and Codex, reviewed through benchmarks, agent capabilities, context window, and pricing. Here is what the video covers and where to go deeper.
A companion guide to the OpenAI Codex video: a tour of the Codex desktop app, its plan and goal modes, plugins, multi-agent workflows, and UI annotation. Here is what the video shows and where to go deeper.
Grok Build is xAI's agentic CLI with 8 parallel subagents, a plan-first workflow, and Arena Mode for competing outputs. Covers installation, pricing, real commands, and how it compares to Claude Code and Codex.
Aharness, LangChain's custom harness pattern, and OpenAI's code-first migration all point to the same next step: agent processes need typed gates, validated evidence, and controlled transitions.
Codex-Maxxing should mean bounded autonomy: AGENTS.md, small worktrees, explicit stop conditions, subagents only when work is separable, and review checkpoints that keep humans in control.
A new layer is forming around Claude Code, Codex, Copilot CLI, and local memory tools: the local coding agent workspace. It is not the model. It is the bench where agents get supervised.
OpenMontage is trending because it treats video production like a repo-shaped agent workflow: scripts, assets, render pipelines, review loops, and coding agents working across the whole process.
A developer used OpenAI Codex to build a fully open-source WYSIWYG editor for TikZ figures. The technical approach and reception on Hacker News offer a useful case study in what agent-built software looks like when shipped.
A trending Codex SQLite WAL bug is a useful warning for every local coding agent: logs, disks, background processes, and telemetry paths need budgets too.
A Codex CLI SQLite logging bug showed how global TRACE logs can burn SSD write endurance. OpenAI has now merged fixes, but the incident is a useful local-agent operations lesson.
Codex can point at OpenAI-compatible model providers, local Ollama servers, and internal model proxies. Here is the practical config pattern, the sharp edges, and when to use it.
Goal, loop, routine. Three verbs, two tools, one hard part. A complete field guide to running agentic loops in Claude Code and Codex: the real commands, the patterns people actually run, and the two failure modes that burn money.
GitHub's Agent Finder discovers and invokes Claude, Codex, MCP servers, and skills automatically. Here is how the new ARD specification changes AI coding tool integration.
Databricks open-sourced Omnigent, a meta-harness that sits above individual agent CLIs so your sessions, policies, and skills are not locked inside any single tool. Here is what it does, how to install it, and where it fits if you already run Claude Code and Codex.
OpenAI's mid-June 2026 Codex drop brings Computer Use to the EEA, UK, and Switzerland and adds selective Claude Code imports plus managed Bedrock auth to the CLI. Here is what actually shipped, verified against the changelog.
GitHub's latest agent workspace trend points at a boring but important primitive: agents need explicit filesystem contracts before they get more tools.
The Codex changelog from April through June 2026 covers GPT-5.5, Goal mode going stable, Sites, a Chrome extension, Amazon Bedrock support, and mobile access from iOS. Here is what actually shipped and what it means in practice.
