
TL;DR
Opus 4.7 vs GPT-5.5, the new Codex CLI vs the Claude skills ecosystem. An opinionated April 2026 verdict on which terminal agent to reach for, by job.
Direct answer
Opus 4.7 vs GPT-5.5, the new Codex CLI vs the Claude skills ecosystem. An opinionated April 2026 verdict on which terminal agent to reach for, by job.
Best for
Developers comparing real tool tradeoffs before choosing a stack.
Covers
Verdict, tradeoffs, pricing signals, workflow fit, and related alternatives.
Always verify current pricing and features against the official documentation:
| Tool | Documentation | Pricing | Changelog |
|---|---|---|---|
| Claude Code | code.claude.com/docs | anthropic.com/pricing | Claude updates |
| Codex | developers.openai.com/codex | Codex pricing | Codex changelog |
Pricing and model access change frequently. The official pages are the source of truth.
Last updated: June 11, 2026. Verify pricing, plan limits, and CLI behavior against the official sources before standardizing a team workflow.
In April 2026, the terminal-agent question is no longer "which CLI is more capable." Both Claude Code and Codex are competent enough to ship real production work in real repos. The question now is which one fits which job - because the two products have visibly diverged.
Claude Code optimizes for extensibility on top of a planning model. Opus 4.7 is the thinking head; skills, sub-agents, hooks, MCP servers, and plugins are the body. The bet is that you will want to bend the agent to your repo and your team.
Codex optimizes for a tightly integrated agent loop with strong defaults. GPT-5.5, the rebuilt Codex CLI, the new app-server, the in-app browser, and the automatic reviewer are designed to behave well out of the box without much customization.
Both bets are reasonable. They lead to different daily ergonomics.
If your main question is longer-running work, the sharper follow-up is Codex /goal vs Claude Managed Outcomes: Codex is moving toward persistent execution loops, while Claude's managed-agent outcomes are moving toward rubric-graded task closure.
A quick state-of-the-world before the verdict, because anything older than April is already stale.
Anthropic released Claude Opus 4.7 on April 16. Roughly 13% better than Opus 4.6 on a 93-task internal coding benchmark, with stronger vision and noticeably more taste on UI and document tasks. The official Claude docs list Opus 4.7 at $5 / $25 per million tokens, Sonnet 4.6 at $3 / $15, and Haiku 4.5 at $1 / $5 (verified June 11, 2026 against the Claude models overview). Since this post was written, Anthropic shipped Claude Opus 4.8 at the same price point - see the What changed section at the end. For full Claude Code documentation, see the official Claude Code docs.
OpenAI released GPT-5.5 on April 24. Inside Codex, OpenAI explicitly says it produces better results with fewer tokens than GPT-5.4 (see OpenAI pricing). The Codex changelog over the last month also added Unix socket transport for the app-server, sticky environments, remote plugin install, automatic reviewer agents that gate risky approvals, in-app browser hand-off for local dev servers, and codex exec --json reasoning-token output. For full Codex CLI documentation, see the official Codex CLI docs.
Google shipped Gemini 3 Pro and Antigravity on April 22. Relevant context, but it does not change the head-to-head between the two terminal agents.
This is closer than the marketing suggests. On hard, multi-file refactors in real repos, both Opus 4.7 and GPT-5.5 produce working diffs most of the time. The differences:
Net: if you measure SWE-bench-style numbers, they look similar. If you measure your own happiness on a Tuesday, the personalities diverge.
# Same task, two agents
claude -p "add a /healthz endpoint with 200 OK and a tiny test"
codex exec "add a /healthz endpoint with 200 OK and a tiny test"
For tasks at that altitude, Codex usually finishes first.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Apr 28, 2026 • 10 min read
Apr 28, 2026 • 9 min read
Apr 28, 2026 • 9 min read
Apr 28, 2026 • 6 min read
This is where Claude Code is currently in a different league.
The skills ecosystem became real this month. If you want a fast snapshot of what people are building, browse the community-curated claudemarketplaces.com directory and the open-source claude-code-plugins-plus-skills marketplace repo. For the current extension model, start with the Claude Code plugins documentation:
~/.claude/skills/deploy-vercel/SKILL.md
A plugin bundles skills, MCP servers, slash commands, and sub-agents into one installable unit. Hooks let you run shell commands at lifecycle events (see hooks documentation). Sub-agents let you fan work out cleanly. None of this requires SDK code.
Codex's plugin model exists - the recent changelog added remote plugin install and marketplace upgrades - but it is younger, smaller, and less culturally embedded. If you want a community library to copy from on day one, Claude Code wins.
If your team already has an AGENTS.md or DESIGN.md and a folder of skills, that investment compounds in Claude Code. Move to Codex and most of it does not transfer.
Codex catches up here, and arguably surpasses Claude Code.
The new automatic reviewer agent in Codex CLI gates risky approvals through a separate agent before they execute. Permission profiles round-trip across TUI sessions, user turns, MCP sandbox state, and shell escalation. The in-app browser lets Codex click through a real local app to verify a fix. codex exec --json reports reasoning-token usage so you can budget cost programmatically.
Claude Code's hook system is more flexible (you can run any shell command on PreToolUse, PostToolUse, Stop), but Codex's defaults out of the box are tighter. If you want a junior teammate to run an agent and not break prod, Codex is the safer first install.
Use this as a decision frame, not a price calculator:
For pricing tiers, see our Q2 2026 AI coding tools pricing breakdown.
Pick Claude Code when:
AGENTS.md / CLAUDE.md / DESIGN.md and want the agent to actually read themPick Codex when:
Use both when:
Here is the configuration most heavy users I trust are running this week.
~/.claude/settings.json:
{
"model": "claude-opus-4-8",
"subagent_model": "claude-haiku-4-5"
}
~/.codex/config.toml:
model = "gpt-5.5"
auto_review = true
Then alias them so your fingers pick the right tool:
alias plan="claude" # ambiguous, big-picture
alias do="codex" # tight, well-scoped
It sounds silly. It works.
Both products are converging on "agent that reads your repo, plans, edits, runs, verifies." They will keep getting closer on raw ability. The differentiation is going to be:
If I had to bet, the team that wins is the team whose users build things on top of it without permission. That favors Claude Code in the long run. But Codex's April 2026 release is the closest the gap has been, and on a strict cost-per-task basis it is currently the better default for "small, scoped" coding work.
For a deeper field comparison including Cursor and OpenCode, see our four-way matchup.
Verified June 11, 2026 against the Claude models overview, the Opus 4.8 announcement, and the Codex changelog:
Neither is universally better - they optimize for different workflows. Claude Code excels at ambiguous, planning-heavy tasks where you want the agent to think deeply before acting. It has a mature skills and plugin ecosystem for team customization. Codex excels at well-scoped tasks where you want fast execution with strong defaults and built-in safety rails. For most developers, the answer is "use both" - Claude Code for architecture and planning, Codex for tight implementation work.
Codex and Claude Code cost depend on model choice, token use, and whether you are using API pricing or a flat-rate subscription. Claude's docs list Opus 4.8 and Opus 4.7 at $5 / $25 per million tokens, Sonnet 4.6 at $3 / $15, and Haiku 4.5 at $1 / $5 (verified June 11, 2026). For Codex, check OpenAI API pricing and Codex pricing before making a cost call.
Codex runs GPT-5.5 (released April 24, 2026), which OpenAI says produces better results with fewer tokens than GPT-5.4. Claude Code runs Claude Opus 4.7 (released April 16, 2026), roughly 13% better than Opus 4.6 on coding benchmarks with stronger vision capabilities. As of a June 11, 2026 check, Codex still runs GPT-5.5, while Claude Code now defaults to Claude Opus 4.8 (released May 28, 2026) at the same $5 / $25 price point.
Yes, and many developers do exactly this. A common pattern: use Claude Code for planning and architectural decisions, then use Codex for tightly scoped follow-up tasks. Both agents can commit to the same branch, and you review the combined PR. This dual-agent approach gives you second opinions and plays to each tool's strengths.
Codex currently has tighter defaults. The automatic reviewer agent gates risky approvals, permission profiles persist across sessions, and the in-app browser lets it verify changes by clicking through your local dev server. Claude Code's hook system is more flexible - you can run any shell command at lifecycle events - but requires more setup. For teams new to agent CLIs, Codex is the safer first install.
Claude Code wins here decisively. The community library of skills and plugins is broader and more mature. Codex's plugin model exists and recently added remote install, but it is younger and smaller. If you want community resources to copy from on day one, choose Claude Code.
Claude Code optimizes for extensibility on top of a planning model - skills, sub-agents, hooks, MCP servers, and plugins are first-class concepts. The bet is that you will customize the agent to your repo and team. Codex optimizes for a tightly integrated agent loop with strong defaults - the model, CLI, app-server, browser, and reviewer are designed to work well out of the box without much configuration.
Use Claude Code when: the task is ambiguous and benefits from planning (redesigning auth flow, splitting a monolith), you want to invest in skills and hooks as team infrastructure, you already have AGENTS.md or CLAUDE.md files you want the agent to respect, you care about UI and visual taste (Opus 4.7's vision output is better), or you want to run multi-agent fan-outs from one orchestrator.
Use Codex when: the task is well-scoped (fix this test, write this function), you want strong defaults without much configuration, you need the in-app browser to verify changes visually, cost-per-task matters and you lack a flat-rate plan, or your team is new to agent CLIs and you want fewer footguns.
Read next
Anthropic shipped Fable 5 and a June 22 subscription cliff. OpenAI shipped GPT-5.5 inside Codex plus automations, browser use, and computer control. Here is the honest June 2026 update on which tool fits which developer.
9 min readA deep comparison of Codex's new /goal loop and Claude managed agents outcomes, with practical workflow examples, control tradeoffs, and migration guidance for long-running tasks.
18 min readA deep comparison of Claude Code and OpenAI Codex app based on official docs and product updates: execution model, security controls, pricing, workflows, and when each wins.
11 min readTechnical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Mac app for running parallel Claude Code, Codex, and Cursor agents in isolated workspaces. Watch every agent work at onc...
View ToolAnthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolOpenAI's coding agent for terminal, cloud, IDE, GitHub, Slack, and Linear workflows. Reads repos, edits files, runs comm...
View ToolOpenAI's open-source terminal coding agent built in Rust. Runs locally, reads your repo, edits files, and executes comma...
View ToolEvery coding agent in one window. Stop alt-tabbing between Claude, Codex, and Cursor.
View AppInspect Claude Code transcripts to see which files, tools, and tokens are filling the context window.
View AppTurn a one-liner into a working Claude Code skill. From idea to installed in a minute.
View AppLimit which tools a subagent can access.
Claude CodeConfigure Claude Code for maximum productivity -- CLAUDE.md, sub-agents, MCP servers, and autonomous workflows.
AI AgentsDeep comparison of the top AI agent frameworks - LangGraph, CrewAI, Mastra, CopilotKit, AutoGen, and Claude Code.
AI Agents
Nimbalyst Demo: A Visual Workspace for Codex + Claude Code with Kanban, Plans, and AI Commits Try it: https://nimbalyst.com/ Star Repo Here: https://github.com/Nimbalyst/nimbalyst This video demos N...

Composio: Connect AI Agents to 1,000+ Apps via CLI (Gmail, Google Docs/Sheets, Hacker News Workflows) Check out Composio here: http://dashboard.composio.dev/?utm_source=Youtube&utm_channel=0426&utm_...

Anthropic has released Channels for Claude Code, enabling external events (CI alerts, production errors, PR comments, Discord/Telegram messages, webhooks, cron jobs, logs, and monitoring signals) to b...
Anthropic shipped Fable 5 and a June 22 subscription cliff. OpenAI shipped GPT-5.5 inside Codex plus automations, browse...

A deep comparison of Codex's new /goal loop and Claude managed agents outcomes, with practical workflow examples, contro...

A deep comparison of Claude Code and OpenAI Codex app based on official docs and product updates: execution model, secur...

Terminal agent, IDE agent, local-plus-cloud agent. Three architectures compared - how to decide which fits your workflow...

Updated 2026 comparison of Aider and Claude Code using official docs and current workflow patterns: architecture, contro...

OpenAI's April 2026 Codex changelog shows a clear product shift: Codex is becoming a full agent workspace with goals, br...

Codex works from the terminal, cloud tasks, IDEs, GitHub, Slack, and Linear. Here is how to use it and how it compares t...

A developer's comparison of OpenAI and Anthropic ecosystems - models, coding tools, APIs, pricing, and which to choose f...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.