TL;DR
Migrating off retired GPT models in 2026: the live retirement table, what maps to what, an eval-before-switch day plan, and when to jump providers.
Read next
A verified directory of the frontier AI models in June 2026 - Claude Fable 5, GPT-5.5, GPT-5.4, Gemini 3.1 Pro, and DeepSeek V4 - with pricing checked against official docs.
10 min readFable 5 posts an 80.3% SWE-Bench Pro score and costs 2x Opus 4.8 - here is the task-profile scoring guide that tells you when the premium pays off.
7 min readThe AI coding market is noisy. The changes that matter are easier to spot when you separate model capability, editor loops, terminal agents, background agents, agent frameworks, UI layers, context, security, and cost.
10 min readLast updated: June 11, 2026
OpenAI is running its largest model cleanup since the GPT-3 era. On April 22, 2026 it deprecated more than a dozen models in one sweep - GPT-4, GPT-4o snapshots, the o-series, and five Codex variants - then added the GPT-5.2 and 5.3 chat aliases on May 8. The first shutdowns land July 23, 2026.
This checklist is built from the live deprecations page as of June 11, 2026: what maps to what, the API diffs that bite, an eval-before-switch plan, the cost table, and when a retirement is the right moment to leave the platform entirely.
OpenAI's deprecations page defines its terms precisely. "Deprecated" takes effect the moment it is announced - the model still works, but the countdown has started. "Shut down" means requests stop resolving. Stated notice: at least 6 months for GA models, 3 months for specialized variants, as little as 2 weeks for previews.
Two things people get wrong:
The base GPT-5.x models are not deprecated. As of June 11, 2026, there are no deprecation rows for gpt-5, gpt-5.1, gpt-5.2, gpt-5.3, or their mini, nano, and pro variants. What is retiring are the -chat-latest aliases and the Codex-branded variants. If you pinned a base model, you have no deadline yet.
The dates cluster into three waves. July 23, 2026 takes the GPT-5 and 5.1 chat aliases, all five Codex variants - gpt-5.2-codex included - and both deep-research models. August 10 takes the 5.2 and 5.3 chat aliases. October 23 is the big one: GPT-4, GPT-4o, GPT-3.5 Turbo, o1, o1-pro, o3-mini, and o4-mini.
Every row comes from the official deprecations page, accessed June 11, 2026:
| Retiring model | Deprecated | Shuts down | OpenAI's replacement |
|---|---|---|---|
| gpt-5-chat-latest, gpt-5.1-chat-latest | Apr 22, 2026 | Jul 23, 2026 | gpt-5.5 |
| gpt-5-codex, gpt-5.1-codex, gpt-5.1-codex-max, gpt-5.2-codex | Apr 22, 2026 | Jul 23, 2026 | gpt-5.5 |
| gpt-5.1-codex-mini | Apr 22, 2026 | Jul 23, 2026 | gpt-5.4-mini |
| o3-deep-research, o4-mini-deep-research | Apr 22, 2026 | Jul 23, 2026 | gpt-5.5-pro |
| gpt-5.2-chat-latest, gpt-5.3-chat-latest | May 8, 2026 | Aug 10, 2026 | gpt-5.5 |
| gpt-4o-2024-05-13 | Apr 22, 2026 | Oct 23, 2026 | gpt-5.5 |
| gpt-4-0613 (and aliases) | Apr 22, 2026 | Oct 23, 2026 | gpt-5.5 |
| o1, o3-mini | Apr 22, 2026 | Oct 23, 2026 | gpt-5.5 |
| o1-pro | Apr 22, 2026 | Oct 23, 2026 | gpt-5.5-pro |
| o4-mini | Apr 22, 2026 | Oct 23, 2026 | gpt-5.4-mini |
| gpt-3.5-turbo-0125 | Apr 22, 2026 | Oct 23, 2026 | gpt-5.4-mini |
The pattern: almost everything funnels into gpt-5.5, pro-tier reasoning goes to gpt-5.5-pro, and small cheap models go to gpt-5.4-mini. The mapping is a starting point, not a verdict - o4-mini to gpt-5.4-mini is like-for-like, but gpt-4-0613 to gpt-5.5 jumps roughly three model generations and your prompts will behave differently.
Some retirements already happened: codex-mini-latest on February 12, 2026, chatgpt-4o-latest on February 17, the GPT-4o audio and realtime previews on May 7, DALL-E 2 and 3 on May 12. Check older code paths for these first.
The biggest non-model deadline: the Assistants API, deprecated August 26, 2025, shuts down August 26, 2026, replaced by the Responses and Conversations APIs. Model swaps are one-line changes; this is an architecture change - threads, runs, and tool orchestration all move. Our Responses API migration walkthrough covers the mapping. Do it before the model swap so the eval suite only runs once.
The lesson of this wave: -chat-latest aliases retire on their own schedule regardless of the underlying model's status. Pin dated snapshots in production and keep aliases in dev only. Your inventory grep should look for both forms.
o1, o1-pro, o3-mini, o4-mini, and both deep-research models all disappear by October 23. The o-series had distinct latency and deliberation behavior that teams tuned prompts around. When those prompts move to GPT-5.5, treat them as unverified - recommended replacement does not mean behavioral equivalent.
On June 3, 2026, OpenAI deprecated its hosted Evals platform, Agent Builder, and the Reusable Prompts API, all shutting down November 30, 2026. If your plan was to run comparisons in OpenAI's hosted evals UI, that tool is itself on a deprecation clock. Keep the eval harness in your own repo, pointed at whatever provider you like.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Jun 11, 2026 • 10 min read
Jun 11, 2026 • 10 min read
Jun 10, 2026 • 7 min read
Jun 10, 2026 • 8 min read
A model swap without evals is a production incident with extra steps. The plan fits in one week:
Day 1 - Inventory. Grep for every model string: gpt-4, gpt-4o, gpt-3.5, o1, o3, o4, codex, -chat-latest. Check the usage dashboard for models called from code you forgot - old Lambdas and cron jobs are where retired models hide. Tag each call site with its shutdown date.
Day 2 - Build the golden set. Pull 50 to 200 real production requests per call site, with outputs you considered good. Skew toward edge cases and past failures. Capture latency and token counts as your baseline.
Day 3 - Run side by side. Replay the golden set against the recommended replacement and at least one alternative (leaving o4-mini, test both gpt-5.4-mini and gpt-5.4). Score with whatever fits: exact match for structured output, a judge model for prose, your test suite for code.
Day 4 - Re-run the cost math. Token counts change across model generations, and cached-input pricing changes the equation for repeated system prompts. Compute real cost per request from the Day 3 runs, not sticker prices.
Day 5 - Staged cutover. Put the model name in config, not code. Ship behind a flag at 10 percent of traffic, compare error rates and output quality, then ramp. Keep the old model in the fallback path until its shutdown date - that is what the deprecation window is for.
Replacement-target pricing from the official pricing page, accessed June 11, 2026, per million tokens:
| Model | Input | Cached input | Output |
|---|---|---|---|
| gpt-5.5 | $5.00 | $0.50 | $30.00 |
| gpt-5.5-pro | $30.00 | - | $180.00 |
| gpt-5.4 | $2.50 | $0.25 | $15.00 |
| gpt-5.4-mini | $0.75 | $0.075 | $4.50 |
| gpt-5.4-nano | $0.20 | $0.02 | $1.25 |
| gpt-5.4-pro | $30.00 | - | $180.00 |
Batch processing is a flat 50 percent discount across the family. The retired models no longer appear on the pricing page at all - one more reason to capture per-request cost on Day 2 while the old model still runs.
Two routing notes. First, gpt-5.4 at $2.50/$15 is half the price of gpt-5.5 - if your gpt-4o workload never pushed capability limits, evaluate the cheaper target first. Second, cached input at one tenth of the base rate rewards putting static system prompts and tool definitions first in every request. The cross-provider picture is in our frontier model API pricing roundup; the GPT-5.5 developer guide covers what changes at the API surface.
If you consume these models through Microsoft Foundry (formerly Azure OpenAI), the dates above do not apply to you. Azure publishes its own model lifecycle and retirement policy: GA models get an 18-month lifecycle, new-customer access ends at 12 months, and retired models return 410 Gone.
The sharpest difference is auto-upgrades. Azure force-upgrades Standard deployments at retirement - when gpt-4o versions 2024-05-13 and 2024-08-06 retired on March 31, 2026, those deployments were auto-upgraded to gpt-5.1. A silent three-generation jump is not something you want to learn about from a customer ticket. Set versionUpgradeOption deliberately: NoAutoUpgrade means the deployment stops at retirement, which is at least a loud failure. Provisioned deployments are never auto-upgraded. Fine-tuned models get their own schedule - gpt-4o fine-tune deployments survive on Azure until October 1, 2027.
A forced migration is the cheapest moment to re-evaluate the platform decision - you are paying the eval and cutover cost anyway. Testing a third provider on Day 3 costs one more column in the results spreadsheet.
Alternatives at the tiers people are leaving, per official pricing pages accessed June 11, 2026 (input/output per MTok):
| Tier you are leaving | OpenAI target | Anthropic option | Open-weights option |
|---|---|---|---|
| gpt-4o / chat aliases | gpt-5.5 ($5/$30) | Claude Opus 4.8 ($5/$25) | DeepSeek-V4-Pro ($0.435/$0.87) |
| o4-mini / gpt-3.5 | gpt-5.4-mini ($0.75/$4.50) | Claude Haiku 4.5 ($1/$5) | DeepSeek-V4-Flash ($0.14/$0.28) |
| Mid-tier workhorse | gpt-5.4 ($2.50/$15) | Claude Sonnet 4.6 ($3/$15) | DeepSeek-V4-Flash ($0.14/$0.28) |
Anthropic's pricing is structurally similar to OpenAI's - 0.1x cache reads, 50 percent batch discount - and Opus 4.8 undercuts gpt-5.5 on output at the same input price; start with our Fable 5 migration notes if that is the path. DeepSeek's V4 pricing is the aggressive option; its API speaks both OpenAI and Anthropic wire formats, making it cheap to include in a bake-off - see the deepseek-chat to V4 migration guide.
The honest caveat: switching providers does not get you off the deprecation treadmill. DeepSeek deprecates its legacy deepseek-chat and deepseek-reasoner model names on July 24, 2026. Anthropic lists Opus 4.1, Opus 4, and Sonnet 4 as deprecated right now. Every provider runs this cycle; the variables are notice and mapping quality.
Jumping providers is the wrong call more often than the pricing tables suggest. Stay if any of these hold:
The October wave is large but the process is the same at any scale: inventory, golden set, side-by-side, cost math, staged cutover. Budget one week per major call site and these deadlines are comfortable, not scary.
No. As of June 11, 2026, the deprecations page lists no shutdown for gpt-5.4 or any base GPT-5.x model. The retirements cover -chat-latest aliases, Codex variants, the o-series, and GPT-4-generation models.
OpenAI's recommendation for gpt-4o-2024-05-13 is gpt-5.5, with a shutdown date of October 23, 2026. If the workload was cost-sensitive rather than capability-bound, evaluate gpt-5.4 too - run both against a golden set first.
August 26, 2026, one year after its deprecation announcement. The replacements are the Responses and Conversations APIs. It is an architectural migration, not a model swap, so schedule it before model changes on the same code paths.
No. Microsoft Foundry runs an independent 18-month lifecycle - Azure retired gpt-4o versions 2024-05-13 and 2024-08-06 on March 31, 2026, months ahead of OpenAI's October 23 API date, and auto-upgrades Standard deployments at retirement unless versionUpgradeOption says otherwise.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
OpenAI's flagship. GPT-4o for general use, o3 for reasoning, Codex for coding. 300M+ weekly users. Tasks, agents, web br...
View ToolUnified API for 200+ models. One API key, one billing dashboard. OpenAI, Anthropic, Google, Meta, Mistral, and more. Aut...
View ToolOpenAI's latest flagship model. Major leap in reasoning, coding, and instruction following over GPT-4o. Powers ChatGPT P...
View ToolFactory AI's terminal coding agent. Runs Anthropic and OpenAI models in one subscription. Handles full tasks end-to-end...
View ToolTurn a one-liner into a working Claude Code skill. From idea to installed in a minute.
View AppBeat the August 2026 Assistants API sunset. Paste old code, get Responses API.
View AppPick a model in 30 seconds. Built for the answer, not the marketing.
View AppInstall Ollama and LM Studio, pull your first model, and run AI locally for coding, chat, and automation - with zero cloud dependency.
Getting StartedExecute shell commands with persistent working directory in project bounds.
Claude CodeA concrete step-by-step guide to moving your development workflow from Cursor to Claude Code - settings, rules, keybindings, and the habits that transfer.
Getting StartedA verified directory of the frontier AI models in June 2026 - Claude Fable 5, GPT-5.5, GPT-5.4, Gemini 3.1 Pro, and Deep...
Fable 5 posts an 80.3% SWE-Bench Pro score and costs 2x Opus 4.8 - here is the task-profile scoring guide that tells you...
Fable 5 1M context workflows that actually work: whole-repo reviews, log archaeology, multi-doc synthesis - plus the hon...
Fable 5 effort levels explained: what low, medium, high, xhigh, and max actually change, which models support each level...
A practical playbook for running Claude Fable 5 as the orchestrator over Sonnet and Haiku workers, with verified cost ma...
Twelve documented Claude Fable 5 use patterns - agent orchestration, overnight runs, 1M-context refactors, effort tuning...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.