TL;DR
Claude Fable 5 vs Gemini: how Anthropic's $10/$50 Mythos-class model compares to Gemini 3.1 Pro's $2/$12 preview on pricing, context, and benchmarks.
Direct answer
Claude Fable 5 vs Gemini: how Anthropic's $10/$50 Mythos-class model compares to Gemini 3.1 Pro's $2/$12 preview on pricing, context, and benchmarks.
Best for
Developers comparing real tool tradeoffs before choosing a stack.
Covers
Verdict, tradeoffs, pricing signals, workflow fit, and related alternatives.
Read next
Same-day-verified llm api pricing june 2026: Claude Fable 5, GPT-5.5, Gemini 3.1 Pro, and DeepSeek V4 compared per million tokens, plus the three caveats that change the math.
10 min readGPT-5.4 vs Gemini 3.1 Pro vs DeepSeek V4: pricing, benchmarks, context behavior, and license terms for the mid-tier models that carry most production traffic.
8 min readFable 5 launched June 9 at 2x GPT-5.5's price with a 22-point SWE-Bench Pro gap. Here is the decision framework for choosing between them.
7 min readLast updated: June 11, 2026
Anthropic shipped Claude Fable 5 on June 9, 2026, and after the OpenAI comparisons the next question is the Google one. Search "claude fable 5 vs gemini" right now and you get thin results, because the two models sit in different price classes and most coverage treats them separately. For teams not on OpenAI, these are the two frontier options that matter this month: Fable 5 at $10 per million input tokens and $50 per million output with a flat-priced 1M context window, and Gemini 3.1 Pro Preview at $2/$12 with a price step above 200K tokens. Add the pre-GA noise around Gemini 3.5 Pro's claimed 2M context, and the decision deserves a careful look.
We have already covered how Fable 5 stacks up against OpenAI's lineup and against DeepSeek V4 on cost versus quality. This post covers the Google side, with every number checked against the live pricing pages.
Gemini 3.1 Pro is the value play and Fable 5 is the capability play, and the gap in both directions is large enough that most teams will not agonize over this choice for long.
Gemini 3.1 Pro costs a fifth of Fable 5 on input and roughly a quarter on output below 200K context, and it posts genuinely strong scores - 80.6% on SWE-bench Verified and a verified 77.1% on ARC-AGI-2. Fable 5 is the first publicly available Mythos-class model, and where both have published numbers it leads by 10 to 20 points. You pay for that lead on every token, and Fable 5 is noticeably slower in interactive use.
The wrinkle is context pricing: Anthropic bills the full 1M window flat, while Google doubles the input price above 200K tokens. That narrows the cost gap on long-context work but never closes it. Details below.
All pricing verified June 11, 2026 against the live pages: Anthropic's pricing docs and the Gemini API pricing page.
| Claude Fable 5 | Gemini 3.1 Pro Preview | |
|---|---|---|
| Status | GA, released June 9, 2026 | Preview since February 19, 2026 |
| API model ID | claude-fable-5 | gemini-3.1-pro-preview |
| Input price (per MTok) | $10 flat across full context | $2.00 up to 200K / $4.00 above 200K |
| Output price (per MTok) | $50 flat | $12.00 up to 200K / $18.00 above 200K |
| Cache read | $1.00 | $0.20 up to 200K / $0.40 above, plus $4.50 per MTok per hour storage |
| Batch pricing | $5 input / $25 output | $1.00-$2.00 input / $6.00-$9.00 output |
| Context window | 1M tokens | 1M tokens |
| Max output | 128K tokens | 64K tokens |
Two notes the table hides. First, Anthropic states that Fable 5's full 1M window bills at standard pricing - a 900K-token request costs the same per-token rate as a 9K one. Second, the same page notes the tokenizer Fable 5 inherits from Opus 4.7 can produce up to 35% more tokens for the same text versus older Claude models, so sticker prices understate its real cost if your baseline was measured on an older tokenizer.
The headline framing you will see elsewhere is "Claude charges flat, Google penalizes long context." That is true but easy to over-read, so here is the actual math, verified June 11, 2026 against the Gemini API pricing page and Anthropic's pricing docs.
A typical request under the step - 150K input tokens, 8K output:
Fable 5 costs roughly 4.8x more. Now a long-context request - 500K input tokens, 10K output:
The gap narrows to about 2.5x, because Gemini's input price doubles while Fable 5 stays flat. So the honest version: Gemini 3.1 Pro is cheaper at every context length. Fable 5's flat pricing is not a discount, it is predictability - no step function in your cost model when an agent's context crosses 200K mid-session. And if your workloads live under 200K tokens, Gemini's higher tier never even triggers. For deeper modeling on the Anthropic side, see our Fable 5 cost-per-task analysis.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Jun 11, 2026 • 9 min read
Jun 11, 2026 • 10 min read
Jun 11, 2026 • 10 min read
Jun 11, 2026 • 10 min read
On paper this is not close, but the numbers come from different reporters and partially different benchmark versions, so treat the table as directional.
| Benchmark | Claude Fable 5 | Gemini 3.1 Pro |
|---|---|---|
| SWE-bench Verified | 95.0% | 80.6% |
| Terminal-Bench | 88.0% (v2.1) | 68.5% (v2.0) |
| Humanity's Last Exam (no tools) | 59.0% | 44.4% |
| ARC-AGI-2 | not published | 77.1% (verified) |
| GPQA Diamond | not published | 94.3% |
| OSWorld-Verified | 85.0% | not published |
Fable 5's figures come from kingy.ai's benchmark roundup, which compiles Anthropic and Vals data; Gemini's from nxcode's Gemini 3.1 Pro guide and Google's announcement. The Terminal-Bench versions differ (2.1 vs 2.0), neither vendor publishes the other's headline metric, and Anthropic's own claim is the unsurprising "state-of-the-art on nearly all tested benchmarks". The consistent 10-20 point spread across independent compilations is the signal, not any single cell.
Qualitative reports back the spread. Simon Willison, after 5.5 hours of release-day testing, called Fable 5 "a beast" that is "slow, expensive" but so capable that "the challenge is finding tasks that it can't do." Meanwhile Gemini 3.1 Pro's verified 77.1% on ARC-AGI-2 - more than double its predecessor's reasoning score per Google - is a real result on a benchmark designed to resist memorization. These are different strengths: Fable 5 is built for long autonomous runs, Gemini 3.1 Pro for strong reasoning at workhorse prices.
One Fable 5 quirk to know before committing: it ships safety classifiers covering cybersecurity, biology/chemistry, and distillation, and requests that trip them are handled by Claude Opus 4.8 instead. Anthropic reports more than 95% of sessions involve no fallback, but in those domains part of your traffic will quietly run on a different model.
Gemini 3.1 Pro is unusually easy to reach: Google lists the Gemini API via AI Studio, Gemini CLI, Android Studio, the Antigravity agent platform, Vertex AI, Gemini Enterprise, the Gemini app, and NotebookLM as launch surfaces. If you live in the terminal, our Gemini CLI guide covers the workflow. The catch is the label: nearly four months after release it is still gemini-3.1-pro-preview, with no stable GA model ID, which matters if your org has policies about preview models in production.
Fable 5 went GA on day one via the Claude API, and Anthropic included it on Pro, Max, Team, and seat-based Enterprise plans at no extra cost through June 22 - after June 23 it requires usage credits. If you want to feel the difference before committing API budget, this window is the cheapest way to do it.
The strongest argument for waiting is the model Google has not shipped. Gemini 3.5 Pro was reported in early June to be nearing launch with a claimed 2M-token context window and a Deep Think reasoning mode, targeting general availability in June 2026, with expected pricing around $15/$60 per MTok. As of June 11, 2026, there is no Gemini 3.5 Pro row on the live Gemini API pricing page - only Gemini 3.5 Flash, at $1.50/$9.00. As TechTimes puts it, until independent testers can evaluate it, "its capabilities remain Google's claims rather than verified performance."
Two things follow. If those numbers hold, Google's next flagship lands in Fable 5's price class, not Gemini 3.1 Pro's, which reframes the "Google is the cheap option" assumption. And pre-GA claims are not a plan: choose between the models you can call today, then re-evaluate when 3.5 Pro has a pricing row and third-party numbers.
Skip Fable 5 if your workload is mostly routine completions, latency-sensitive, or budget-capped - at 2.5x to 5x Gemini's price, plus a tokenizer that can inflate counts up to 35% versus older Claude models, the premium only pays when the capability gap saves you failed runs or human rework. Also think twice if you work heavily in cybersecurity or life-science domains, where classifier fallbacks reroute part of your traffic.
Skip Gemini 3.1 Pro if you need more than 64K output tokens per call, if preview status is a compliance blocker, or if you have already watched cheaper frontier models fail on your hardest agentic tasks - the benchmark spread suggests Fable 5 will convert some of those failures, and a converted failure is usually worth more than the token savings.
And stay where you are if your current model completes your tasks reliably. Both of these launches reward teams that measure completion rates before switching, not after.
On published numbers, yes by a wide margin: 95.0% vs 80.6% on SWE-bench Verified and 88.0% vs 68.5% on Terminal-Bench, though the Terminal-Bench versions differ (2.1 vs 2.0) and the figures come from different compilers. The gap matters most on long autonomous coding runs; for routine completions, Gemini 3.1 Pro is strong and far cheaper.
At standard rates verified June 11, 2026: $10 vs $2.00 per million input tokens (5x) and $50 vs $12.00 per million output (about 4.2x) below 200K context. Above 200K, Gemini moves to $4.00/$18.00 while Fable 5 stays flat, narrowing the total-cost gap to roughly 2.5x on input-heavy requests.
Yes. The live Gemini API pricing page lists $2.00 input/$12.00 output for prompts up to 200K tokens and $4.00/$18.00 for prompts above it, with cache reads also doubling from $0.20 to $0.40 per MTok. Anthropic prices Fable 5's full 1M window flat.
Probably not. Gemini 3.5 Pro's 2M context and Deep Think mode are pre-GA claims with expected pricing around $15/$60 per MTok - which would put it in Fable 5's price class, not Gemini 3.1 Pro's. It has no row on the live pricing page as of June 11, 2026. Decide between shipping models and revisit when it is GA with independent numbers.
Yes, and the economics often argue for it: route routine and interactive work to Gemini 3.1 Pro at $2/$12 and reserve Fable 5's $10/$50 for the long-horizon tasks where cheaper models fail. Both providers offer 50% batch discounts and prompt caching to tune costs further.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Open-source AI pair programming in your terminal. Works with any LLM - Claude, GPT, Gemini, local models. Git-aware ed...
View ToolGoogle's frontier model family. Gemini 2.5 Pro has 1M token context and top-tier coding benchmarks. Gemini 3 Pro pushes...
View ToolAnthropic's smallest Claude 4.5 model. Near-frontier coding performance at one-third the cost of Sonnet 4 and up to 4-5x...
View ToolAnthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolEvery coding agent in one window. Stop alt-tabbing between Claude, Codex, and Cursor.
View AppTurn a one-liner into a working Claude Code skill. From idea to installed in a minute.
View AppUnlock pro skills and share private collections with your team.
View AppDeep comparison of the top AI agent frameworks - LangGraph, CrewAI, Mastra, CopilotKit, AutoGen, and Claude Code.
AI AgentsThe primary command-line entry point for Claude Code sessions.
Claude CodeReal-time prompt loop with history, completions, and multiline input.
Claude Code
Claude Fable 5 Released: Benchmarks, Pricing, Availability, and Real-World Examples Anthropic has released Claude Fable 5, the first general-use “Mythos class” model, and the video reviews the announ...

Nimbalyst Demo: A Visual Workspace for Codex + Claude Code with Kanban, Plans, and AI Commits Try it: https://nimbalyst.com/ Star Repo Here: https://github.com/Nimbalyst/nimbalyst This video demos N...

Claude Design by Anthropic: Generate a Design System From Your Repo + Build High-Fidelity UI Fast The video reviews Claude Design by Anthropic, calling it a highly differentiated product, and demonst...
Claude Agent SDK vs Claude Code explained: same engine, two surfaces. Here is the concrete decision line, plus where Man...
Same-day-verified llm api pricing june 2026: Claude Fable 5, GPT-5.5, Gemini 3.1 Pro, and DeepSeek V4 compared per milli...
GPT-5.4 vs Gemini 3.1 Pro vs DeepSeek V4: pricing, benchmarks, context behavior, and license terms for the mid-tier mode...
GPT-5.5 vs Claude Opus 4.8: both cost $5 per million input tokens, so the workhorse-tier decision comes down to output p...
Anthropic broke its own naming ladder when it introduced the Mythos class and Claude Fable 5. Here is what the shift mea...
Claude Desktop spawns a Hyper-V virtual machine consuming roughly 1.8 GB of RAM on every Windows launch - even when you...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.