
TL;DR
Claude is now GA in Microsoft Foundry on Azure with native billing, Entra ID auth, and GB300 Blackwell infrastructure. Here is the full developer setup - CCU pricing, SDK examples, deployment options, and what enterprise teams need to know.
| Resource | Link |
|---|---|
| Claude in Microsoft Foundry docs | platform.claude.com/docs/en/build-with-claude/claude-in-microsoft-foundry |
| Microsoft Foundry portal | ai.azure.com |
| Azure Foundry pricing | azure.microsoft.com/pricing/details/microsoft-foundry |
| Anthropic pricing | claude.com/pricing |
| Microsoft deployment guide | learn.microsoft.com/azure/foundry/foundry-models/how-to/use-foundry-models-claude |
| Azure blog announcement | azure.microsoft.com/blog/claude-in-microsoft-foundry-is-now-generally-available |
As of June 29, 2026, Claude Opus 4.8 and Claude Haiku 4.5 are generally available in Microsoft Foundry on Azure. This is the first time enterprise teams can access Claude with full Azure-native billing, identity management, and governance - no separate Anthropic contract required for initial deployment.
The practical implication: if your organization already runs on Azure with an Enterprise Agreement, you can start using Claude today with charges appearing on your existing Azure invoice. Microsoft Azure Consumption Commitment (MACC) applies.
Last updated: June 30, 2026
Microsoft Foundry is Azure's unified AI platform - the successor to Azure AI Studio. On June 29, Anthropic announced general availability of Claude models in Foundry with two hosting options:
Hosted on Azure (GA) - Claude runs on Anthropic-operated infrastructure within Azure data centers. Prompts and completions stay within Azure; only usage metadata and safety-flagged content egress to Anthropic. Available models: Claude Opus 4.8 and Claude Haiku 4.5.
Hosted on Anthropic (Preview) - Claude runs on Anthropic's own infrastructure. Supports all Claude models including Fable 5 and the full Opus/Sonnet lineup. Use this for features not yet available on Azure-hosted deployments.
The Hosted on Azure option runs on NVIDIA GB300 Blackwell Ultra systems, which Microsoft positions for autonomous and domain-specific AI agents.
| Model | Hosted on Azure | Hosted on Anthropic |
|---|---|---|
| Claude Fable 5 | - | Yes |
| Claude Opus 4.8 | Yes | Yes |
| Claude Opus 4.7 | - | Yes |
| Claude Opus 4.6 | - | Yes |
| Claude Opus 4.5 | - | Yes |
| Claude Sonnet 4.6 | - | Yes |
| Claude Sonnet 4.5 | - | Yes |
| Claude Haiku 4.5 | Yes | Yes |
All models on Foundry have 1M-token context windows except Claude Sonnet 4.5, which has 200k.
Microsoft Foundry bills through Azure Marketplace using Claude Consumption Units (CCUs). The conversion is straightforward: 100 CCU = $1.00 USD of Claude usage at standard Anthropic API rates.
| Billing detail | How it works |
|---|---|
| Billing unit | Claude Consumption Unit (CCU) |
| CCU price | $0.01 per CCU (fixed) |
| Conversion | Token usage rated at standard Anthropic per-MTok rates, then converted to CCUs |
| Billing cadence | Hourly metering to Azure Marketplace; monthly invoices |
| Payment model | Postpaid only - no prepaid credits |
| Discounts | Applied as fewer CCUs metered |
| MACC eligible | Yes |
The underlying token pricing matches Anthropic's standard API rates:
| Model | Input ($/MTok) | Output ($/MTok) |
|---|---|---|
| Fable 5 | $10 | $50 |
| Opus 4.8 | $5 | $25 |
| Sonnet 4.6 | $3 | $15 |
| Haiku 4.5 | $1 | $5 |
Prompt caching multipliers apply: 5-minute cache writes cost 1.25x input, 1-hour cache writes cost 2x input, cache hits cost 0.1x input. Batch API discounts (50% off) are available for async workloads.
US Data Zone pricing: Using the US Data Zone Standard deployment type adds a 1.1x multiplier to all token pricing. This keeps inference within the United States.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Jun 30, 2026 • 6 min read
Jun 30, 2026 • 8 min read
Jun 30, 2026 • 7 min read
Jun 30, 2026 • 8 min read
{resource} in API endpointsYour endpoint URL format: https://{resource}.services.ai.azure.com/anthropic/v1/*
claude-opus-4-8)Foundry is supported by Python, TypeScript, C#, Java, and PHP SDKs.
Python:
pip install -U "anthropic"
TypeScript:
npm install @anthropic-ai/foundry-sdkC#:
dotnet add package Anthropic.Foundry
Java (Gradle):
implementation("com.anthropic:anthropic-java-foundry:2.40.0")
Set environment variables:
export ANTHROPIC_FOUNDRY_API_KEY="your-api-key"
export ANTHROPIC_FOUNDRY_RESOURCE="your-resource-name"
Python example:
import os
from anthropic import AnthropicFoundry
client = AnthropicFoundry(
api_key=os.environ.get("ANTHROPIC_FOUNDRY_API_KEY"),
resource="example-resource",
)
message = client.messages.create(
model="claude-opus-4-8",
max_tokens=1024,
messages=[{"role": "user", "content": "Hello!"}],
)
print(message.content)
TypeScript example:
import AnthropicFoundry from "@anthropic-ai/foundry-sdk";
const client = new AnthropicFoundry({
apiKey: process.env.ANTHROPIC_FOUNDRY_API_KEY,
resource: "example-resource"
});
const message = await client.messages.create({
model: "claude-opus-4-8",
max_tokens: 1024,
messages: [{ role: "user", content: "Hello!" }]
});
console.log(message.content);
cURL example:
curl https://{resource}.services.ai.azure.com/anthropic/v1/messages \
-H "content-type: application/json" \
-H "api-key: YOUR_AZURE_API_KEY" \
-H "anthropic-version: 2023-06-01" \
-d '{
"model": "claude-opus-4-8",
"max_tokens": 1024,
"messages": [
{"role": "user", "content": "Hello!"}
]
}'
For enterprise deployments, use Microsoft Entra ID (formerly Azure Active Directory) for centralized access management via Azure RBAC.
Python with Entra ID:
import os
from anthropic import AnthropicFoundry
from azure.identity import DefaultAzureCredential, get_bearer_token_provider
token_provider = get_bearer_token_provider(
DefaultAzureCredential(), "https://ai.azure.com/.default"
)
client = AnthropicFoundry(
resource="example-resource",
azure_ad_token_provider=token_provider,
)
message = client.messages.create(
model="claude-opus-4-8",
max_tokens=1024,
messages=[{"role": "user", "content": "Hello!"}],
)
print(message.content)
cURL with Entra ID:
ACCESS_TOKEN=$(az account get-access-token --resource https://ai.azure.com --query accessToken -o tsv)
curl https://{resource}.services.ai.azure.com/anthropic/v1/messages \
-H "content-type: application/json" \
-H "Authorization: Bearer $ACCESS_TOKEN" \
-H "anthropic-version: 2023-06-01" \
-d '{
"model": "claude-opus-4-8",
"max_tokens": 1024,
"messages": [
{"role": "user", "content": "Hello!"}
]
}'
When using Hosted on Azure deployments, the following features are not available:
Requests that use these features return a 400 Bad Request error. Claude Code automatically detects Hosted on Azure deployments and adapts its feature set.
For full feature parity, use Hosted on Anthropic deployments instead.
Azure provides native observability for Claude usage:
For debugging, include both request-id and apim-request-id response headers when contacting support.
To move from Hosted on Anthropic to Hosted on Azure (or vice versa):
model parameterIf the new deployment is in the same Foundry resource, your endpoint URL and authentication stay unchanged.
On June 29, Governor Newsom announced that California signed a first-of-its-kind agreement with Anthropic giving state agencies, cities, and counties access to Claude at a 50% discount. The deal includes free workforce training, technical assistance, and workflow help from Anthropic developers.
This partnership runs through Azure Marketplace, using the same CCU billing structure. Public sector organizations in California should contact their Anthropic or Microsoft account representative for access.
Use Microsoft Foundry when:
Use the direct Anthropic API when:
The underlying token rates are identical. Foundry adds no markup - you pay standard Anthropic rates, converted to CCUs for Azure billing. The main difference is payment method (Azure invoice vs Anthropic billing) and MACC eligibility.
Yes. Claude Code detects Hosted on Azure deployments automatically and adapts its feature set. Some features like server-side tools are not available, but core coding assistance works.
A CCU is a billing unit for Azure Marketplace. 100 CCU = $1.00 USD of Claude usage at standard rates. CCUs are metered hourly and invoiced monthly in arrears.
Prompt caching is available with the same multipliers as the direct API. Batch API with 50% discount is available. Fast mode is not available on Claude Platform on AWS but should be checked for Foundry availability.
Currently, Claude Opus 4.8 and Claude Haiku 4.5 are available on Hosted on Azure. Other models including Fable 5 and the Sonnet family are available on Hosted on Anthropic (preview).
Use the US Data Zone Standard deployment type when creating your deployment. This keeps inference within the United States and applies a 1.1x pricing multiplier.
Yes. The SDK changes are minimal - swap Anthropic for AnthropicFoundry and provide your resource name. API request/response formats are identical.
Include both request-id and apim-request-id response headers when contacting support to help teams locate your request across both Anthropic and Azure systems.
Read next
Every major AI coding tool just went through a pricing shift. Here are the exact numbers for Cursor, GitHub Copilot, Claude Code, Devin, and the Anthropic API - verified from live pricing pages on July 1, 2026. Fable 5 is back online today after export controls were lifted.
9 min readMillion-token context, agent teams that coordinate without an orchestrator, and benchmark scores that push the frontier. Opus 4.6 is Anthropic's biggest model drop yet.
8 min readFrom terminal agents to cloud IDEs - these are the AI coding tools worth using for TypeScript development in 2026.
8 min readTechnical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's first generally available Mythos-class model, released June 9, 2026. 1M context, 128K max output, $10/$50 pe...
View ToolAnthropic's recommended default for complex work, released May 28, 2026. 1M context, 128K output, $5/$25 per million tok...
View ToolA hosted infinite canvas your headless AI agents drive over MCP. Any MCP-speaking agent - Claude Code, Codex, Cursor, or...
View ToolAnthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolEvery coding agent in one window. Stop alt-tabbing between Claude, Codex, and Cursor.
View AppTurn a one-liner into a working Claude Code skill. From idea to installed in a minute.
View AppUnlock pro skills and share private collections with your team.
View AppConfigure Claude Code for maximum productivity -- CLAUDE.md, sub-agents, MCP servers, and autonomous workflows.
AI AgentsA concrete step-by-step guide to moving your development workflow from Cursor to Claude Code - settings, rules, keybindings, and the habits that transfer.
Getting StartedThe primary command-line entry point for Claude Code sessions.
Claude Code
Open Design: Open-Source n8n App That Turns Any Website into a Brand Kit, Design System, HTML + Images The video introduces Open Design, an MIT-licensed full-stack template that combines AI and n8n a...

Anthropic Suspends Fable 5 & Mythos 5 After US Export Control Directive (Jailbreak Concerns) Anthropic announced that the US government issued export control directives requiring it to suspend Fable ...

Claude Fable 5 Released: Benchmarks, Pricing, Availability, and Real-World Examples Anthropic has released Claude Fable 5, the first general-use “Mythos class” model, and the video reviews the announ...

Every major AI coding tool just went through a pricing shift. Here are the exact numbers for Cursor, GitHub Copilot, Cla...

Million-token context, agent teams that coordinate without an orchestrator, and benchmark scores that push the frontier....

From terminal agents to cloud IDEs - these are the AI coding tools worth using for TypeScript development in 2026.

Uber burned through its entire 2026 AI tools budget by April. Microsoft is canceling Claude Code licenses company-wide....

Claude Code fast mode pricing explained: $10/$50 per MTok on Opus 4.8, the first-enable context charge, separate rate li...

Same-day-verified llm api pricing june 2026: Claude Fable 5, GPT-5.5, Gemini 3.1 Pro, and DeepSeek V4 compared per milli...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.