Qwen 3 Coder: Alibaba's Coding-Optimized LLM

The New Open-Source Standard for Coding LLMs

Alibaba's Qwen team has released Qwen 3 Coder, a 480-billion-parameter mixture-of-experts model that sets a new bar for open-source coding assistants. With 35 billion active parameters and support for context windows scaling to one million tokens, this model doesn't just compete with proprietary alternatives—it beats them on several key benchmarks.

Benchmark comparison showing Qwen 3 Coder vs Claude 4 Sonnet and Kimi K2

The numbers tell a clear story. On TerminalBench, Qwen 3 Coder outperforms Claude 4 Sonnet. On SWE-bench Verified, it scores 69.6 against Claude 4's 70.4—functionally a tie. Agentic browser use is nearly identical between the two models, and while Qwen 3 Coder trails slightly on agentic tool use, it remains within striking distance. Perhaps most telling is the comparison to Kimi K2, which scored 65.4 on SWE-bench: Qwen 3 Coder clears that bar with room to spare.

This represents a dramatic acceleration in capability. Just months ago, DeepSeek R1 was the benchmark everyone discussed. Now an open model matches or exceeds Claude 4 Sonnet across most coding tasks.

Architecture and Training at Scale

Qwen 3 Coder was trained on 7.5 trillion tokens, 70% of which were code-specific. The team employed synthetic data generation to filter noisy training data, significantly improving overall data quality. The model natively supports 256,000 tokens but extends to one million using YaRN extrapolation—optimized specifically for repository-scale coding and dynamic data like pull requests.
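Serving stacks that support YaRN typically enable the long-context mode through a `rope_scaling` entry in the model configuration. A hedged illustration only: the field names below follow the common Hugging Face convention, and the scaling factor is an example chosen to stretch a 256K native window toward one million tokens, not a value confirmed for this model.

```json
{
  "rope_scaling": {
    "rope_type": "yarn",
    "factor": 4.0,
    "original_max_position_embeddings": 262144
  }
}
```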

Architecture diagram showing MoE structure and token routing

Unlike models optimized for competitive programming puzzles, Qwen 3 Coder focuses on real-world software engineering tasks suited for execution-driven reinforcement learning. The team scaled code RL training across a broad spectrum of practical coding scenarios rather than cherry-picking benchmark-friendly problems.

The post-training pipeline introduces long-horizon reinforcement learning to handle multi-turn interactions with development environments. Training an agentic coding model requires massive environmental scale—Alibaba spun up 20,000 independent environments running in parallel across their cloud infrastructure. This setup provided the feedback loops necessary for large-scale RL and supported evaluations at scale. The result: state-of-the-art performance among open-source models on SWE-bench and related benchmarks.
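The core idea behind execution-driven RL is simple: instead of grading code by how it looks, run it and reward what passes. A minimal sketch of such a reward function (the function name and the pass/fail reward scheme are illustrative, not Alibaba's actual pipeline):

```python
import subprocess
import sys
import tempfile

def execution_reward(candidate_code: str, test_code: str) -> float:
    """Run a model-generated candidate against unit tests in a subprocess.

    Returns 1.0 if every test passes, 0.0 otherwise -- the kind of sparse,
    execution-grounded signal that scales across many parallel sandboxes.
    """
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(candidate_code + "\n" + test_code + "\n")
        path = f.name
    try:
        result = subprocess.run(
            [sys.executable, path], capture_output=True, timeout=30
        )
    except subprocess.TimeoutExpired:
        return 0.0  # hung or looping code earns no reward
    return 1.0 if result.returncode == 0 else 0.0

good = "def add(a, b):\n    return a + b"
bad = "def add(a, b):\n    return a - b"
tests = "assert add(2, 3) == 5"
print(execution_reward(good, tests))  # 1.0
print(execution_reward(bad, tests))   # 0.0
```

In production this loop runs inside isolated environments so that arbitrary generated code cannot touch the host; the 20,000 parallel environments mentioned above exist to keep that feedback cheap and fast at training scale.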

Speed and Tooling

While hybrid reasoning and test-time compute dominate headlines, Qwen 3 Coder prioritizes inference speed—a critical factor when running inside AI IDEs or agentic coding tools. Fast feedback loops matter when you're iterating on code.

Alongside the model, Alibaba released Qwen Code, a CLI tool forked from Gemini CLI but customized with specialized prompts and function-calling protocols designed specifically for Qwen 3 Coder. The tool handles agentic coding tasks out of the box.

Integration extends beyond Alibaba's official tooling. Qwen 3 Coder works with:

  • Cline and similar AI coding assistants
  • Claude Code (using Alibaba Cloud Model Studio API keys)
  • Any IDE supporting custom base URLs and model strings
  • OpenRouter and other third-party providers
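Because these integrations all speak the OpenAI-compatible protocol, wiring the model in is mostly a matter of the base URL and model string. A minimal sketch of building a standard chat-completion payload (the base URL and model identifier shown assume OpenRouter's conventions; substitute your provider's values):

```python
# Any OpenAI-compatible endpoint works; these two values are the only
# provider-specific parts. Both are assumptions for illustration.
BASE_URL = "https://openrouter.ai/api/v1"
MODEL = "qwen/qwen3-coder"

def build_chat_request(prompt: str) -> dict:
    """Assemble a standard OpenAI-style chat completion payload."""
    return {
        "model": MODEL,
        "messages": [
            {"role": "system", "content": "You are a coding assistant."},
            {"role": "user", "content": prompt},
        ],
        "temperature": 0.2,
    }

payload = build_chat_request("Write a function that reverses a linked list.")
# POST this payload to f"{BASE_URL}/chat/completions" with your API key
# in an "Authorization: Bearer ..." header, using any HTTP client.
print(payload["model"])  # qwen/qwen3-coder
```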

Getting Started

The fastest way to test Qwen 3 Coder is through the official web interface at chat.qwen.ai. The platform offers free access with an artifacts feature that renders generated web applications directly in the browser—useful for quickly prototyping 3D visualizations, physics simulations, or interactive demos.

Example of generated web app with 3D physics simulation

For local CLI usage:

npm install -g @qwen-code/qwen-code

Then configure your API key from OpenRouter, Alibaba Cloud, or another provider by setting the base URL and model identifier to point at Qwen 3 Coder.
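In practice that configuration usually comes down to a few environment variables. A hedged sketch: the variable names below follow the common OpenAI-compatible convention, and the key, URL, and model string are placeholders to replace with your provider's values.

```shell
# Point the CLI at an OpenAI-compatible endpoint serving Qwen 3 Coder.
export OPENAI_API_KEY="sk-..."                          # your provider key
export OPENAI_BASE_URL="https://openrouter.ai/api/v1"   # provider endpoint
export OPENAI_MODEL="qwen/qwen3-coder"                  # provider model string
```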

To use with Claude Code, obtain an API key from Alibaba Cloud Model Studio, install Claude Code, and configure the proxy URL and OAuth token. Cline users can similarly swap in the model through its provider configuration.

What This Means for Developers

Qwen 3 Coder arrives at a moment when open-source models are closing the gap with proprietary alternatives faster than expected. The model's strength on SWE-bench—a benchmark requiring multi-turn planning, tool use, and environment interaction—suggests it handles real software engineering workflows, not just code completion.

Agentic workflow showing multi-turn RL training environment

The combination of competitive performance, million-token context windows, and permissive open licensing gives teams a viable alternative to closed APIs for agentic coding workflows. Whether you're building automated devtools, running an AI-powered IDE, or experimenting with code generation agents, Qwen 3 Coder deserves evaluation.

The rapid progression from DeepSeek R1 to Kimi K2 to Qwen 3 Coder—each leapfrogging the previous state of the art within months—suggests the pace of improvement in coding models isn't slowing. If anything, it's accelerating.


Watch the Video

<iframe width="100%" height="415" src="https://www.youtube.com/embed/gqzsFWZe0Iw" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>