Llama
Meta's open-source model family. Llama 4 available in Scout (17B active) and Maverick (17B active, 128 experts). Free to use, modify, and deploy commercially.
Llama is Meta's family of open-source language models and the foundation of the open-weight AI ecosystem. Llama 4 introduced mixture-of-experts with Scout (109B total, 17B active parameters) and Maverick (400B total, 17B active), delivering strong performance with efficient inference. The models are free for commercial use, which has made them the default choice for companies that need to self-host or fine-tune. The ecosystem around Llama is massive, with support in every major inference framework, fine-tuning toolkit, and deployment platform. You can run smaller variants locally through Ollama, or deploy the full models on your own GPU infrastructure. For developers who need full control over their model stack without licensing restrictions, Llama is the starting point.
Similar Tools
DeepSeek
Open-source reasoning models from China. DeepSeek-R1 rivals o1 on math and code benchmarks. V3 for general use. Fully open weights. Extremely cost-effective API.
Qwen3-Coder
Alibaba's flagship open-weight coding model. 480B total parameters, 35B active (MoE). Native 256K context, scales to 1M. Apache 2.0 license. State-of-the-art agentic coding.
DeepSeek V3.2
DeepSeek's reasoning-first model built for agents. First model to integrate thinking directly into tool use. Ships alongside V3.2-Speciale, which rivals GPT-5 and Gemini 3.0 Pro.
Claude
Anthropic's AI. Opus 4.6 for hard problems, Sonnet 4.6 for speed, Haiku 4.5 for cost. 200K context window. Best coding model I've tested. Max plan ($200/mo).
Get started with Llama
Meta's open-source model family. Llama 4 available in Scout (17B active) and Maverick (17B active, 128 experts). Free to use, modify, and deploy commercially.
Try LlamaGet weekly tool reviews
Honest takes on AI dev tools, frameworks, and infrastructure - delivered to your inbox.
Subscribe FreeMore AI Models Tools
Claude
Anthropic's AI. Opus 4.6 for hard problems, Sonnet 4.6 for speed, Haiku 4.5 for cost. 200K context window. Best coding model I've tested. Max plan ($200/mo).
ChatGPT
OpenAI's flagship. GPT-4o for general use, o3 for reasoning, Codex for coding. 300M+ weekly users. Tasks, agents, web browsing, DALL-E, code interpreter.
OpenRouter
Unified API for 200+ models. One API key, one billing dashboard. OpenAI, Anthropic, Google, Meta, Mistral, and more. Automatic fallbacks and load balancing.
Related Guides
Run AI Models Locally with Ollama and LM Studio
Install Ollama and LM Studio, pull your first model, and run AI locally for coding, chat, and automation - with zero cloud dependency.
Getting StartedModel Aliases - Claude Code
Use opus, sonnet, haiku, and best to switch models easily.
Claude CodeGetting Started with DevDigest CLI
Install the dd CLI and scaffold your first AI-powered app in under a minute.
Getting StartedRelated Posts

SAM 3.1: Realtime Video Segmentation in Apps
SAM 3.1 finally hits the latency budget for realtime video. Here is how to wire Meta's new segmentation model into a pro...

Claude Opus 4.7: The Developer's Guide to Anthropic's New Flagship
Opus 4.7 is here. Sharper coding, longer agentic runs, better tool use, and a price that finally makes Opus livable for...

Flagship: Cloudflare Feature Flags for AI Apps
Cloudflare Flagship is feature flags built for AI: model swaps, agent gates, and prompt rollouts as first-class primitiv...

Gemma 4: The Open Model Guide for Developers
Gemma 4 ships byte-for-byte open weights from Google DeepMind. How developers deploy it locally, fine-tune it, and ship...

Mercury 2 Developer Guide: Building With a Diffusion LLM in Production
A hands-on developer guide to Mercury 2 from Inception Labs. OpenAI-compatible API, reasoning levels, tool use, structur...

Vercel's Agentic Infrastructure Stack Explained
Vercel just declared the agent stack: AI Gateway, Sandbox, Flags, and Microfrontends. Here is how the four primitives co...
