Large language models - benchmarks, capabilities, and how to choose the right one.
15 resources - 5 posts, 9 tools, 1 guide

DeepSeek's R1 and V3 models deliver frontier-level performance under an MIT license. Here's how to use them through the API, run them locally with Ollama, and decide when they beat closed-source alternatives.

Meta's Llama 4 family brings mixture-of-experts to open source with Scout and Maverick. Here's how to run them locally, access them through APIs, and decide when they beat the competition.

Claude Opus 4.6 vs GPT-5.3 for real TypeScript work. Benchmarks, pricing, context windows, and practical differences.
A developer's comparison of OpenAI and Anthropic ecosystems - models, coding tools, APIs, pricing, and which to choose for different use cases.

NVIDIA's Nemotron 3 Super combines latent mixture of experts with hybrid Mamba architecture - 120B total parameters, 12B active per token, 1M context, and up to 4x more experts at the same cost.
Anthropic's AI. Opus 4.6 for hard problems, Sonnet 4.6 for speed, Haiku 4.5 for cost. 200K context window. Best coding model I've tested. Max plan ($200/mo).
AI ModelsOpenAI's flagship. GPT-4o for general use, o3 for reasoning, Codex for coding. 300M+ weekly users. Tasks, agents, web browsing, DALL-E, code interpreter.
AI ModelsUnified API for 200+ models. One API key, one billing dashboard. OpenAI, Anthropic, Google, Meta, Mistral, and more. Automatic fallbacks and load balancing.
AI ModelsOpen-source reasoning models from China. DeepSeek-R1 rivals o1 on math and code benchmarks. V3 for general use. Fully open weights. Extremely cost-effective API.
AI ModelsMeta's open-source model family. Llama 4 available in Scout (17B active) and Maverick (17B active, 128 experts). Free to use, modify, and deploy commercially.
AI ModelsEuropean open-weight models. Mistral Large for complex tasks, Mistral Small for speed, Codestral for code. Strong multilingual support. Open and API options.
AI ModelsOpenAI's latest flagship model. Major leap in reasoning, coding, and instruction following over GPT-4o. Powers ChatGPT Plus/Pro and the API. Available via API and ChatGPT.
AI ModelsGoogle's frontier model family. Gemini 2.5 Pro has 1M token context and top-tier coding benchmarks. Gemini 3 Pro pushes reasoning further. Free tier via AI Studio.
AI ModelsxAI's model with real-time X/Twitter data access. Grok 3 rivals top models on reasoning. Built-in web search and current events awareness. Available via API.
AI ModelsNew tutorials, open-source projects, and deep dives on coding agents - delivered weekly.
Explore 149 topics
Browse All Topics