REASONING

12 items

2 posts, 9 tools, 1 guide

Blog Jun 23, 2026

VibeThinker-3B: A 3 Billion Parameter Model That Outscores Opus 4.5 on Reasoning

A new paper reports a 3B-parameter model scoring 94.3 on AIME26 and 96.1% on LeetCode contests, matching or exceeding models 100x its size. The catch: it trades general knowledge for pure reasoning ability.

News Hacker News LLMs Small Models AI Research Reasoning

Tool Jun 11, 2026

Claude Fable 5

Anthropic's first generally available Mythos-class model, released June 9, 2026. 1M context, 128K max output, $10/$50 per million tokens. Built for long-horizon agentic work.

ai model anthropic mythos-class agents 1m-context reasoning

Blog Apr 29, 2026

Extended Thinking in Claude: When Deep Reasoning Pays For Itself

A production guide to Claude's extended thinking mode. Real cost math, TypeScript SDK code, and the tasks where reasoning tokens are worth 3x the spend.

Claude API Anthropic SDK Extended Thinking Reasoning Cost Optimization

Tool Apr 29, 2026

Claude Opus 4.7

Anthropic's flagship reasoning model. Best-in-class for coding, long-context analysis, and agentic workflows. 1M token context window. Available via API and in Claude Code.

model anthropic reasoning coding long-context

Tool Apr 23, 2026

DeepSeek V3.2

DeepSeek's reasoning-first model built for agents. First model to integrate thinking directly into tool use. Ships alongside V3.2-Speciale, which rivals GPT-5 and Gemini 3.0 Pro.

ai model open-source deepseek reasoning agents tool-use

Guide Apr 23, 2026

Effort Levels - Claude Code

The five effort levels (low, medium, high, xhigh, and max) for adaptive reasoning control.

Tool Apr 1, 2026

Gemini

Google's frontier model family. Gemini 2.5 Pro has 1M token context and top-tier coding benchmarks. Gemini 3 Pro pushes reasoning further. Free tier via AI Studio.

ai model google reasoning coding 1m-context multimodal

Tool Apr 1, 2026

Grok

xAI's model with real-time X/Twitter data access. Grok 3 rivals top models on reasoning. Built-in web search and current events awareness. Available via API.

ai model xai real-time reasoning web-search

Tool Mar 28, 2026

GPT-5

OpenAI's latest flagship model. Major leap in reasoning, coding, and instruction following over GPT-4o. Powers ChatGPT Plus/Pro and is available via the API.

ai model openai reasoning coding flagship

Tool Mar 25, 2026

DeepSeek

Open-source reasoning models from China. DeepSeek-R1 rivals o1 on math and code benchmarks. V3 for general use. Fully open weights. Extremely cost-effective API.

ai model open-source reasoning coding cost-effective

Tool Mar 22, 2026

ChatGPT

OpenAI's flagship. GPT-4o for general use, o3 for reasoning, Codex for coding. 300M+ weekly users. Tasks, agents, web browsing, DALL-E, code interpreter.

ai model openai chat agents reasoning

Tool

Claude

Anthropic's AI. Opus 4.6 for hard problems, Sonnet 4.6 for speed, Haiku 4.5 for cost. 200K context window. Best coding model I've tested. The Max plan costs $200/mo.

ai model anthropic reasoning coding 200k-context
