74 items
47 posts, 27 tools
Mistral releases Leanstral 1.5, an Apache-2.0 licensed 119B parameter model (6B active) for Lean 4 theorem proving that saturates miniF2F and achieves SOTA on FATE benchmarks.
The creator of Box2D releases Box3D - an open source 3D physics engine with cross-platform determinism, SIMD contact solving, and heritage from both Box2D and Valve's Rubikon engine.
The Godot Foundation has established a policy banning autonomous AI agent code and substantial AI-generated contributions, citing reviewer burnout and concerns about maintainer mentorship.
DeepReinforce AI released Ornith-1.0, a family of open-source coding models claiming self-improvement. The HN thread reveals a mix of skepticism and genuine interest - here is what the model actually does and whether the hype holds up.
A new project proposes a graphical shell layer for SSH that turns remote servers into browsable desktops. The HN discussion digs into architecture choices, the terminology debate, and whether this solves a real problem.
ZLUDA 6 adds PhysX support, textures for Blender, and better Windows experience. The catch? Commercial funding ended, so development follows what the author finds entertaining.
Semgrep's security research team benchmarked LLMs on IDOR vulnerability detection. The open-weight GLM 5.2 beat Claude Code by 7 points at roughly one-sixth the cost.
Bumblebee is Perplexity's open source scanner for detecting compromised packages, extensions, and MCP configs on developer machines. A read-only Go binary that checks npm, PyPI, Go modules, and 10+ ecosystems against exposure catalogs - without running any install scripts. Here is how to set it up and use it.
Justin Poehnelt spent seven years at Google building open-source developer tools. His CLI went viral, hit #1 on Hacker News, and got him fired two days before Google announced their own version.
Mistral OCR 4 and Baidu's Unlimited OCR both hit Hacker News today. The useful takeaway for developers is that OCR is no longer just text extraction. It is becoming a runtime decision for document agents.
OpenMontage is trending because it treats video production like a repo-shaped agent workflow: scripts, assets, render pipelines, review loops, and coding agents working across the whole process.
A developer used OpenAI Codex to build a fully open-source WYSIWYG editor for TikZ figures. The technical approach and reception on Hacker News offer a useful case study in what agent-built software looks like when shipped.
Baidu releases Unlimited OCR, an open-source vision-language model that parses 100+ page documents in a single pass without memory blowup. Here's what developers need to know.
Switzerland's fully open foundation model promises transparent training data and EU compliance. The HN crowd has questions about actual performance.
A YC W25 startup open-sources CADAM, a browser-based tool that converts natural language to parametric OpenSCAD models. HN debate: is text-to-CAD genuinely useful or just another demo?
Cohere shipped its first developer-facing model on June 9, 2026. North Mini Code is a 30B mixture-of-experts coding model with 3B active parameters, Apache 2.0 weights, and a deployment footprint of a single H100. Here is what it actually offers and where the open questions are.
Epic Games open-sourced Lore, a centralized version control system designed for binary-heavy game projects. It uses Merkle trees, on-demand file hydration, and native chunked storage to handle terabyte-scale repos that Git struggles with.
Z.ai shipped GLM-5.2 in mid-June with a usable 1M-token context window, two thinking-effort levels, and MIT open weights now released. Here is the setup guide for Claude Code, pricing breakdown, and what to test before the benchmarks arrive.
Kimi K2.7-Code is Moonshot's open-source 1T parameter coding model with 30% fewer reasoning tokens than K2.6. Here's how to set it up with Claude Code, pricing breakdown, and honest benchmark analysis.
OpenCode is the fastest-growing open-source AI coding agent - 160K GitHub stars, 7.5M monthly users, 75+ model providers. Here is how to set it up, configure models, and use it effectively in your workflow.

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.