43 items
22 posts, 21 tools
Constrained generation library for LLMs. Uses finite state machines to mask invalid tokens during generation. Guarantees schema-compliant output with zero retries.
The easiest way to run LLMs locally. One command to pull and run any model. OpenAI-compatible API. 52M+ monthly downloads. Supports GGUF, Safetensors, and custom Modelfiles.
Desktop app for discovering, downloading, and running local LLMs. Clean chat UI, OpenAI-compatible API server, and automatic GPU detection. MLX engine optimized for Apple Silicon.
Open-source OpenAI API replacement. Runs LLMs, vision, voice, image, and video models on any hardware - no GPU required. 35+ backends. Distributed mode for scaling.
DeepSeek's R1 and V3 models deliver frontier-level performance under an MIT license. Here's how to use them through the API, run them locally with Ollama, and decide when they beat closed-source alternatives.
Meta's Llama 4 family brings mixture-of-experts to open source with Scout and Maverick. Here's how to run them locally, access them through APIs, and decide when they beat the competition.
Open-source reasoning models from China. DeepSeek-R1 rivals o1 on math and code benchmarks. V3 for general use. Fully open weights. Extremely cost-effective API.
Meta's open-source model family. Llama 4 available in Scout (17B active) and Maverick (17B active, 128 experts). Free to use, modify, and deploy commercially.
Aider is open source and works with any model. Claude Code is Anthropic's commercial agent. Here is how they compare for TypeScript.
AI coding agents are submitting pull requests to open source repos - and some CONTRIBUTING.md files now contain prompt injections targeting them.
Open-source Firebase alternative built on Postgres. Auth, real-time subscriptions, storage, edge functions, and pgvector for AI embeddings. Generous free tier.
NVIDIA's Nemotron 3 Super combines latent mixture of experts with hybrid Mamba architecture - 120B total parameters, 12B active per token, 1M context, and up to 4x more experts at the same cost.
NVIDIA's Nemotron Nano 9B V2 delivers something rare: a small language model that doesn't trade capability for speed. This 9B parameter model outperforms Qwen 3B across instruction following, math,...
Rebuilding or redesigning an existing website typically means starting from scratch. You audit the content, wireframe new layouts, and spend hours translating ideas into code.
OpenAI has released its first open-weight models in over five years. GPT-OSS 12B and GPT-OSS 20B are now available under the Apache 2.0 license, marking a significant shift in strategy for the comp...
Alibaba released Qwen 3 with eight models under an Apache 2 license, including a 235B mixture-of-experts flagship that beats Llama 4 Maverick on nearly every benchmark while being smaller and cheaper to run.
Unstract is an open-source, no-code platform for extracting structured data from PDFs, invoices, scanned documents, and more. Here is how it works, how to set it up, and why automated document processing is becoming essential for organizations drowning in unstructured data.
Google's open-source coding CLI. Free tier with Gemini 2.5 Pro. Supports tool use, file editing, shell commands. 1M token context window.

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.