LOCAL AI

13 items

8 posts, 5 tools

BlogJun 23, 2026

GLM-5.2 Local Deployment: Running Z.ai's 744B Model on Consumer Hardware

Unsloth's dynamic quantization makes GLM-5.2 runnable on a 256GB Mac or a 24GB GPU with CPU offloading. Here is the hardware math, the quantization tradeoffs, and what the HN community learned from actually running it.

News Hacker News LLMs Open Weights Local AI Quantization

BlogJun 10, 2026

DiffusionGemma: Google Bets Diffusion Can Make Text Generation 4x Faster

Google released DiffusionGemma today, a 26B MoE open model that generates entire 256-token blocks in parallel instead of one token at a time. Here is what that means for latency, local inference, and the post-autoregressive landscape.

ai open-source inference local-ai google diffusion-models

BlogMay 7, 2026

What Is Cline? The Open-Source AI Coding Tool That Runs in VS Code

Cline is a free, open-source VS Code extension that brings autonomous AI coding to your editor. It works with local models or cloud APIs, handles multi-file changes, and runs terminal commands without proprietary lock-in.

AI Coding VS Code Open Source Developer Tools Local AI

BlogMay 2, 2026

Client-Side Tool Calling Is the Privacy Pattern AI Apps Need

A Show HN PDF form demo points at a bigger architecture shift: keep sensitive documents local, expose narrow browser tools to the model, and make AI assistance inspectable.

AI Agents Privacy Tool Calling Local AI Developer Architecture

ToolApr 9, 2026

Ollama

The easiest way to run LLMs locally. One command to pull and run any model. OpenAI-compatible API. 52M+ monthly downloads. Supports GGUF, Safetensors, and custom Modelfiles.

local-ai llm cli open-source self-hosted privacy

ToolApr 9, 2026

LM Studio

Desktop app for discovering, downloading, and running local LLMs. Clean chat UI, OpenAI-compatible API server, and automatic GPU detection. MLX engine optimized for Apple Silicon.

local-ai llm desktop gui apple-silicon open-source

ToolApr 9, 2026

Jan

Open-source ChatGPT alternative that runs 100% offline. Desktop app with local models, cloud API connections, custom assistants, and MCP integration. AGPLv3 licensed.

local-ai llm desktop open-source privacy offline mcp

ToolApr 9, 2026

GPT4All

Private local AI chatbot by Nomic. 250K+ monthly users, 65K GitHub stars. LocalDocs feature lets you chat with your own files. Runs on Windows, macOS, and Linux.

local-ai llm desktop privacy localdocs nomic

ToolApr 9, 2026

LocalAI

Open-source OpenAI API replacement. Runs LLMs, vision, voice, image, and video models on any hardware - no GPU required. 35+ backends. Distributed mode for scaling.

local-ai llm open-source self-hosted api-compatible multimodal

BlogMar 26, 2026

DeepSeek R1 and V3: The Developer's Guide to Open-Source AI

DeepSeek's R1 and V3 models deliver frontier-level performance under an MIT license. Here's how to use them through the API, run them locally with Ollama, and decide when they beat closed-source alternatives.

DeepSeek Open Source AI Models Local AI

BlogMar 26, 2026

Llama 4: The Complete Developer's Guide to Meta's Open Source Models

Meta's Llama 4 family brings mixture-of-experts to open source with Scout and Maverick. Here's how to run them locally, access them through APIs, and decide when they beat the competition.

Llama Meta Open Source AI Models Local AI

BlogAug 26, 2025

NVIDIA Nemotron Nano 9B V2: Local AI That Punches Up

NVIDIA's Nemotron Nano 9B V2 delivers something rare: a small language model that doesn't trade capability for speed. This 9B parameter model outperforms Qwen 3B across instruction following, math,...

NVIDIA Nemotron Local AI Open Source

BlogJan 9, 2025

Microsoft PHI-4: A 14B Parameter Model That Rivals Models 5x Its Size

Microsoft's PHI-4 is an MIT-licensed 14 billion parameter model that matches Llama 3.3 70B and Qwen 2.5 72B on key benchmarks. Here is what makes it special, how to run it locally, and why small language models are increasingly practical for real development work.

Microsoft PHI-4 Open Source AI LLM Ollama Local AI

Get Smarter About AI Dev

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.

One email per weekReal code, not theoryFree forever

Browse All Tags