SELF-HOSTING

7 posts

Blog · Jul 29, 2026

Buzz: Block's Agent-Native Messaging Layer on Nostr

Block open-sourced Buzz, a team workspace where agents are cryptographic identities instead of bot tokens. Every message is a signed Nostr event, the relay is yours to run, and the CLI is JSON in, JSON out.

Agents Nostr Open Source Block Multi-Agent Systems Self-Hosting

Blog · Jul 4, 2026

Jamesob's Guide to Running SOTA LLMs Locally: The Hardware and Config That Actually Works

A detailed breakdown of jamesob's viral local LLM guide covering the $2k and $40k hardware paths, critical BIOS settings, and why most setups fail at PCIe negotiation and IOMMU.

News Hacker News Local LLM Hardware AI Infrastructure Self-Hosting

Blog · Jun 18, 2026

Local Qwen Is a Different Tool, Not a Worse Opus

Alex Ellis shares real production experience running local LLMs: $12k hardware investment, 2-3 month ROI, and why treating local models as Opus substitutes misses the point entirely.

News Hacker News Local LLM Qwen Claude Self-Hosting AI Tools

Blog · Jun 17, 2026

Cohere's North Mini Code: A 30B Open-Weight Coding Model That Runs on One H100

Cohere shipped its first developer-facing model on June 9, 2026. North Mini Code is a 30B mixture-of-experts coding model with 3B active parameters, Apache 2.0 weights, and a deployment footprint of a single H100. Here is what it actually offers and where the open questions are.

local llm coding tools open source self-hosting ai tools developer workflow

Blog · Jun 17, 2026

Self-Hosting Open-Weights Models: The Real Break-Even Math

Open weights are free to download, but inference is not free to run. Here is the honest break-even math on when self-hosting GLM-5.2, DeepSeek V4, or Llama beats paying per-token API prices: GPU rental and ownership costs, real throughput, utilization, the crossover in tokens per month, and the hidden ops bill nobody budgets for.

pricing open-weights self-hosting gpu llm-pricing cost-analysis

Blog · Jun 10, 2026

The Best Local Coding LLMs in 2026: Run Enterprise-Grade AI Without the Cloud

Choosing a local coding LLM in 2026 means balancing benchmark performance, hardware cost, and the compliance pressure to keep code off third-party servers. Here is what to run and on what hardware.

local llm coding tools self-hosting privacy ai tools developer workflow

Blog · Apr 29, 2026

Self-Hosting AI Agents: 5 Ways to Run Claude Code on Your Own Infra

Claude Code does not have to call Anthropic's API. Here are five working patterns for running it through your own gateway, on your own models, in your own VPC, with full audit logs and cost control.

Claude Code Self-Hosting DevOps AI Gateway LiteLLM Bedrock
