1 item
1 tool
C++ inference engine for LLMs. GGUF format, quantization, CPU and Metal/CUDA support. The foundation most local tools build on.
New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.