MAMBA

2 items

2 posts

BlogApr 29, 2026

NVIDIA Nemotron 3 Super: A Developer's Guide to the 120B Hybrid MoE

A practical walkthrough of Nemotron 3 Super: latent mixture of experts, hybrid Mamba transformer architecture, 1M context, reasoning modes, and the code you actually need to run it on NVIDIA hardware.

NVIDIA Nemotron MoE Mamba Open Source AI Models Triton Transformers

BlogMar 13, 2026

NVIDIA's Nemotron 3 Super in 6 Minutes

NVIDIA's Nemotron 3 Super combines latent mixture of experts with hybrid Mamba architecture - 120B total parameters, 12B active per token, 1M context, and up to 4x more experts at the same cost.

NVIDIA Nemotron MoE Mamba Open Source AI Models

Get Smarter About AI Dev

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.

One email per weekReal code, not theoryFree forever

Browse All Tags