Nemotron-3-Nano (4B), new hybrid Mamba + Attention model from NVIDIA, running locally in your browser on WebGPU.
Sentiment Mix
Geography
Expert Signals
xenovatech
author • 1 mention
r/LocalLLaMA
source • 1 mention
AI-Generated Claims
Generated from linked receipts; click sources for full context.
Nemotron-3-Nano (4B), new hybrid Mamba + Attention model from NVIDIA, running locally in your browser on WebGPU..
Supported by 1 story
I haven't seen many people talking about NVIDIA's new Nemotron-3-Nano model, which was released just a couple of days ago...
Supported by 1 story
On my M4 Max, I get \~75 tokens per second - not bad!
Supported by 1 story
It's a 4B hybrid Mamba + Attention model, designed to be capable of both reasoning and non-reasoning tasks.
Supported by 1 story
Link to demo (+ source code): [https://huggingface.co/spaces/webml-community/Nemotron-3-Nano-WebGPU](https://huggingface.co/spaces/webml-community/Nemotron-3-Nano-WebGPU)
Supported by 1 story
Related Events
Nvidia built a silent opinion engine into NemotronH to gaslight you and they're not the only ones doing it
Hardware • 3/20/2026
Nemotron Cascade 2 30B A3B
Uncategorized • 3/20/2026
From rising tensions between the Pentagon & Anthropic to Nvidia’s massive GPU deal with Amazon, the AI race is accelerating on all fronts. Micron warns of a prolonged memory shortage, OpenAI moves to unify its ecosystem into a super app, and Google’s St - LinkedIn
LLMs • 3/20/2026
Meta Compute Initiative Launches for AI Infrastructure Boost - eWeek
Hardware • 3/20/2026
Nvidia has an OpenClaw strategy. Do you?
Robotics • 3/20/2026