Sudo su(@sudoingX):nemotron 3 nano omni-30B reasoning at Q8 running autonomously on my dgx spark right now. 58 tok/s. 1 million context. multimodal. hermes agent is using it to research xAI's new grok algorithm that dropped yesterday. pulling repos. scanning code. breaking it down. all while i post this. 30B model. 58 tokens per second. 1M context. reads images and video. locally. for free. nobody is talking about this model and that's insane.

2026.05.16 08:11

nemotron 3 nano omni-30B reasoning at Q8 running autonomously on my dgx spark right now. 58 tok/s. 1 million context. multimodal. hermes agent is using it to research xAI's new grok algorithm that dropped yesterday. pulling repos. scanning code. breaking it down. all while i post this. 30B model. 58 tokens per second. 1M context. reads images and video. locally. for free. nobody is talking about this model and that's insane.