NVIDIA AI(@NVIDIAAI ):Love it. Well done.

NVIDIA AI

@NVIDIAAI

Teaching your AI new tricks.

Joined June 2016

853 Following 290.6K Followers

NVIDIA AI@NVIDIAAI

2026.05.07 16:23

Love it. Well done.

stevibe@stevibe

2026.05.07 09:36

Google dropped MTP versions of Gemma4. Ran them on my DGX Spark. The 31B dense model went from 3.94 → 8.91 tok/s. That's +126%. Full results: [26B A4B] > 25.24 → 31.69 tok/s (+25.6%) > TTFT 755 → 332ms (-56%) [31B] > 3.94 → 8.91 tok/s (+126%) > TTFT 599 → 378ms (-37%) If you're not running MTP, you're leaving free perf on the table.

Show more

0

0

6

133

12

Forward to community

Most Popular Users

BBC News 中文

5.2M Followers

240M Followers

71.3M Followers

李老师不是你老师

@whyyoutouzhele

2.2M Followers

National Geographic

27.8M Followers

15.1M Followers

25.7M Followers

216.8K Followers

813K Followers

106.8M Followers

41.7M Followers

37.9M Followers

20.4M Followers

20M Followers

29M Followers