antirez (@antirez) — X Web Viewer

2026.05.17 08:21

I didn't expect DeepSeek v4 PRO (not Flash) to run well on the Mac Studio M3 Ultra with 512GB of RAM. This is 2 bit quantized with the same DwarfStar recipe used for Flash. 433GB GGUF file. 130 t/s prefill, 13 t/s generation. Prefill in the video is low because small prompt.

1.1K

Forward to community

antirez@antirez

2026.05.15 10:39

I must admit that nothing about computers, since I'm in love with the field, was so uninteresting as the Javascript different fashions, waves, frameworks, rewrites, hypes. And I'm one that loves almost every shit programming related.

1.3K

Forward to community

antirez@antirez

2026.05.12 08:52

@BereznevKi20669 @ggerganov Yes I believe the real llama.cpp revolution is yet to happen at its full scale. As computers will have more RAM and models will improve, and *if* China will continue shipping large strong models with open weights, what will happen will have huge effects.

118

Forward to community

Most Popular Users