if you run a single 24gb gpu (a 3090, a 4090, a 7900 xtx, whatever gets you the 24 gigs), the no-brainer pick is qwen 3.6 27b dense at q4. it's not close.
i have run the models in this tier. it fits in 24gb with real context room to spare, it keeps the reasoning that smaller models lose, it pushes around 41 tok/s on a single 3090, and i watched it one-shot a playable game start to finish, zero iterations.
nothing else in that vram class does what this model does. undisputed king of the 24gb tier, and there is nothing you can say to change my mind.
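
rough napkin math on why it fits, as a sketch and not a measurement: the 4.5 bits per weight figure is my assumption for a typical q4 quant (real q4 variants differ a bit), and the function name is made up for illustration. it only counts weights, so kv cache and runtime overhead come out of whatever headroom is left.

```python
def estimate_weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GB (decimal, 1e9 bytes)."""
    # params (in billions) * bits per weight / 8 bits per byte = GB of weights
    return params_billion * bits_per_weight / 8


VRAM_GB = 24.0          # the card's total vram
PARAMS_BILLION = 27.0   # 27b dense model
BITS_PER_WEIGHT = 4.5   # assumed average for a q4 quant, not a measured value

weights_gb = estimate_weight_gb(PARAMS_BILLION, BITS_PER_WEIGHT)
headroom_gb = VRAM_GB - weights_gb  # left for kv cache, activations, overhead

print(f"weights:  ~{weights_gb:.1f} GB")
print(f"headroom: ~{headroom_gb:.1f} GB")
```

under those assumptions the weights land around 15 gb, which leaves roughly 9 gb for context. how far that stretches depends on the kv cache settings you run.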