註冊並分享邀請連結,可獲得影片播放與邀請獎勵。

Sudo su
@sudoingX
GPU/local LLM. more RAM and OSS... everywhere
加入 August 2022
976 正在關注    29.5K 粉絲
if you run a single 24gb gpu, a 3090, a 4090, a 7900 xtx, whatever gets you the 24 gigs, the no brainer pick is qwen 3.6 27b dense at q4. not close. i have run the tier. it fits in 24gb with real context room to spare, it keeps the reasoning smaller models lose, it pushes around 41 tok/s on a single 3090, and i watched it one shot a playable game start to finish, zero iterations. nothing else in that vram class does what this model does. undisputed king of the 24gb tier, and there is nothing you can say to change my mind.
顯示更多
0
40
349
16
轉發到社區