Subbu
@subbdue
I develop AI inference Chips/SoCs for a living. Read my work and subscribe to my newsletter at
Joined June 2009
31 Following    604 Followers
Add on a few more months before improved kernels and meaningful MFU. I write about why this is the case in my deep dive: The Uncomfortable Truth Behind Deploying the Latest NVIDIA GPUs: MFU, Silent Data Corruption -
IMPORTANT: the hardware in the CoreWeave & Microsoft photos is still Engineering/Quality Samples, and there is still some time before the software stack bring-up finishes & the first production tokens are generated. The VR200 & MI455 rack metric to watch is time to first at-scale production token, TTF-(ASP)-T. You can clearly see in the CW rack photos that none of the scale-out 800G OSFP cages are even populated.