Lysandre
@LysandreJik
Chief Open-Source Officer (COSO) at Hugging Face
Joined March 2019
636 Following    12K Followers
Great to see inference engines starting to leverage kernels on the Hub, in this case sglang. It's probably the easiest and fastest way to install flash attention and other specialized kernels right now.