注册并分享邀请链接,可获得视频播放与邀请奖励。

Will Held
@WilliamBarrHeld
Open LLM Training @ Formerly ML PhD w/ @Diyi_Yang, 🦙 @AIatMeta, Assistant @GoogleAI, اللغة العربية @NYUAbuDhabi Burqueño
加入 October 2012
1.1K 正在关注    2.6K 粉丝
To train better open models, we need predictable scaling. Delphi is Marin’s first step: we pretrained many small models with one recipe, then extrapolated 300× to predict a 25B-param / 600B-token run with just 0.2% error. Getting there took some work 🧵
显示更多
0
14
455
77
转发到社区