Larry Dial
@classiclarryd
Technical Staff at Open Athena, working on Marin
Joined May 2024
35 Following    1.7K Followers
Researchers' brilliant ideas often get lost in a sea of endless SOTA claims made against weak baselines. At Marin we battle-test ideas in an open arena, where anyone's idea can be promoted to the next hero run. One that recently rose up was @Jianlin_S's MoE Quantile Balancing, used in our last 1e22 run and in our ongoing 130B run. Animated visuals of how QB performed are available on the OpenAthena blog.