xAI
@xai
Joined May 2023
5 Following    2M Followers
Humanity's Last Exam (HLE) is a rigorous intelligence benchmark featuring over 2500 problems crafted by experts in mathematics, natural sciences, engineering, and humanities. Most models score single-digit accuracy. Grok 4 and Grok 4 Heavy outperform all others.