Lately I've been testing a series of AI short videos on Xiaohongshu
They've won back the followers I lost while I was posting crypto content
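@arcprize's ARC-AGI-3 is really interesting
Humans playing these games for the first time can clear them at 100% efficiency
Meanwhile, every current frontier AI model (including GPT, Claude, Gemini, and others) still scores below 1% overall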
Announcing ARC-AGI-3
The only unsaturated agentic intelligence benchmark in the world
Humans score 100%, AI <1%
This human-AI gap demonstrates we do not yet have AGI
Most benchmarks test what models already know, ARC-AGI-3 tests how they learn