Elon Musk
@elonmusk
Joined June 2009
1.1K Following    219.3M Followers
Soon, AI will far exceed the best humans in reasoning
GROK-3 MINI MADE AI HISTORY: 100% ON HARDCORE REASONING TESTS

Grok-3 Mini pulled off what no other model has! It aced every question on one of the toughest reasoning benchmarks out there.

The test? A custom logic gauntlet packed with curveballs:
* 120/120 on the “Marcus Problem”, full of shuffled sentences meant to trip up inference.
* 24/24 on the “Alice+ Problem”, designed with irrelevant noise to throw models off course.
* 24/24 on high-difficulty mixed challenges, where even GPT-4.5 and Gemini 2.5 Pro slip.

No guessing. No trivia. Just pure, distraction-proof reasoning. Grok-3 Mini nailed it!

Source: @hive_echo