Mario Nawfal
@MarioNawfal
Largest Show on X | Founder @ibcgroupio | Investor 600+ Startups
Joined October 2020
46.7K Following    2.2M Followers
GROK-3 MINI MADE AI HISTORY—100% ON HARDCORE REASONING TESTS Grok-3 Mini pulled off what no other model has! It aced every question on one of the toughest reasoning benchmarks out there. The test? A custom logic gauntlet packed with curveballs: * 120/120 on the “Marcus Problem” — full of shuffled sentences meant to trip up inference. * 24/24 on the “Alice+ Problem” — designed with irrelevant noise to throw models off course. * 24/24 on high-difficulty mixed challenges — where even GPT-4.5 and Gemini 2.5 Pro slip. No guessing. No trivia. Just pure, distraction-proof reasoning—Grok-3 Mini nailed it! Source: @hive_echo
Show more
0
459
2.8K
569