Mario Nawfal(@MarioNawfal ):GROK-3 MINI MADE AI HISTORY—100% ON HARDCORE REASONING TESTS Grok-3 Mini pulled off what no other model has! It aced every question on one of the toughest reasoning benchmarks out there. The test? A custom logic gauntlet packed with curveballs: * 120/120 on the “Marcus Problem” — full of shuffled sentences meant to trip up inference. * 24/24 on the “Alice+ Problem” — designed with irrelevant noise to throw models off course. * 24/24 on high-difficulty mixed challenges — where even GPT-4.5 and Gemini 2.5 Pro slip. No guessing. No trivia. Just pure, distraction-proof reasoning—Grok-3 Mini nailed it! Source: @hive

2025.04.12 15:20

GROK-3 MINI MADE AI HISTORY—100% ON HARDCORE REASONING TESTS Grok-3 Mini pulled off what no other model has! It aced every question on one of the toughest reasoning benchmarks out there. The test? A custom logic gauntlet packed with curveballs: * 120/120 on the “Marcus Problem” — full of shuffled sentences meant to trip up inference. * 24/24 on the “Alice+ Problem” — designed with irrelevant noise to throw models off course. * 24/24 on high-difficulty mixed challenges — where even GPT-4.5 and Gemini 2.5 Pro slip. No guessing. No trivia. Just pure, distraction-proof reasoning—Grok-3 Mini nailed it! Source: @hive_echo