atomic.chat
@atomic_chat_hq
Free Local AI Chat. Enhanced by Google Turbo Quant.
Joined March 2026
13 Following    5.7K Followers
Qwen 3.7-Max beats Opus 4.7 and GPT-5.5

We tested three frontier models on a real agentic task: write a Tetris bot that plays the game and trains itself. Each model could read its own code, run benchmarks, and rewrite itself across 10 iterations. Then we compared the final bots head to head.

Qwen 3.7-Max: training cost $1.32, bot improvement +56%
Claude Opus 4.7: training cost $12.15, bot improvement +28%
GPT-5.5: training cost $2.85, bot improvement +7%

Qwen won on every dimension: biggest jump, about 9× cheaper than Claude, about 2× cheaper than GPT. Long agentic loops are where Qwen Max actually delivers.
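
The "9×" and "2×" figures can be checked from the quoted costs. The sketch below is only arithmetic on the numbers in the post; the helper names and the points-per-dollar metric are illustrative assumptions, not something the post reports.

```python
# Figures quoted in the post: (training cost in USD, bot improvement in percent).
results = {
    "Qwen 3.7-Max": (1.32, 56),
    "Claude Opus 4.7": (12.15, 28),
    "GPT-5.5": (2.85, 7),
}


def cost_ratio(model: str, baseline: str = "Qwen 3.7-Max") -> float:
    """How many times the model's training cost exceeds the baseline's."""
    return results[model][0] / results[baseline][0]


def improvement_per_dollar(model: str) -> float:
    """Improvement points per dollar spent (hypothetical metric, not from the post)."""
    cost, gain = results[model]
    return gain / cost


for name in results:
    print(
        f"{name}: {cost_ratio(name):.1f}x Qwen's cost, "
        f"{improvement_per_dollar(name):.1f} points per dollar"
    )
```

On these numbers Claude's run cost roughly 9.2 times Qwen's and GPT's roughly 2.2 times, which matches the rounded claims in the post.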