Arena.ai
@arena
Where AI meets the real world. Formerly LMArena. We measure and advance the frontier of AI through community-driven evaluation. We’re hiring → https://t.co/XBZCrseaWF
212 Following    156.7K Followers
Kimi K2.6 is the new SOTA open model in Vision and Document Arena, with solid gains since Kimi K2.5: - #1# open on Vision Arena (#15# overall), +14 over #2# Kimi K2.5 (Thinking) - #1# open on Document Arena (#8# overall), +9 over K2.5 and on par with proprietary models like Muse Spark and Gemini 3.1 Pro. Huge congrats again to the @Kimi_Moonshot team on the open source progress!
Show more
Exciting news - GPT-Image-2 by @OpenAI has claimed the #1# spot across all Image Arena leaderboards! A clean sweep with a record-breaking +242 point lead in Text-to-Image - the largest gap we’ve seen to date. - #1# Text-to-Image (1512), +242 over #2# (Nano-banana-2 with web-search aka gemini-3.1-flash-image) - #1# Single-Image Edit (1513), +125 over #2# (Nano-banana-pro aka gemini-3-pro-image) - #1# Multi-Image Edit (1464), +90 over #2# (Nano-banana-2) No model has dominated Image Arena with margins this wide. Huge congratulations to @OpenAI on this major breakthrough in image generation! More performance breakdowns by category in the thread below.
Show more
0
209
5.8K
638
Forward to community
🚨 Top 10 Open Models in January: Text Arena Looking back last month, here are the rankings by provider for January: 🥇 #1# Kimi-K2.5-Thinking by @Kimi_Moonshot (Modified MIT) 🥈 #2# GLM-4.7 by @Zai_org (MIT) 🥉 #3# Qwen3-235b-a22b-instruct-2507 by @Alibaba_Qwen (Apache 2.0) Compared to December, the ranks have shifted with new variants, but the top labs have not changed. The top 5 open models all score above 1400. Will we see our first 1500 breakthroughs this year? See more details around the climbers and movers for January in thread 🧵
Show more