MiniMax M2.7 is now on OrcaRouter 🐋
One of the strongest open-source models available today — now accessible through a single OpenAI-compatible API.
Pricing:
Input: $0.30 / 1M tokens
Output: $1.20 / 1M tokens
Cache read: $0.06 / 1M
Cache write: $0.375 / 1M
Use @MiniMax_AI and more:
💡 @MiniMax_AI M2.7 is an open-weight LLM built for serious dev work.
It’s the first in MiniMax’s M-series to “self-evolve” via its own training + eval loop (agent harness optimization). Designed for complex coding, multi-agent systems, and pro-grade workflows.
Learn more:
🚀 @MiniMax_AI M2.7 running fastest on SambaCloud
Watch it build a browser-based OS in a single HTML file, complete with working apps like Snake, Paint, and a calculator.
Try it yourself:
🚀 @MiniMax_ai M2.7 isn’t just another coding model.
It was used to improve its own agent harness across 100+ rounds of iteration, helping drive major gains in coding, multi-agent collaboration, and real-world workflows.
Try it now:
oMLX hits 47 tokens per second on a base M2 MacBook Pro by offloading context to the SSD. We explore how native MLX features achieve 3x faster generation than LM Studio in our latest test.
Big update: just launched with MiniMax-M2.7 integration! Self-improving model crushes complex coding/Agent tasks (56%+ SWE-Pro, near Opus level) at 1/10–1/20 cost. Prompt to production app — auth, Stripe, DB, one-click deploy included.
See what we have at Qwen Conference 2026?
1,000 m² immersive exhibition. Four curated tech zones. This is a massive showcase of the entire Qwen ecosystem — from foundation models to agentic infrastructure, from Alibaba Cloud's full-stack AI services to 30+ industry benchmarks delivering real impact.
Walk through it all. And while you're here, experience Qwen Cloud — the simplest gateway to access frontier model. Let's see it live, try it live .
Scan the QR code. Visit Qwen Cloud and register for Qwen Conference now — and come witness the most immersive AI experience of 2026.
From Infrastructure to Interface: Closes the Loop.
In response to community demand, we have officially synchronized our Web Chat with our API ecosystem. The four frontier models—including GPT-5.5-Instant, DeepSeek-V3.2, MiniMax-M2.7, and GLM-5.1—are now fully accessible to all web users.
We have bridged the gap between developer-grade integration and consumer-facing interaction. Whether via API or Web, you can now experience the same production-grade reliability and reasoning consistency.
Compute without limits. Innovate without boundaries.
Start now:
从底层基建到场景应用: 正式打通全链路模型闭环。🌐
应社区用户的热切期待, 现已完成 API 与 Web Chat 的双端能力同步。此前在 API 侧首发的 GPT-5.5-Instant、DeepSeek-V3.2、MiniMax-M2.7 以及 GLM-5.1 四大顶尖模型,现已在网页端全量上线。
无论你是追求高自由度的开发者,还是深耕生产力的专业用户,现在都能在 获得一致的生产级可靠性与逻辑精度。
算力不设限,灵感无边界。
立即体验: