DeepSeek v4 works fine, but it’s not the frontier-pressing moment we saw with Kimi 2.6. On Notion eval data, it performs similarly to GPT 5.2, with understandable failings.
Most interesting: it doesn’t scale well. It’s ridiculously slow. Across multiple major, trusted, performant US inference providers, we see it run 15x slower than GPT 5.2 and 2x slower than Opus 4.7, a problem Kimi never had.
Curious whether it’s a fundamental architecture issue or just a matter of time until inference providers make it work. Doesn’t seem urgent either way if Kimi can outperform it. Cheaper maybe, but not groundbreaking.
DeepSeek V4 Pro and Flash now available in Go
We rushed to get this released and are still working out the capacity and usage limits
Thanks to the @deepseek_ai team for the PRs and fixes
Latest open artifacts (#21): Open model bonanza! Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1 & others. On CAISI's V4 assessment.
An eventful month with one flagship release after another
The RouterLink model ecosystem just hit a new milestone 🔗
68 models live. 9 providers. All in one place 🤯
Newly integrated:
➡️ GPT-5.5
➡️ GPT-5.4 Mini
➡️ Grok 4.3
➡️ DeepSeek V4 Pro
➡️ DeepSeek V4 Flash
Plus Amazon Bedrock joins as a new provider — more infrastructure diversity, more resilience.
Bonus: Gemini series now 25% off — same models, lower cost ⚡
One API key. Every frontier model. Built to scale 🧠
#WORLD3 #WAI #AIagent #RouterLink
🚨 OPEN SOURCE AI IS LITERALLY UNSTOPPABLE 🚨
The legendary creator of Redis (Antirez) just dropped ds4 - a custom native inference engine built specifically for DeepSeek v4 Flash
This is earth shattering! Here is why:
DeepSeek v4 Flash is a quasi-frontier model with a massive 1M context window
You can now run it LOCALLY on a 128GB Mac using specialized 2-bit quantization
The architecture is reimagined: he moved the KV cache from RAM directly to the SSD! 🤯
We already know DeepSeek v4 Flash is insanely good for agentic loops - Now you don't even need the cloud to run it
Closed-source labs are burning tens of billions on massive GPU clusters while single brilliant developers are running frontier-level AI on laptops!
They told us open-source would be worthless against trillion-dollar monopolies
Instead, pure hacker culture + incredible open-weight models are completely rewriting the rules
Open Source will ALWAYS win 💕
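For anyone wondering why the 2-bit quantization matters for fitting on a 128GB Mac, here is a rough back-of-the-envelope sketch. The parameter count below is a made-up placeholder, not DeepSeek v4 Flash's actual size (the post doesn't give one), and the function name is purely illustrative. It only counts raw weights and ignores the KV cache, activations, and per-block quantization overhead such as scales.

```python
def weight_footprint_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the weights alone, in GiB."""
    total_bytes = num_params * bits_per_weight / 8  # 8 bits per byte
    return total_bytes / 2**30                      # bytes -> GiB


# ASSUMPTION: hypothetical parameter count, chosen only for illustration.
HYPOTHETICAL_PARAMS = 300e9
RAM_BUDGET_GIB = 128

for bits in (16, 8, 4, 2):
    size = weight_footprint_gib(HYPOTHETICAL_PARAMS, bits)
    verdict = "fits" if size < RAM_BUDGET_GIB else "does not fit"
    print(f"{bits:>2}-bit weights: {size:7.1f} GiB ({verdict} in {RAM_BUDGET_GIB} GiB)")
```

Swap in the real parameter count to see where the cutoff lands; the point is just that weight memory scales linearly with bits per weight, so going from 16-bit to 2-bit cuts it by 8x.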
Best models - May Edition
Coding - GPT 5.5 xHigh
Seeking truth - Grok 4.3
Video - SeeDance 2.0
Image - GPT Image 2.0
Voice - Gemini Live
Hermes - m2.7
Cheap coding - Kimi 2.6
Cheap fast - Gemini Flash
Best open source - DeepSeek v4
Pretty much everything will change after Google I/O
Smart Studio: Self-host the latest AI 🚀
Stop jumping between platforms. Everything you need to test and serve models is now in one place:
✅ Instant SOTA Access: Run Qwen3.6-Max, DeepSeek-v4, and the latest models the moment they drop.
✅ Full Multimodal Support: Access multimodal models plus image and video generation models.
✅ Visual Model Lab: Compare open vs. closed-source outputs side-by-side.
✅ HF-to-API in Minutes: Turn a Hugging Face model into a live API in minutes.
#AlibabaCloud #SmartStudio #ModelExploration #GenAI #AInnovation #LLM