Next week is going to be crucial
Will Google pull ahead of OpenAI or vice versa?
GPT 5.6 or Gemini 3.2
Who will win?
Gemini 3.2 Flash - Capitalizing on DeepMind's clever distillation techniques...
Rumors are that benchmarks show it's hitting 92% of GPT 5.5's performance on coding and reasoning tasks while being 15-20x cheaper on inference costs. The latency improvements are insane - sub-200ms for most queries.
Google's distillation + sparsity techniques are paying off massively. They've essentially compressed a frontier model into a flash variant without the usual quality cliff.
Show more
Open-source AI is ruthlessly out-innovating the trillion-dollar monopolies. 🚀
Big labs are burning billions brute-forcing AGI on massive GPU clusters. Meanwhile, the open ecosystem is structurally forced to innovate on inference—and it's working.
Look at what just happened:
- DeepSeek v4 using SSDs for KV cache.
- Breakthroughs like TurboQuant and Kimi K2 are aggressively compressing memory and driving the cost of intelligence to near zero.
When you don't have infinite compute, you actually have to engineer better solutions.
Constraints breed miracles. By solving the KV cache bottleneck, scrappy open-source builders are creating vastly cheaper and more profitable AI than the bloated closed-source giants.
Hacker culture > GPU monopolies. Period.
Show more
Opus 4.7 is released in fast mode...
Will pass on it - it's NOT a great model and is insanely expensive
In the meantime, we finally have DeepSeek flash working on for a real-world use-case
Flash 3.2 is more or less confirmed for Google I/O
We are already using Flash instead of GPT 5.5 low in 70% of our scheduled jobs
It will be HUGE if Gemini Flash 3.2 can totally Replace GPT 5.5 low
Gemini Omni is coming...
A supremely advanced video model that can do really fancy video editing and understands the world better
Google I/O hype train just left the station 😄
Human Programmers Will Stick Around….
While you can totally vibe code an app written by AI from scratch
It’s a total nightmare to use AI coding on a complex human generated code base
Humans typically generate more complexity and tech debt than AI can handle
We still desperately need human programmers 😅
Show more
US investors need to fund a dozen startups with $1B each immediately
Their mission - start playing and winning in the open source arena ASAP
They will all end up being $0.5T to $1T companies
It’s no longer a given that the next generation model will be better
- Opus 4.7 is legit worse than 4.6
- Gemini 3.1 worse than 2.5
- Sonnet 4.6 buggier than 4.5
The SOTA models are beginning to run around in circles
Show more
Google I/O will be a critical point in the evolution of the company
Either Gemini models deliver on multiple fronts…..
Or Google becomes a data center and compute seller 😲
🚨 OPEN SOURCE AI IS LITERALLY UNSTOPPABLE 🚨
The legendary founder of Redis (Antirez) just dropped ds4 - a custom native inference engine built specifically for DeepSeek v4 Flash
This is earth shattering! Here is why:
DeepSeek v4 Flash is a quasi-frontier model with a massive 1M context window
You can now run it LOCALLY on a 128GB Mac using specialized 2-bit quantization
The architecture is reimagined—he moved the KV cache from RAM directly to the SSD disk! 🤯
We already know DeepSeek v4 Flash is insanely good for agentic loops - Now you don't even need the cloud to run it
Closed-source labs are burning tens of billions on massive GPU clusters while single brilliant developers are running frontier-level AI on laptops!
They told us open-source would be worthless against trillion-dollar monopolies
Instead, pure hacker culture + incredible open-weight models are completely rewriting the rules
Open Source will ALWAYS win 💕
Show more
Best models - May Edition
Coding - GPT 5.5 xHigh
Seeking truth - Grok 4.3
Video - SeeDance 2.0
Image - GPT Image 2.0
Voice - Gemini Live
Hermes - m2.7
Cheap coding - Kimi 2.6
Cheap fast - Gemini Flash
Best open source - DeepSeek v4
Pretty much everything will change after Google I/O
Show more
And they said open-source AI would be worthless!!
All of these companies will 5-10x in 1 year
DeepSeek Pro and Flash continue to be extremely underrated.
Both are excellent choices for easy agentic loops
20x cheaper than GPT 5.5 and just as good
Running loops burning infinite tokens a day leads to extreme
brain rot
Once AI has generated 10k lines of code, engineers have zero idea of what is going on
The bugs multiply, AI debt spins out of control and uptime drops
Show more
Google I/O Predictions
- new video model, Veo 3.5
- Nano Banana 3
- Flash 3.2
- Gemini Pro 3.5
Gemini beats GPT 5.5 at coding
Gemini 3.1 Flash Lite Preview is now Gemini 3.1 Flash Lite.... 🤷
Not the earth shattering new release we are all expecting
I think that is still coming - next week
🚨 RIP FIGMA - CREATE PIXEL PERFECT DESIGNS AND MOCK-UPs
Excited for Abacus AI's Design Vertical
It's almost jaw dropping - create pixel perfect high fidelity mock-ups from prompts and sketches
The one-click from the mock-ups to finished apps
Show more
In the midst of all this extreme OpenAI drama
WHERE IS ILYA?