Gemini 3.2 Flash - Capitalizing on DeepMind's clever distillation techniques...
Rumors are that benchmarks show it's hitting 92% of GPT 5.5's performance on coding and reasoning tasks while being 15-20x cheaper on inference costs. The latency improvements are insane - sub-200ms for most queries.
Google's distillation + sparsity techniques are paying off massively. They've essentially compressed a frontier model into a flash variant without the usual quality cliff.
Gemini 3.2 has an 89% chance of dropping May 19 on Polymarket.
That's one week from today.
I think this model beats GPT 5.5 and Claude Opus 4.7.
Google has been quietly building.
3.1 Pro already had elite reasoning and the lowest hallucination on BridgeBench.
The only thing holding it back was the tooling.
If 3.2 ships with reliable tool calling at Google I/O, the leaderboard resets.
Testing it on BridgeBench the second it drops.
Gemini Omni is coming...
A supremely advanced video model that can do really fancy video editing and understands the world better
Google I/O hype train just left the station 😄
Gemini 3.1 Flash Lite Preview is now Gemini 3.1 Flash Lite.... 🤷
Not the earth shattering new release we are all expecting
I think that is still coming - next week
Gemini 3.2 has a 47% chance of dropping next week according to Polymarket.
If Google fixes the tool calling problem, this model could be better than GPT 5.5 and Claude Opus 4.7.
Gemini 3.1 Pro already had the reasoning and the lowest hallucination rates on BridgeBench.
The intelligence was never the issue.
The tooling was.
If Gemini 3.2 ships with reliable tool calling and agent support, the entire leaderboard changes.
This could be the most important model drop of the summer.
Gemini 3.2 spotted in Gemini!
If we already receive Gemini 3.2 Flash now, the major release will probably be reserved for I/O.
h/t @Waguri_Kaoruko8 for finding