Robert Lauko

@robert_lauko

Founder of Liquity and (AI-based scientific impact ranking of papers)

Joined February 2020

Robert Lauko (@robert_lauko)

2026.05.09 09:23

We tested Claude, GPT-5.2, and Gemini as scientific paper reviewers on @KurateOrg. When scoring arXiv papers on impact (1–10), GPT clusters around 7–8 and Gemini is similar. Claude Opus 4.6 uses the full range, making it a far better discriminator. See the distributions 👇
