Lawrence Chan (@justanotherlaw)

2026.05.07 21:13

Glad more AI safety work is getting reviewed by independent parties. Most lab posts never go through peer review, and when they do it's with a 4+ month lag: an eternity in AI terms. Public reviews like Buck'scan help surface methodological gaps and keep researchers honest.

Buck Shlegeris@bshlgrs

2026.05.07 19:06

We reviewed OpenAI's blog post “Investigating the consequences of accidentally grading CoT during RL".

Forward to community

Lawrence Chan@justanotherlaw

2026.05.04 20:06

Forgot to add: it was great working with @ben_sturgeon on this! He was patient as I raised objection after objection from reading data/transcripts + he taught me a lot about managing multiple AI agents. He’s a MATS extension scholar on persona stuff; if you’re hiring, reach out!

Lawrence Chan@justanotherlaw

2026.05.02 02:17

A recent viral paper claims to reverse-engineer the parameter counts of frontier models: GPT-5.5 = 9.7T, Opus 4.7 = 4.0T, o1 = 3.5T, etc. @ben_sturgeon and I investigated and found serious issues in the paper; fixing them gives GPT-5.5 as ~1.5T (90% CI: 256B-8.3T).

Forward to community

Lawrence Chan@justanotherlaw

2026.05.04 19:32

In 2022, I joined what was then ARC Evals. Last Friday, I wrapped up at @METR_Evals. METR has done some of the most important work in AI; I'm grateful to @BethMayBarnes and others for letting me be part of it. I'll be taking time to write, reflect, and think. More to come soon!

134

Forward to community

Lawrence Chan Reposted

Benno Sturgeon@ben_sturgeon

2026.05.02 02:45

Two reflections on my IKP sanity check with @justanotherlaw. Viral papers are cheaper than ever to produce thanks to AI agents. Thankfully, AI agents also make checking viral claims easier than ever.

Forward to community

Lawrence Chan@justanotherlaw

2026.05.02 02:17

958

Forward to community

Lawrence Chan Reposted

Ryan Greenblatt@RyanPGreenblatt

2026.04.15 16:56

Current AIs (Opus 4.5/4.6) seem pretty misaligned to me (in a mundane behavioral sense). In my experience, they often oversell their work, downplay problems, and stop early while claiming to be done. They sometimes brazenly cheat.