METR (@METR_evals) — X Web Viewer

METR Reposted

2026.05.11 18:17

new research from me @METR_Evals: technical workers claim that today's AI impacts the value of their work to an extraordinary degree (& growing over time). of course, self-reports plausibly overestimate. the magnitudes nonetheless strike me as remarkable.

METR Reposted

Parv Mahajan@parvmahajan0

2026.05.11 14:41

In unrelated news, I’m joining @METR_Evals full time. We’re looking for the best people in the world to keep up with catastrophic risk - reach out if you’re interested!

METR@METR_Evals

2026.05.08 23:41

We evaluated an early version of Claude Mythos Preview for risk assessment during a limited window in March 2026. We estimated a 50%-time-horizon of at least 16hrs (95% CI 8.5hrs to 55hrs) on our task suite, at the upper end of what we can measure without new tasks.

