new research from me @METR_Evals:
technical workers claim that today's AI impacts the value of their work to an extraordinary degree (& that this is growing over time).
of course, self-reports plausibly overestimate. the magnitudes nonetheless strike me as remarkable.
In unrelated news, I’m joining @METR_Evals full time.
We’re looking for the best people in the world to keep up with catastrophic risk - reach out if you’re interested!
We evaluated an early version of Claude Mythos Preview for risk assessment during a limited window in March 2026. We estimated a 50%-time-horizon of at least 16hrs (95% CI 8.5hrs to 55hrs) on our task suite, at the upper end of what we can measure without new tasks.