Owain Evans
@OwainEvans_UK
Runs an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email to DM.
Joined April 2020
434 Following    19.5K Followers
Our paper on Subliminal Learning was just published in Nature! Last July we released our preprint. It showed that LLMs can transmit traits (e.g. liking owls) through data that is unrelated to that trait (numbers that appear meaningless). What’s new?🧵 https://t.co/Iiv9sgjJki
Show more
0
40
888
140
Forward to community