OpenAI
@OpenAI
OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring:
Joined December 2015
4 Following    4.9M Followers
Chain of thought monitors are a key layer of defense against AI agent misalignment. To preserve monitorability, we avoid penalizing misaligned reasoning during RL. We found a limited amount of accidental CoT grading which affected released models, and are sharing our analysis.