Register and share your invite link to earn from video plays and referrals.

John Schulman
@johnschulman2
Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music
Joined May 2021
1.8K Following    74.5K Followers
Happy to share a new paper! Designing model behavior is hard -- desirable values often pull in opposite directions. Jifan's approach systematically generates scenarios where values conflict, helping us see where specs are missing coverage and how different models balance tradeoffs.
Show more
New research paper with Anthropic and Thinking Machines AI companies use model specifications to define desirable behaviors during training. Are model specs clearly expressing what we want models to do? And do different frontier models have different personalities? We generated thousands of scenarios to find out. 🧵
Show more