Register and share your invite link to earn from video plays and referrals.

Goodfire
@GoodfireAI
Using interpretability to understand, learn from, and design AI.
Joined August 2024
29 Following    22.8K Followers
A simple example: days of the week, which lie on a circular path in models’ activations. Steering linearly from Monday to Friday gets you incoherent outputs in between. Steering along the circular manifold means you cleanly shift from Mon → Tues → Wed → Thurs → Fri. (5/8)
Show more