Register and share your invite link to earn from video plays and referrals.

Zihan "Zenus" Wang
@wzenus
Reasoning agent / RL / efficiency research @NorthwesternU & incoming @nvidia. Ex @Microsoft @yutori_ai @deepseek_ai @uiuc_nlp @RUC1937.
Joined March 2022
665 Following    23K Followers
In Agent RL, models suffer from Template Collapse. They generate vast, diverse outputs (High Entropy) that lose all meaningful connection to the input prompt (Low Mutual Information). In other words, agent learn different ways to say nothing. ๐Ÿš€ Introducing RAGEN-v2 -- Here's how we define and fix such silent failure modes in Agent RL. ๐Ÿงต
Show more