Introducing Renderers
RL trainers work in tokens. Environments work in messages. Converting back and forth between the two corrupts sampled tokens and wastes compute on every agentic turn.
With Renderers, we fix this mismatch. The fix unlocks more than 3x throughput on popular open models.
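
To make the token/message mismatch concrete, here is a minimal sketch using a toy tokenizer. Everything in it (the three-entry vocabulary, `encode`, `decode`) is a hypothetical illustration and not the Renderers API or any real tokenizer. It shows one way a round trip through text can fail to return the tokens that were sampled: the same string can have more than one tokenization, and re-encoding picks only one of them.

```python
# Toy vocabulary (hypothetical): "ab" exists as a merged token next to "a" and "b".
VOCAB = ["ab", "a", "b"]


def encode(text: str) -> list[str]:
    """Greedy longest-match tokenizer over VOCAB (an assumption for this sketch)."""
    tokens = []
    i = 0
    while i < len(text):
        # Pick the longest vocabulary entry that matches at position i.
        match = max((t for t in VOCAB if text.startswith(t, i)), key=len, default=None)
        if match is None:
            raise ValueError(f"no token matches at position {i}")
        tokens.append(match)
        i += len(match)
    return tokens


def decode(tokens: list[str]) -> str:
    """Concatenate token strings back into text."""
    return "".join(tokens)


# The policy sampled two separate tokens.
sampled = ["a", "b"]

# The environment only sees the decoded message text.
message_text = decode(sampled)

# Turning the message back into tokens yields a different sequence.
retokenized = encode(message_text)

print(sampled, "->", repr(message_text), "->", retokenized)
```

In this toy case both sequences decode to the same text, yet a trainer that re-tokenizes the message would be working with a token sequence the policy never produced.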