working at prime is just "ugh i had this gnarly problem, let’s fix it and then make it available to everyone"
a ton of other things are coming, can’t wait to show it to yall :)
Introducing Renderers
RL trainers work in tokens. Environments work in messages. Going back and forth corrupts sampled tokens, wasting compute on every agentic turn.
With Renderers, we fix this mismatch. This unlocks >3x throughput on popular open models.