Pavlo Molchanov (@PavloMolchanov): We are releasing Star Elastic - turn ONE reasoning LLM into MANY sizes with a single post-training run. 360× cheaper than pretraining a family of models. 7× better than SOTA compression. Split reasoning capability. Plus elastic budget control that beats the accuracy-latency frontier. Paper: https://t.co/kOdTSZ1jHb HF models: https://t.co/1SWU6O7xsE Thread 👇

Pavlo Molchanov

@PavloMolchanov

Director of Research @NVIDIA

Joined March 2014

436 Following · 3.9K Followers


2026.05.13 16:58
