Yucheng Shi(@Yucheng__Shi):What should AI generate in order to improve itself? Not just more questions, traces, or answers.  We believe it should learn to generate environments. Excited to share my first work after joining Tencent Hunyuan LLM. We study how models can construct reusable, verifiable environments that provide stable training signals for self-improvement. This is only a first feasibility step, but we see environment construction as a necessary path toward truly self-improving AI. Paper:

Yucheng Shi

@Yucheng__Shi

Research Scientist @ Tencent Hunyuan LLM (Seattle) | Post-training, RL & Agents |

加入 March 2022

82 正在关注 59 粉丝

Yucheng Shi@Yucheng__Shi

2026.05.15 18:45

What should AI generate in order to improve itself? Not just more questions, traces, or answers.  We believe it should learn to generate environments. Excited to share my first work after joining Tencent Hunyuan LLM. We study how models can construct reusable, verifiable environments that provide stable training signals for self-improvement. This is only a first feasibility step, but we see environment construction as a necessary path toward truly self-improving AI. Paper:

显示更多