
⚡🛡️ Evan Pappas
@Hevalon
🛡️ Ex Technologia Libertas - Έλευθερία διὰ τῆς τέχνης - (Dec/Acc)
Joined September 2009
4K Following    1.4K Followers
I built autoresearch-rl and pointed it at GRPO fine-tuning on @basilic_ai A100s. One command. 15 iterations. Zero human intervention. 100% infrastructure success rate. GSM8K pass@1: 26% baseline to 36%. The hard part wasn't the search algorithm. It was the infrastructure.