註冊並分享邀請連結,可獲得影片播放與邀請獎勵。

Sudo su
@sudoingX
GPU/local LLM. more RAM and OSS... everywhere
加入 August 2022
976 正在關注    29.5K 粉絲
hermes agent is already the best on local models. but i'm working on more edges to make it fly even harder. before that, if your agent keeps crashing on local inference here's what to check: > max_turns: default is tuned for fast frontier models. bump from 30 to 50. slow local models need more breathing room per agentic loop. >gateway_timeout: raise from 600 to 1200. local inference at 12-17 tok/s will timeout silently and look like crashes. > context accumulation: auto-reset is off by default. your session grows until you /reset. long convos choke the agent. reset between major tasks. if you're running anything under 20 tok/s locally, these three settings are the difference between "broken" and "flying." tune your config before you blame the tool.
顯示更多
0
23
275
18
轉發到社區