注册并分享邀请链接,可获得视频播放与邀请奖励。

Sudo su
@sudoingX
GPU/local LLM. more RAM and OSS... everywhere
加入 August 2022
976 正在关注    29.5K 粉丝
hermes agent is already the best on local models. but i'm working on more edges to make it fly even harder. before that, if your agent keeps crashing on local inference here's what to check: > max_turns: default is tuned for fast frontier models. bump from 30 to 50. slow local models need more breathing room per agentic loop. >gateway_timeout: raise from 600 to 1200. local inference at 12-17 tok/s will timeout silently and look like crashes. > context accumulation: auto-reset is off by default. your session grows until you /reset. long convos choke the agent. reset between major tasks. if you're running anything under 20 tok/s locally, these three settings are the difference between "broken" and "flying." tune your config before you blame the tool.
显示更多
0
23
275
18
转发到社区