Register and share your invite link to earn from video plays and referrals.

Sudo su
@sudoingX
GPU/local LLM. more RAM and OSS... everywhere
Joined August 2022
976 Following    29.5K Followers
hermes agent is already the best on local models. but i'm working on more edges to make it fly even harder. before that, if your agent keeps crashing on local inference here's what to check: > max_turns: default is tuned for fast frontier models. bump from 30 to 50. slow local models need more breathing room per agentic loop. >gateway_timeout: raise from 600 to 1200. local inference at 12-17 tok/s will timeout silently and look like crashes. > context accumulation: auto-reset is off by default. your session grows until you /reset. long convos choke the agent. reset between major tasks. if you're running anything under 20 tok/s locally, these three settings are the difference between "broken" and "flying." tune your config before you blame the tool.
Show more