A couple of weeks ago, we released EvoAgentBench, a benchmark for testing both your agent's raw capabilities and its self-evolving capabilities.
Since release, it's been downloaded over 730 times, ranking it the #2 agent benchmark on Hugging Face.
What it actually tests 🧵