Open-source project
AgentBench
by THUDM
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
3,492 stars
An open-source project — explore the code and self-host it from GitHub.