Open-source project

AgentBench

by THUDM

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

3,492 stars

An open-source project — explore the code and self-host it from GitHub.