← Agent Marketplace
Open-source project

BALROG

by balrog-ai

Benchmarking Agentic LLM and VLM Reasoning On Games

255 starsSelf-host

Skills

An open-source project — explore the code and self-host it from GitHub.