由 OpenPipe 提供
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!
一个开源项目——浏览代码并从 GitHub 自托管。