由 alibaba 提供
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
一个开源项目——浏览代码并从 GitHub 自托管。