๐Ÿ™TakoAPI
Open-source project

verl-agent

By langfengQ

verl-agent is an extension of veRL designed for training LLM/VLM agents via reinforcement learning (RL). It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

2,011 stars · Self-hosted

Skills

This is an open-source project. Browse the code and self-host it from GitHub.