Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more!
3,713 stars
Python
Apache License 2.0
Added 7/30/2025
Tags
Research & Safety, Real-Time AI / Personalization, Open Source & Community, MLOps / AutoML, Developer Platform, Generative AI