RLinf is a flexible, scalable and open-source infrastructure designed for reinforcement-learning (RL) post-training of foundation models — including large language models (LLMs), vision-language models (VLMs), and vision-language-action (VLAs) models.
Latest commits.
Builders behind this project.