RLinf is a flexible, scalable and open-source infrastructure designed for reinforcement-learning (RL) post-training of foundation models — including large language models (LLMs), vision-language models (VLMs), and vision-language-action (VLAs) models.
Latest commits.
No recent commits available.
Builders behind this project.
No contributor data available.