nemo-rl
Scalable toolkit for efficient model reinforcement
Other
Emerging
Contributors: 8
Last push: 8mo ago
Recent commits
Latest commits.
feat: additional kl metrics (#1420) | 4db0db2 | Zhiyu Li | 8mo ago
fix: Adding mean total tokens per sample to the output log (#1406) | b3aac89 | Youngeun Kwon | 8mo ago
fix: Fix grad norm metric in mcore path (#1426) | 9475e7b | Yi-Fu Wu | 8mo ago
chore: major version bump (torch 2.8, vllm 0.11, ray 2.49) & SP fixes (#1334) | 3f36d14 | Terry Kong | 8mo ago
fix: append to hf_overrides rather than overwriting (#1413) | 79269af | Anna Shors | 8mo ago
docs: Update README.md to say NeMo RL (#1424) | e9bd6fd | Sylendran95 | 8mo ago
docs: Add repo overview diagram (#1403) | e762237 | Wenwen Gao | 8mo ago
feat: Overlap param iteration and broadcast in non-colocated refit (#1379) | 73e0c09 | Youngeun Kwon | 8mo ago
Top contributors
Builders behind this project.
terrykong: 135 commits
parthchadha: 57 commits
ashors1: 56 commits
chtruong814: 44 commits
yuki-97: 44 commits
SahilJain314: 39 commits
yfw: 26 commits
ZhiyuLi-Nvidia: 14 commits