LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Narratives
Ecosystems
Launchpads

Discover

Search
Sources

Other

nemo-rl

Scalable toolkit for efficient model reinforcement

OtherEmerging

Stars

—

Forks

—

Contributors

8

Last push

8mo ago

Recent commits

Latest commits.

feat: additional kl metrics (#1420)
4db0db2Zhiyu Li8mo ago
fix: Adding mean total tokens per sample to the output log (#1406)
b3aac89Youngeun Kwon8mo ago
fix: Fix grad norm metric in mcore path (#1426)
9475e7bYi-Fu Wu8mo ago
chore: major version bump (torch 2.8, vllm 0.11, ray 2.49) & SP fixes (#1334)
3f36d14Terry Kong8mo ago
fix: append to hf_overrides rather than overwriting (#1413)
79269afAnna Shors8mo ago

docs: Update README.md to say NeMo RL (#1424)

e9bd6fdSylendran958mo ago

docs: Add repo overview diagram (#1403)

e762237Wenwen Gao8mo ago

feat: Overlap param iteration and broadcast in non-colocated refit (#1379)

73e0c09Youngeun Kwon8mo ago

Top contributors

Builders behind this project.