LoreonLabsPlatform

Overview

Intelligence

Markets
Builders
Research
Ecosystems
Launchpads

Search

Python

NeMo-RL-ProRL

Scalable toolkit for efficient model reinforcement

PythonEmerging

Stars

1

Forks

—

Contributors

8

Last push

7mo ago

Recent commits

Latest commits.

fix: Reinitialize model parallel after import (#1317)
d726c38Yubo Gao8mo ago
fix: enhancing non-colocated refit performance by having inclusive comm group (#1264)
57046a4Youngeun Kwon8mo ago
feat: Add deepseek flops tracker (original #1250) (#1305)
3fccb63Guyue Huang8mo ago
fix: qwen32 nightly metric check more stable (#1271)
9a1e3dfTerry Kong8mo ago
fix: deepscaler-24k test reduce to 10 steps to safely finish in 4 hr (#1280)
806e285Terry Kong8mo ago

fix: parallel state initialization error in Megatron to HF model conversion (#1120)

00cb570Stan Kirdey8mo ago

build: Fix ngc pytorch build with deep-ep (#1234)

1f979e0Charlie Truong8mo ago

chore: Revert "chore: 0.4.0.rc0 -> 0.4.0 (#1285)" (#1296)

5efbe4fCharlie Truong8mo ago

Top contributors

Builders behind this project.