NeMo-RL-ProRL

Scalable toolkit for efficient model reinforcement

Python · Emerging
Stars: 1
Forks: —
Contributors: 8
Last push: 7mo ago

Recent commits

Latest commits.

  • fix: Reinitialize model parallel after import (#1317)
    d726c38 · Yubo Gao · 8mo ago
  • fix: enhancing non-colocated refit performance by having inclusive comm group (#1264)
    57046a4 · Youngeun Kwon · 8mo ago
  • feat: Add deepseek flops tracker (original #1250) (#1305)
    3fccb63 · Guyue Huang · 8mo ago
  • fix: qwen32 nightly metric check more stable (#1271)
    9a1e3df · Terry Kong · 8mo ago
  • fix: deepscaler-24k test reduce to 10 steps to safely finish in 4 hr (#1280)
    806e285 · Terry Kong · 8mo ago
  • fix: parallel state initialization error in Megatron to HF model conversion (#1120)
    00cb570 · Stan Kirdey · 8mo ago
  • build: Fix ngc pytorch build with deep-ep (#1234)
    1f979e0 · Charlie Truong · 8mo ago
  • chore: Revert "chore: 0.4.0.rc0 -> 0.4.0 (#1285)" (#1296)
    5efbe4f · Charlie Truong · 8mo ago

Top contributors

Builders behind this project.

  • terrykong: 128 commits
  • parthchadha: 56 commits
  • ashors1: 55 commits
  • chtruong814: 43 commits
  • yuki-97: 41 commits
  • SahilJain314: 39 commits
  • yfw: 24 commits
  • ZhiyuLi-Nvidia: 12 commits