RL

Scalable toolkit for efficient model reinforcement

Other · Emerging

Stars

—

Forks

—

Contributors

8

Last push

6mo ago

Recent commits

Latest commits.

feat: add dapo recipe and test (#1617)
56e8fcb · Zhiyu Li · 6mo ago
fix: Fix crash when using activation_checkpointing (#1676)
02d5142 · Yi-Fu Wu · 6mo ago
fix: Fix fp8 after vllm v0.11.2 bump (#1660)
b238e41 · Guyue Huang · 6mo ago
test: Perf recipe for v0.5 (#1667)
fab6234 · Guyue Huang · 6mo ago
fix: Fix Fp8 sequence padding for PP>1 case (#1579)
d0651dd · Guyue Huang · 6mo ago
fix: Fix crash when using cp in dtensor path (#1663)
91658c8 · Yi-Fu Wu · 6mo ago
fix: Handle disabled validation in SFT training (#1611)
4794ca7 · sahgerlad · 6mo ago
fix: Support datasets saved with save_to_disk in ResponseDataset (#1610)
48dbb37 · sahgerlad · 6mo ago

Top contributors

Builders behind this project.

youngeunkwon0405