Loreon
Labs
Platform
Docs
Home
Ecosystems
Shell
PeRL
PeRL: Parameter-Efficient Reinforcement Learning
Shell
Emerging
GitHub
Website
Stars
80
Forks
12
Contributors
7
Last push
1mo ago
Recent commits
Latest commits.
Merge pull request #40 from MikaStars39/feature/async-perf-p0-p1
6c9f85a
MikaStar39
1mo ago
add nanoeval evaluation module with backend, reward, and utils
59e0b89
MikaStars39
1mo ago
[chore] remove deprecated qwen experiment scripts
9cd8ab4
MikaStars39
2mo ago
[refactor] rename hf2mgt to hf2mcore and add mcore2hf converter
e4ad447
MikaStars39
2mo ago
[doc] add OOM recovery and parallelism change guide to SFT README
4114e9a
MikaStars39
2mo ago
[perf] increase max-tokens-per-gpu to 128k for better B300 utilization
ac3bbe3
MikaStars39
2mo ago
[fix] add B300 (sm_103a) compatibility fixes to SFT script
db150bb
MikaStars39
2mo ago
[fix] switch wandb to offline mode in B300 SFT script
f141f9b
MikaStars39
2mo ago
Top contributors
Builders behind this project.
MikaStars39
179 commits
shangshang-wang
18 commits
memset0
4 commits
Alic-Li
3 commits
ChenYXxxx
2 commits
TwT-JD
2 commits
lbertge
1 commits