Loreon
Labs
Platform
Docs
Home
Ecosystems
Python
verl
verl: Volcano Engine Reinforcement Learning for LLMs
Python
Emerging
GitHub
Website
Stars
—
Forks
—
Contributors
8
Last push
1mo ago
Recent commits
Latest commits.
[megatron] fix: ValueError when unpacking preprocess_thd_engine result in router replay (#5891)
b45ee32
guillemgt
2mo ago
[model] fix: replace inplace += with out-of-place addition in dummy visual forward (#5881)
f92bc5c
reonokiy
2mo ago
[megatron] fix: enable_routing_replay fails with MLATransformerConfig… (#5884)
b90edef
Vadim Vorobev
2mo ago
[cfg] fix: sync strategy from ActorConfig/CriticConfig to EngineConfig (#5885)
74dc16c
Yifan Wu
2mo ago
[ci] fix: fix machine label for nightly_ascend.yml (#5887)
5b3b597
yyyy2000
2mo ago
[4/n][trainer] feat: flowgrpo - add diffusers + fsdp engine support (#5802)
d5c5daa
Cheung Ka Wai
3mo ago
[trainer] fix: handle empty response_mask in calculate_debug_metrics (#5860)
ebb5663
Jackie2049
3mo ago
[megatron] fix: support critic model (#5870)
1625391
Joel
3mo ago
Top contributors
Builders behind this project.
vermouth1992
168 commits
eric-haibin-lin
157 commits
HollowMan6
89 commits
wuxibin89
86 commits
ETOgaosion
86 commits
PeterSH6
85 commits
tongyx361
73 commits
yyDing1
54 commits