Loreon
Labs
Platform
Docs
Home
Ecosystems
Python
verl
verl: Volcano Engine Reinforcement Learning for LLMs
Python
Emerging
GitHub
Website
Stars
—
Forks
—
Contributors
8
Last push
5mo ago
Recent commits
Latest commits.
[megatron] feat: Using MTP in RL Training and Inference (#4936)
df340d7
arron
5mo ago
[sglang, rollout] feat: support sglang as rollout engine in fully async policy (#4191)
f104dfa
Peng Zhang
5mo ago
[doc, data] fix: resolve broken documentation hyperlinks (#4970)
5053d42
aphrodite1028
5mo ago
[training_utils] A bug that caused device selection in group statistics to fail has been covered by tests. (#4967)
75d8b00
JohnConnor123
5mo ago
[training_utils] fix: Correct Attention TFLOPS estimation & fix CI (#4959)
54582f1
HaochenYuan
5mo ago
[trainer] fix: pass scores device type to `group_mean_std` call (#4962)
5689fd7
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟
5mo ago
[training_utils] fix: correctly `_resolve_device` when not specified (#4961)
e9c43b9
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟
5mo ago
[env] fix: upgrade torch, cudnn and deps versions in vllm image to fix performance issue (#4960)
65eb5a1
Begunner
5mo ago
Top contributors
Builders behind this project.
vermouth1992
157 commits
eric-haibin-lin
157 commits
ETOgaosion
83 commits
PeterSH6
82 commits
HollowMan6
69 commits
tongyx361
67 commits
wuxibin89
65 commits
FightingZhen
44 commits