Loreon
Labs
Platform
Docs
Home
Ecosystems
Python
Megatron-LM
Ongoing research training transformer models at scale
Python
Emerging
GitHub
Website
Stars
—
Forks
—
Contributors
8
Last push
3mo ago
Recent commits
Latest commits.
Merge branch 'decoder_input' into 'main'
cbc89b3
Oliver Koenig
15mo ago
ADLR/megatron-lm!2967 - Remove obsolete reference to decoder_input
530092b
Keshav Santhanam
15mo ago
Merge branch 'helenn-evals-completions-api' into 'main'
e5d3bf7
Deepak Narayanan
15mo ago
ADLR/megatron-lm!2959 - Re-enable completions endpoint
74f8865
Helen Ngo
15mo ago
Merge branch 'helenn-bert-failure-fix' into 'main'
876ff5a
Oliver Koenig
15mo ago
ADLR/megatron-lm!2994 - Fix BERT signature in legacy path
32ea646
Helen Ngo
15mo ago
Merge branch 'shifang/moe_with_calculate_per_token_loss' into 'main'
6dfb5c2
Oliver Koenig
15mo ago
ADLR/megatron-lm!2851 - fix(moe): Fix MoE Aux loss scaling when calculate_per_token_loss=True
87d9d25
Shifang Xu
15mo ago
Top contributors
Builders behind this project.
jaredcasper
995 commits
ko3n1g
820 commits
lmcafee-nvidia
365 commits
shanmugamr1992
361 commits
shoeybi
339 commits
ericharper
270 commits
deepakn94
261 commits
mikolajblaz
235 commits