Loreon
Labs
Platform
Docs
Home
Ecosystems
Other
Megatron-LM
Ongoing research training transformer models at scale
Other
Emerging
GitHub
Stars
—
Forks
—
Contributors
8
Last push
37mo ago
Recent commits
Latest commits.
Merge branch 'interleaved-pipeline-bugfix' into 'main'
992da75
Jared Casper
37mo ago
Perform grad sync at correct place in interleaved pipeline parallelism
ea76ecd
Tim Moon
37mo ago
Merge branch 'outputtensor_index' into 'main'
d2891b4
Jared Casper
37mo ago
fix indexation for output tensor after gradscaler call
41221b8
Abhinav Khattar
37mo ago
Merge branch 'lmcafee/rotary-kwarg-dev' into 'main'
382fd9d
Jared Casper
37mo ago
Fixed rotary_pos_emb's position in layer's forward args.
a6c574d
Lawrence McAfee
37mo ago
Merge branch 'gptdataset-assert' into 'main'
f965380
Jared Casper
37mo ago
Fix GPTDataset assert.
1997e94
Jared Casper
37mo ago
Top contributors
Builders behind this project.
jaredcasper
442 commits
shoeybi
338 commits
lmcafee-nvidia
190 commits
kvareddy
114 commits
mpatwary
94 commits
RPrenger
85 commits
zliucr
78 commits
deepakn94
66 commits