Loreon
Labs
Platform
Docs
Home
Ecosystems
Other
Megatron-LM
Ongoing research training transformer models at scale
Other
Emerging
GitHub
Website
Stars
1
Forks
—
Contributors
8
Last push
2mo ago
Recent commits
Latest commits.
Update copy-pr-bot.yaml [skip ci]
3460bba
GitHub Actions
2mo ago
Fixes for modelopt examples and SFTTokenizer for transformers v5 (#4450)
83e7466
Jenny Chen
2mo ago
New allgathervdispatcher for inference and simplify old dispatcher. (#4258)
bfd4574
Siddharth Singh
2mo ago
fix(ci): add retry with backoff to approve-test-queue bot (#4559)
dcb2bd2
oliver könig
2mo ago
Avoid nsys profile crash with CUDA graphs (#4541)
12f18da
Teodor-Dumitru Ene
2mo ago
Support YAML quant recipe in PTQ and remove first/last layer modifier code (#4503)
1a83320
Jenny Chen
2mo ago
docs: use @file-path notation for file references in skills (#4542)
77afc60
oliver könig
2mo ago
Fix release tests: remove --global-batch-size conflicting with --step-batch-size-schedule (#4545)
580d53a
Deepak Narayanan
2mo ago
Top contributors
Builders behind this project.
ko3n1g
1.8K commits
jaredcasper
1K commits
lmcafee-nvidia
406 commits
shanmugamr1992
392 commits
shoeybi
339 commits
deepakn94
323 commits
ericharper
282 commits
mikolajblaz
242 commits