Megatron-LM-perl
MikaStars39/Megatron-LM-perl
Python
Emerging
Stars: —
Forks: —
Contributors: 8
Last push: 1mo ago
Recent commits
6ad4036  ChenYXxxx  2mo ago  Merge pull request #6 from MikaStars39/rooplus
f283ee6  keming.3  2mo ago  fix: avoid mutating live SOAP optimizer state in sharded_state_dict
9103b4c  keming.3  2mo ago  fix: handle ShardedTensorFactory in SOAP preconditioner sharding
33831a0  MikaStars39  2mo ago  fix: propagate prepend_axis_num for SOAP preconditioner sharding
4068aa1  MikaStars39  2mo ago  fix: use 1D stacking for SOAP preconditioner sharding under TP
9fad900  MikaStars39  2mo ago  fix: use temporary config mutation for SOAP Adam path
2cd8f8e  MikaStars39  2mo ago  fix: guard init_state_fn for None in sharded_state_dict
f6569da  MikaStars39  2mo ago  fix: avoid mutating shared config and ensure fp32 dtype in SOAP init
Top contributors
Builders behind this project.
ko3n1g: 1.7K commits
jaredcasper: 1K commits
lmcafee-nvidia: 389 commits
shanmugamr1992: 386 commits
shoeybi: 339 commits
deepakn94: 307 commits
ericharper: 282 commits
mikolajblaz: 242 commits