Other
Minimalistic 3D-parallelism training for large language models at lightning speed; a fork of Hugging Face's nanotron.