LoreonLabsPlatform
DocsHome
  • Overview

Intelligence

  • Markets
  • Builders
  • Research
  • Ecosystems
  • Launchpads
  • Search
Ecosystems

Python

Megatron-LM

Ongoing research training transformer models at scale

PythonEmerging
GitHubWebsite
Stars
—
Forks
—
Contributors
8
Last push
2mo ago

Recent commits

Latest commits.

  • do not add EoD (#3526)
    a6d6dc6Adi Renduchintala4mo ago
  • Renable full_iteration cuda graphs for inference. Add them for the mamba block. (#3250)
    32efeffSiddharth Singh4mo ago
  • Update copy-pr-bot.yaml [skip ci]
    b555bafGitHub Actions4mo ago
  • Fix Megatron-FSDP optimizer state DCP checkpointing, and fix DTensor deepcopy bug from PyTorch 26.01. (#3510)
    773c113Cory Ye4mo ago
  • Multimodal: fix argument checking (#3449)
    01b361cFaradawn Yang4mo ago
ci: Also sync direct teams (#3484)
e8fd432oliver könig4mo ago
  • ci: Enable Dependabot Automerge (#3487)
    7a36263oliver könig4mo ago
  • ci: MBridge testing branch name during merge-queues (#3513)
    b7aa6a0oliver könig4mo ago
  • Top contributors

    Builders behind this project.

    ko3n1g
    1.7K commits
    jaredcasper
    1K commits
    lmcafee-nvidia
    399 commits
    shanmugamr1992
    391 commits
    shoeybi
    339 commits
    deepakn94
    317 commits
    ericharper
    282 commits
    mikolajblaz
    242 commits