Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Cherry-pick PR#2581 and PR#2776 to main
#3075 opened Jan 26, 2026 by BestJuly Loading…
6 tasks
Core 0.16
(Draft)[Main] fix cg missing wgrad hook bug Something isn't working
#3074 opened Jan 26, 2026 by Wohox Loading…
6 tasks
debug: Unit test failures
#3073 opened Jan 25, 2026 by chtruong814 Loading…
6 tasks
Core 0.16
[Megatron-FSDP] Add dtype customization to Megatron-FSDP. Expert Review Apply this label to indicate that your PR is ready for expert review. module: megatron-fsdp
#3067 opened Jan 24, 2026 by cspades Loading…
3 of 6 tasks
Core 0.16
Sajadn/extract embd
#3060 opened Jan 23, 2026 by sajadn Loading…
μP: Maximal Update Parameterization community-request
#3058 opened Jan 23, 2026 by plugyawn Loading…
3 of 6 tasks
docs: Release docs
#3055 opened Jan 23, 2026 by ko3n1g Loading…
6 tasks
Core 0.16
fix(fsdp): add CLI argument for outer_dp_sharding_strategy community-request Final Review Apply this label to indicate that your PR is ready for final review. module: megatron-fsdp needs-follow-up Issue needs follow-up
#3053 opened Jan 23, 2026 by liuyun7345 Loading…
4 tasks done
Core 0.16
Added --ft-num-warmup-iters option. complexity: low Final Review Apply this label to indicate that your PR is ready for final review. module: training
#3052 opened Jan 23, 2026 by hexinw-nvidia Loading… Core 0.16
added vllm fakequant export support
#3050 opened Jan 22, 2026 by kinjalpatel27 Loading…
6 tasks
[dev] pull main 260122
#3045 opened Jan 22, 2026 by FDecaYed Loading…
6 tasks
Core 0.16
Add absorbed-mla
#3044 opened Jan 22, 2026 by kunlunl Loading…
6 tasks
[fix] Bug fix for offloading in evaluate() Expert Review Apply this label to indicate that your PR is ready for expert review. needs-follow-up Issue needs follow-up
#3043 opened Jan 22, 2026 by lhb8125 Loading…
6 tasks
Core 0.16
Fuse MLA DOWN projection GEMMs community-request complexity: medium dev2main: mbridge dev to main: this PR is needed in main for mbridge Expert Review Apply this label to indicate that your PR is ready for expert review.
#3039 opened Jan 22, 2026 by cjld Loading…
6 tasks
Core 0.16
ProTip! Updated in the last three days: updated:>2026-01-23.