-
Notifications
You must be signed in to change notification settings - Fork 3.8k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: wait for async P2P send before deallocating output tensor
#4047
opened Mar 28, 2026 by
ZhiyuLi-Nvidia
•
Draft
5 tasks
Fix HybridDeviceOptimizer KeyError after mixed-precision param replacement
community-request
Final Review
PR is in the "final review" stage
#4046
opened Mar 28, 2026 by
ma-ben
Loading…
Docs: improve docstrings and comments in example training loop
community-request
#4041
opened Mar 27, 2026 by
DhineshPonnarasan
•
Draft
[Kernel] Fused Indexer Loss Kernel
community-request
#4039
opened Mar 27, 2026 by
laixinn
Loading…
5 tasks
Fix unnecessary permute padding for non-quantized MoE dispatch
community-request
needs-follow-up
Issue needs follow-up
#4038
opened Mar 27, 2026 by
xiaoxi-wangfj
Loading…
5 tasks
Add stacklevel to warnings.warn() calls across megatron/core
community-request
#4036
opened Mar 26, 2026 by
Bias92
Loading…
3 of 5 tasks
docs: add AGENT.md with CI/CD and failure navigation guidance
docs-only
documentation only (docs or docstrings)
fix fine_grained_callables with fused rmsnorm residual
complexity: low
dev2main: mbridge
dev to main: this PR is needed in main for mbridge
Final Review
PR is in the "final review" stage
m-fsdp: wire use_precision_aware_optimizer from ddp_config to ParamAn…
complexity: low
Final Review
PR is in the "final review" stage
module: megatron-fsdp
[DO NOT MERGE] Combined MiMo non-colocated changes for MBridge integration
#4022
opened Mar 24, 2026 by
yashaswikarnati
•
Draft
Allow the evaluation batch size to differ from the training batch size
#4014
opened Mar 24, 2026 by
michal2409
Loading…
4 of 5 tasks
Fix spike-no-more embedding weight decay regression (#3945)
community-request
#4012
opened Mar 24, 2026 by
DhineshPonnarasan
•
Draft
Rename moe_router_force_biased to moe_router_force_biased_std and only apply if non-zero
#4011
opened Mar 24, 2026 by
denys-fridman
Loading…
5 tasks
fix: improve text generation CLI validation and clean up duplicate imports
community-request
Final Review
PR is in the "final review" stage
needs-follow-up
Issue needs follow-up
#4010
opened Mar 24, 2026 by
nathon-lee
Loading…
NVFP4 native weights for DDP
26.04
this PR is high priority and should be merged asap
community-request
complexity: high
Final Review
PR is in the "final review" stage
needs-follow-up
Issue needs follow-up
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.