Skip to content

Pull requests: aws-samples/awsome-distributed-training

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fixes for observability addOn
#922 opened Dec 30, 2025 by Madhubalasri-B Loading…
Feat/verl/megatron
#921 opened Dec 18, 2025 by KeitaW Draft
slurm: set containerd root to EBS
#914 opened Dec 7, 2025 by maekawataiki Loading…
mtc with rlvr
#912 opened Nov 28, 2025 by mvinci12 Loading…
Remove AWS_OFI_NCCL_VERSION
#911 opened Nov 27, 2025 by pbelevich Loading…
refactor: enhance hostfile_topologify.py readability
#909 opened Nov 26, 2025 by Zhenye-Na Loading…
Expert parallelism benchmarks
#901 opened Nov 19, 2025 by pbelevich Loading…
Adding nanoVLM sample
#864 opened Sep 25, 2025 by allela-roy Loading…
NeMo 2 Performance instructions
#812 opened Aug 5, 2025 by pbelevich Loading…
delete users script in hyperpod
#807 opened Aug 4, 2025 by cszhz Loading…
Feature/slinky slurm hyperpod eks
#804 opened Aug 1, 2025 by bdaqiq01 Loading…
adding nemo2.0 eks test case
#688 opened May 21, 2025 by KeitaW Draft
Feat/ddp mlflow enhancement New feature or request
#655 opened Apr 28, 2025 by KeitaW Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.