Pull requests: NVIDIA/Megatron-LM
Fix Bug: Configuring Datasets with train-data-path, valid-data-path, test-data-path
#840
opened May 27, 2024 by
Eisenhower
[Fix] Assertion to check if num_layers is divisible by the pipeline size
#823
opened May 13, 2024 by
kenkenpa2126
Fix incorrect src argument in broadcast_params function
#796
opened Apr 26, 2024 by
Yuxin-CV
fix loading distributed checkpoint when enable auto-detect-ckpt-format but disable use-dist-ckpt
#794
opened Apr 24, 2024 by
imh966
fix a mistake when check if num_layers dividable by vpp
#781
opened Apr 16, 2024 by
constroy
Fix typo in README.md
stale
No activity in 60 days on issue or PR
#751
opened Mar 26, 2024 by
HashiamKadhim
Support S3 checkpointing for the torch strategy in distributed checkpointing
#748
opened Mar 22, 2024 by
jrocmar
[BUG FIX] Fix world_size bug in QuickStart Example
stale
#747
opened Mar 22, 2024 by
Mr-Philo
Update outdated method name passed to get linear_layer function to match intented method that was imported
stale
#740
opened Mar 18, 2024 by
OckermanSethGVSU
Replace outdated import path of get_forward_backward_func in eval_utils.py
stale
#734
opened Mar 14, 2024 by
OckermanSethGVSU