InternLM/internlm/solver/optimizer
gaoyang07 483bd706dd fix when resuming lr_scheduler without loading optimizer 2023-12-29 15:15:16 +08:00
__init__.py           feat(train): add fsdp training option (#293)              2023-10-09 18:59:31 +08:00
base_optimizer.py     feat(train): add fsdp training option (#293)              2023-10-09 18:59:31 +08:00
fsdp_optimizer.py     fix when resuming lr_scheduler without loading optimizer  2023-12-29 15:15:16 +08:00
hybrid_zero_optim.py  fix when resuming lr_scheduler without loading optimizer  2023-12-29 15:15:16 +08:00
store.py              feat(moe):support zero for expert local dp (#404)         2023-10-09 17:45:26 +08:00
utils.py              fix token grad norm with tp (#547)                        2023-12-18 18:33:28 +08:00