InternLM/internlm/solver
Wenwen Qu 582ee000bd
feat(moe):support zero for expert local dp (#404)
* support zero for expert local dp

* fix the code above:
    * treat optim.zero_world_size and optim.zero_local_rank as lists in model_checkpoint.py and test_model_checkpoint.py
    * add overlap and zero checks for moe in args_sanity_check()
2023-10-09 17:45:26 +08:00
optimizer feat(moe):support zero for expert local dp (#404) 2023-10-09 17:45:26 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00
beta2_scheduler.py initial commit 2023-07-06 12:55:23 +08:00
lr_scheduler.py feat(solver): fix code to adapt to torch2.0 and provide docker images (#128) 2023-07-25 19:34:52 +08:00
pipeline_utils.py initial commit 2023-07-06 12:55:23 +08:00