InternLM/internlm
Wenwen Qu 582ee000bd
feat(moe):support zero for expert local dp (#404)
* support zero for expert local dp

* fix above codes:
    * treat optim.zero_world_size and optim.zero_local_rank as list in model_checkpoint.py and test_model_checkpoint.py
    * add overlap and zero check for moe in args_sanity_check(.)
2023-10-09 17:45:26 +08:00
apis initial commit 2023-07-06 12:55:23 +08:00
core fix(pipeline): fix bugs for pipeline when enable mixed precision (#382) 2023-10-09 14:01:15 +08:00
data Merge develop to main (#233) 2023-08-24 22:03:04 +08:00
initialize feat(moe):support zero for expert local dp (#404) 2023-10-09 17:45:26 +08:00
model feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
moe feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
monitor doc(monitor): add light monitoring doc (#352) 2023-09-25 19:28:09 +08:00
solver feat(moe):support zero for expert local dp (#404) 2023-10-09 17:45:26 +08:00
train feat(moe):support zero for expert local dp (#404) 2023-10-09 17:45:26 +08:00
utils feat(moe):support zero for expert local dp (#404) 2023-10-09 17:45:26 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00