InternLM/internlm/solver
Wenwen Qu 375240e039
feat(moe): add local data parallel support for experts (#376)
* add local data parallel support for experts

* fix model checkpoint for local dp mode of expert

* do not set ep size from config
2023-09-28 13:38:02 +08:00
Name                 Last commit                                                                       Date
optimizer            feat(moe): add local data parallel support for experts (#376)                     2023-09-28 13:38:02 +08:00
__init__.py          initial commit                                                                    2023-07-06 12:55:23 +08:00
beta2_scheduler.py   initial commit                                                                    2023-07-06 12:55:23 +08:00
lr_scheduler.py      feat(solver): fix code to adapt to torch2.0 and provide docker images (#128)      2023-07-25 19:34:52 +08:00
pipeline_utils.py    initial commit                                                                    2023-07-06 12:55:23 +08:00