InternLM

Wenwen Qu 375240e039 feat(moe): add local data parallel support for experts (#376)	2023-09-28 13:38:02 +08:00
* add local data parallel support for experts
* fix model checkpoint for local dp mode of expert
* do not set ep size from config
optimizer	feat(moe): add local data parallel support for experts (#376)	2023-09-28 13:38:02 +08:00
__init__.py	initial commit	2023-07-06 12:55:23 +08:00
beta2_scheduler.py	initial commit	2023-07-06 12:55:23 +08:00
lr_scheduler.py	feat(solver): fix code to adapt to torch2.0 and provide docker images (#128)	2023-07-25 19:34:52 +08:00
pipeline_utils.py	initial commit	2023-07-06 12:55:23 +08:00