InternLM/internlm/core
Wenwen Qu 12c614db94 create expert data group and broadcast moe parameter in expert data group 2023-08-21 11:40:39 +08:00
..
communication fix(ci): fix ci train error (#199) 2023-08-15 20:09:54 +08:00
context create expert data group and broadcast moe parameter in expert data group 2023-08-21 11:40:39 +08:00
scheduler Merge branch 'develop' into feature_add_moe 2023-08-17 16:37:06 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00
engine.py initial commit 2023-07-06 12:55:23 +08:00
gradient_handler.py initial commit 2023-07-06 12:55:23 +08:00
naive_amp.py refactor(scheduler): rewrite pipeline scheduler (#138) 2023-08-03 11:48:12 +08:00
trainer.py feat(core/scheduler): support pipeline parallel (#98) 2023-07-24 20:52:09 +08:00