InternLM/internlm/moe
Wenwen Qu 929dd36cf2 avoid moe parameter partition in zero optimizer 2023-08-15 17:52:51 +08:00
..
experts.py avoid moe parameter partition in zero optimizer 2023-08-15 17:52:51 +08:00
sharded_moe.py modified: internlm/model/moe.py 2023-08-08 16:46:14 +08:00