InternLM/internlm
Wenwen Qu 14a81e5c1d add codes for reducing moe parameters in expert data group 2023-08-22 17:29:28 +08:00
apis         initial commit  2023-07-06 12:55:23 +08:00
core         create expert data group and broadcast moe parameter in expert data group  2023-08-21 11:40:39 +08:00
data         refactor(solver/optimizer): improve optimizer memory (#193)  2023-08-11 17:46:07 +08:00
initialize   Merge main to develop (#203)  2023-08-16 15:57:26 +08:00
model        Merge branch 'develop' into feature_add_moe  2023-08-17 16:37:06 +08:00
moe          avoid moe parameter partition in zero optimizer  2023-08-15 17:52:51 +08:00
monitor      feat(monitor): support monitor and alert (#175)  2023-08-08 11:18:15 +08:00
solver       add codes for reducing moe parameters in expert data group  2023-08-22 17:29:28 +08:00
utils        Merge branch 'develop' into feature_add_moe  2023-08-17 16:37:06 +08:00
__init__.py  initial commit  2023-07-06 12:55:23 +08:00