InternLM/internlm
Wenwen Qu a1f99b64bc Merge branch 'feature_add_moe' of https://github.com/blankde/InternLM into feature_add_moe 2023-08-23 13:52:29 +08:00
..
apis initial commit 2023-07-06 12:55:23 +08:00
core change the scale position for latent moe_loss 2023-08-23 13:25:20 +08:00
data refactor(solver/optimizer): improve optimizer memory (#193) 2023-08-11 17:46:07 +08:00
initialize Merge main to develop (#203) 2023-08-16 15:57:26 +08:00
model Merge branch 'develop' into feature_add_moe 2023-08-17 16:37:06 +08:00
moe avoid moe parameter partition in zero optimizer 2023-08-15 17:52:51 +08:00
monitor feat(monitor): support monitor and alert (#175) 2023-08-08 11:18:15 +08:00
solver change the scale position for latent moe_loss 2023-08-23 13:25:20 +08:00
utils Merge branch 'feature_add_moe' of github.com:blankde/InternLM into feature_add_moe_pp_zl 2023-08-17 17:00:04 +08:00
__init__.py initial commit 2023-07-06 12:55:23 +08:00