InternLM/internlm/moe
Wenwen Qu 21624f6f81
fix(moe): remove norm&gate force sync (#448)
* add zero broadcast_sync

* delete old sync logic

* fix merged error

* refactor code

* remove some unused function (is norm/gate group)
2023-11-01 11:29:55 +08:00
experts.py feat(moe): add moe module (#182) 2023-09-27 15:54:53 +08:00
sharded_moe.py fix(moe): remove norm&gate force sync (#448) 2023-11-01 11:29:55 +08:00