ColossalAI/colossalai/nn/layer/moe
HELSON bccbc15861
[MOE] changed parallelmode to dist process group (#460)
2022-03-19 13:46:29 +08:00
..
__init__.py [MOE] changed parallelmode to dist process group (#460) 2022-03-19 13:46:29 +08:00
_operation.py [MOE] changed parallelmode to dist process group (#460) 2022-03-19 13:46:29 +08:00
experts.py Added TPExpert for special situation 2022-03-11 15:50:28 +08:00
layers.py added Multiply Jitter and capacity factor eval for MOE (#434) 2022-03-16 16:47:44 +08:00
utils.py added Multiply Jitter and capacity factor eval for MOE (#434) 2022-03-16 16:47:44 +08:00