ColossalAI/colossalai/nn/layer/moe
HELSON dbdc9a7783
added Multiply Jitter and capacity factor eval for MOE (#434)
2022-03-16 16:47:44 +08:00
..
__init__.py Added TPExpert for special situation 2022-03-11 15:50:28 +08:00
_operation.py Added TPExpert for special situation 2022-03-11 15:50:28 +08:00
experts.py Added TPExpert for special situation 2022-03-11 15:50:28 +08:00
layers.py added Multiply Jitter and capacity factor eval for MOE (#434) 2022-03-16 16:47:44 +08:00
utils.py added Multiply Jitter and capacity factor eval for MOE (#434) 2022-03-16 16:47:44 +08:00