ColossalAI/colossalai/nn/layer
HELSON f7f2248771
[moe] fix MoE bugs (#1628)
* remove forced FP32 modules

* correct no_shard-contexts' positions
2022-09-22 13:56:30 +08:00
..
colossalai_layer [NFC] polish colossalai/nn/layer/colossalai_layer/dropout.py code style (#1568) 2022-09-08 22:11:04 +08:00
moe [moe] fix MoE bugs (#1628) 2022-09-22 13:56:30 +08:00
parallel_1d add gather_output for VocabParallelClassifier1D (#1569) 2022-09-08 16:40:56 +08:00
parallel_2d [utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548) 2022-09-06 20:18:35 +08:00
parallel_2p5d [utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548) 2022-09-06 20:18:35 +08:00
parallel_3d [utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548) 2022-09-06 20:18:35 +08:00
parallel_sequence [NFC] polish colossalai/nn/layer/parallel_sequence/layers.py code style (#1280) 2022-07-13 12:08:21 +08:00
utils [NFC] polish colossalai/nn/layer/utils/common.py code style (#983) 2022-05-17 10:25:06 +08:00
vanilla [NFC] polish colossalai/nn/layer/vanilla/layers.py code style (#1295) 2022-07-13 12:08:21 +08:00
wrapper [NFC] polish colossalai/nn/layer/wrapper/pipeline_wrapper.py code style (#1303) 2022-07-13 19:01:07 +08:00
__init__.py [MOE] changed parallelmode to dist process group (#460) 2022-03-19 13:46:29 +08:00
base_layer.py [utils] refactor parallel layers checkpoint and bcast model on loading checkpoint (#1548) 2022-09-06 20:18:35 +08:00