ColossalAI/colossalai/shardformer/layer
Frank Lee f3b6aaa6b7 [shardformer] supported fused normalization (#4112) 2023-07-04 16:05:01 +08:00
..
__init__.py [shardformer] supported fused normalization (#4112) 2023-07-04 16:05:01 +08:00
_operation.py [shardformer] support vision transformer (#4096) 2023-07-04 16:05:01 +08:00
dropout.py [shardformer] supported bloom model (#4098) 2023-07-04 16:05:01 +08:00
embedding.py [shardformer] supported fused qkv checkpoint (#4073) 2023-07-04 16:05:01 +08:00
linear.py [shardformer] supported bloom model (#4098) 2023-07-04 16:05:01 +08:00
loss.py [shardformer] refactored the shardformer layer structure (#4053) 2023-07-04 16:05:01 +08:00
normalization.py [shardformer] supported fused normalization (#4112) 2023-07-04 16:05:01 +08:00
parallel_module.py [shardformer] supported fused qkv checkpoint (#4073) 2023-07-04 16:05:01 +08:00
qkv_fused_linear.py [shardformer] supported bloom model (#4098) 2023-07-04 16:05:01 +08:00
utils.py [shardformer] supported bloom model (#4098) 2023-07-04 16:05:01 +08:00