You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
ColossalAI/colossalai/shardformer/layer
Guangyao Zhang 669849d74b
[ShardFormer] Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM (#5897)
5 months ago
..
moe
__init__.py [Feature] Enable PP + SP for llama (#5868) 5 months ago
_operation.py [ShardFormer] Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM (#5897) 5 months ago
attn.py
dropout.py
embedding.py
linear.py
loss.py [Feature] Enable PP + SP for llama (#5868) 5 months ago
normalization.py
parallel_module.py
qkv_fused_linear.py
utils.py