Making large AI models cheaper, faster and more accessible
Latest commit: 62c13e7969 by Wenxuan Tan, [Ring Attention] Improve comments (#6085), 1 month ago
__init__.py           [shardformer] fix linear 1d row and support uneven splits for fused qkv linear (#6084)  1 month ago
_operation.py         [shardformer] optimize seq parallelism (#6086)  1 month ago
attn.py               [Ring Attention] Improve comments (#6085)  1 month ago
dropout.py            [misc] update pre-commit and run all files (#4752)  1 year ago
embedding.py          [fp8] support hybrid parallel plugin (#5982)  3 months ago
linear.py             [shardformer] optimize seq parallelism (#6086)  1 month ago
loss.py               [Feature] Split cross-entropy computation in SP (#5959)  2 months ago
normalization.py      [fp8] Merge feature/fp8_comm to main branch of Colossalai (#6016)  3 months ago
parallel_module.py    [shardformer] refactor embedding resize (#5603)  7 months ago
qkv_fused_linear.py   [shardformer] optimize seq parallelism (#6086)  1 month ago
utils.py              [Ring Attention] Improve comments (#6085)  1 month ago