ColossalAI

Making large AI models cheaper, faster and more accessible

ai big-model data-parallelism deep-learning distributed-computing foundation-models heterogeneous-training hpc inference large-scale model-parallelism pipeline-parallelism

History

Hongxin Liu dc2cdaf3e8 [shardformer] optimize seq parallelism (#6086 ) * [shardformer] optimize seq parallelism * [shardformer] fix gpt2 fused linear col * [plugin] update gemini plugin * [plugin] update moe hybrid plugin * [test] update gpt2 fused linear test * [shardformer] fix gpt2 fused linear reduce		1 month ago
..
__init__.py	[shardformer] init shardformer code structure (#3731 )	1 year ago
auto_policy.py	[FP8] rebase main (#5963 )	4 months ago
base_policy.py	[fp8] Merge feature/fp8_comm to main branch of Colossalai (#6016 )	3 months ago
bert.py	[shardformer] optimize seq parallelism (#6086 )	1 month ago
blip2.py	[shardformer] fix linear 1d row and support uneven splits for fused qkv linear (#6084 )	1 month ago
bloom.py	[shardformer] optimize seq parallelism (#6086 )	1 month ago
chatglm2.py	[shardformer] optimize seq parallelism (#6086 )	1 month ago
command.py	[fp8] Merge feature/fp8_comm to main branch of Colossalai (#6016 )	3 months ago
deepseek.py	[hotfix] moe hybrid parallelism benchmark & follow-up fix (#6048 )	2 months ago
falcon.py	[fp8] support hybrid parallel plugin (#5982 )	3 months ago
gpt2.py	[shardformer] optimize seq parallelism (#6086 )	1 month ago
gptj.py	[shardformer] optimize seq parallelism (#6086 )	1 month ago
llama.py	[Feature] Split cross-entropy computation in SP (#5959 )	2 months ago
mistral.py	[fp8] Merge feature/fp8_comm to main branch of Colossalai (#6016 )	3 months ago
mixtral.py	[fp8] fix missing fp8_comm flag in mixtral (#6057 )	2 months ago
opt.py	[fp8] support hybrid parallel plugin (#5982 )	3 months ago
qwen2.py	[Feature] Zigzag Ring attention (#5905 )	3 months ago
sam.py	[shardformer] fix linear 1d row and support uneven splits for fused qkv linear (#6084 )	1 month ago
t5.py	[fp8] support hybrid parallel plugin (#5982 )	3 months ago
vit.py	[fp8] support hybrid parallel plugin (#5982 )	3 months ago
whisper.py	[fp8] support hybrid parallel plugin (#5982 )	3 months ago