ColossalAI/colossalai/shardformer/modeling
pre-commit-ci[bot] f360deb6d0 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-02-03 17:59:57 +00:00
..
chatglm2_6b [pre-commit.ci] pre-commit autoupdate (#5572) 2024-07-01 17:16:41 +08:00
__init__.py
bert.py [fp8] support hybrid parallel plugin (#5982) 2024-08-12 18:17:05 +08:00
blip2.py
bloom.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
chatglm2.py [pre-commit.ci] auto fixes from pre-commit.com hooks 2025-02-03 17:59:57 +00:00
command.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
deepseek.py [moe] add parallel strategy for shared_expert && fix test for deepseek (#6063) 2024-09-18 10:09:01 +08:00
falcon.py
gpt2.py fix 2024-10-15 13:28:33 +08:00
gptj.py [fp8] support hybrid parallel plugin (#5982) 2024-08-12 18:17:05 +08:00
jit.py
llama.py [Zerobubble] merge main. (#6142) 2024-11-19 19:00:36 +08:00
mistral.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
mixtral.py [Zerobubble] merge main. (#6142) 2024-11-19 19:00:36 +08:00
opt.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
qwen2.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
sam.py
t5.py
vit.py
whisper.py