69 Commits (cf519dac6a5799b8f314aac6f510e2a98d3af9c6)

Author SHA1 Message Date
Wenxuan Tan 8fd25d6e09
[Feature] Split cross-entropy computation in SP (#5959) 2 months ago
Wang Binluo eea37da6fa
[fp8] Merge feature/fp8_comm to main branch of Colossalai (#6016) 3 months ago
Edenzzzz f5c84af0b0
[Feature] Zigzag Ring attention (#5905) 3 months ago
flybird11111 0c10afd372
[FP8] rebase main (#5963) 4 months ago
haze188 034020bd04 [misc] remove debug/print code 4 months ago
haze188 b2952a5982 [moe] deepseek moe sp support 4 months ago
Haze188 404b16faf3 [Feature] MoE Ulysses Support (#5918) 4 months ago
hxwang 46c069b0db [zero] solve hang 4 months ago
hxwang a249e71946 [test] mixtra pp shard test 4 months ago
hxwang 0b76b57cd6 [test] add mixtral transformer test 4 months ago
Guangyao Zhang d9d5e7ea1f
[shardformer] Support the T5ForTokenClassification model (#5816) 5 months ago
GuangyaoZhang fe2e74c03a fix precommit 5 months ago
GuangyaoZhang f656d61778 change command 5 months ago
GuangyaoZhang 9a290ab013 fix precommit 5 months ago
pre-commit-ci[bot] 2a7fa2e7d0 [pre-commit.ci] auto fixes from pre-commit.com hooks 5 months ago
GuangyaoZhang 94fbde6055 change command 5 months ago
Hongxin Liu 587bbf4c6d
[test] fix chatglm test kit (#5793) 5 months ago
botbw 80c3c8789b
[Test/CI] remove test cases to reduce CI duration (#5753) 6 months ago
Wang Binluo a3cc68ca93
[Shardformer] Support the Qwen2 model (#5699) 7 months ago
Hongxin Liu bbb2c21f16
[shardformer] fix chatglm implementation (#5644) 7 months ago
Wang Binluo 0d0a582033
[shardformer] update transformers (#5583) 7 months ago
Hongxin Liu 641b1ee71a
[devops] remove post commit ci (#5566) 8 months ago
Zhongkai Zhao 8e412a548e
[shardformer] Sequence Parallelism Optimization (#5533) 8 months ago
Wenhao Chen e614aa34f3
[shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous shard policy for llama (#5508) 8 months ago
Frank Lee d69cd2eb89
[workflow] fixed oom tests (#5275) 10 months ago
Frank Lee d5eeeb1416
[ci] fixed booster test (#5251) 11 months ago
flybird11111 21aa5de00b
[gemini] hotfix NaN loss while using Gemini + tensor_parallel (#5150) 12 months ago
Wenhao Chen 7172459e74
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 1 year ago
Jianghai cf579ff46d
[Inference] Dynamic Batching Inference, online and offline (#4953) 1 year ago
Hongxin Liu b8e770c832
[test] merge old components to test to model zoo (#4945) 1 year ago
Jianghai ce7ade3882
[inference] chatglm2 infer demo (#4724) 1 year ago
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752) 1 year ago
digger yu 9c2feb2f0b
fix some typo with colossalai/device colossalai/tensor/ etc. (#4171) 1 year ago
flybird11111 eedaa3e1ef
[shardformer]fix gpt2 double head (#4663) 1 year ago
flybird11111 7486ed7d3a
[shardformer] update llama2/opt finetune example and fix llama2 policy (#4645) 1 year ago
Hongxin Liu 27061426f7
[gemini] improve compatibility and add static placement policy (#4479) 1 year ago
flybird11111 59e252ecdb
[shardformer] chatglm support sequence parallel (#4482) 1 year ago
Jianghai 5545114fd8
rename chatglm to chatglm2 (#4484) 1 year ago
flybird11111 108e54a0b4 [shardformer]update t5 tests for using all optimizations. (#4407) 1 year ago
flybird11111 1edc9b5fb3 [shardformer] update tests for all optimization (#4413) 1 year ago
Baizhou Zhang 7711bd524a [shardformer] rewrite tests for opt/bloom/llama/vit/chatglm (#4395) 1 year ago
Jianghai 7596e9ae08 [pipeline] rewrite bert tests and fix some bugs (#4409) 1 year ago
flybird1111 906426cb44 [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
Jianghai a88e92251d [pipeline] add chatglm (#4363) 1 year ago
Baizhou Zhang b1feeced8e [shardformer] add util functions for shardformer tests/fix sync_shared_param (#4366) 1 year ago
Bin Jia 5c6f183192 [test] Hotfix/fix some model test and refactor check util api (#4369) 1 year ago
FoolPlayer 879301d0da [shardformer] support Blip2 (#4243) 1 year ago
klhhhhh 8120eca0c0 [shardformer] support ChatGLMForConditionalGeneration & add fusedlayernorm for vit 1 year ago
klhhhhh 4da05052f4 [shardformer] pre-commit check files 1 year ago
klhhhhh f155ae89c4 [shardformer] ChatGLM support layernorm sharding 1 year ago