3694 Commits (37e35230ff4666231dd65435b5f7b2a2fcfaf9e6)
 

Author SHA1 Message Date
Stephan Kö 45c49dde96
[Auto Parallel]: Speed up intra-op plan generation by 44% (#5446) 4 months ago
YeAnbang b3594d4d68 fix orpo cross entropy loss 4 months ago
pre-commit-ci[bot] 51f916b11d [pre-commit.ci] auto fixes from pre-commit.com hooks 4 months ago
BurkeHulk 1f1b856354 Merge remote-tracking branch 'origin/feature/fp8_comm' into feature/fp8_comm 4 months ago
BurkeHulk 66018749f3 add fp8_communication flag in the script 4 months ago
BurkeHulk e88190184a support fp8 communication in pipeline parallelism 4 months ago
BurkeHulk 1e1959467e fix scaling algorithm in FP8 casting 4 months ago
Hongxin Liu c068ef0fa0
[zero] support all-gather overlap (#5898) 4 months ago
YeAnbang 115c4cc5a4 hotfix citation 4 months ago
YeAnbang e7a8634636 fix eval 4 months ago
YeAnbang dd9e1cdafe
Merge pull request #5850 from hpcaitech/rlhf_SimPO 4 months ago
pre-commit-ci[bot] 8a9721bafe [pre-commit.ci] auto fixes from pre-commit.com hooks 5 months ago
YeAnbang 33f15203d3 Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into rlhf_SimPO 5 months ago
YeAnbang f6ef5c3609 fix style 5 months ago
YeAnbang d888c3787c add benchmark for sft, dpo, simpo, orpo. Add benchmarking result. Support lora with gradient checkpoint 5 months ago
GuangyaoZhang dbfa7d39fc fix typo 5 months ago
Guangyao Zhang 669849d74b
[ShardFormer] Add Ulysses Sequence Parallelism support for Command-R, Qwen2 and ChatGLM (#5897) 5 months ago
YeAnbang 16f3451fe2 Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into rlhf_SimPO 5 months ago
Edenzzzz fbf33ecd01
[Feature] Enable PP + SP for llama (#5868) 5 months ago
Runyu Lu 66abf1c6e8
[HotFix] CI,import,requirements-test for #5838 (#5892) 5 months ago
Runyu Lu cba20525a8
[Feat] Diffusion Model(PixArtAlpha/StableDiffusion3) Support (#5838) 5 months ago
Edenzzzz 8ec24b6a4d
[Hoxfix] Fix CUDA_DEVICE_MAX_CONNECTIONS for comm overlap 5 months ago
Haze188 3420921101
[shardformer] DeepseekMoE support (#5871) 5 months ago
pre-commit-ci[bot] e17f835df7 [pre-commit.ci] auto fixes from pre-commit.com hooks 5 months ago
Hanks 6991819a97
Merge branch 'hpcaitech:main' into feature/fp8_comm 5 months ago
pre-commit-ci[bot] 7997683aac
[pre-commit.ci] pre-commit autoupdate (#5878) 5 months ago
Hongxin Liu 7afbc81d62
[quant] fix bitsandbytes version check (#5882) 5 months ago
Wang Binluo 6cd4c32be4
[shardformer] fix the moe (#5883) 5 months ago
Edenzzzz eb24fcd914
[Hotfix] Fix OPT gradient checkpointing forward 5 months ago
Haze188 ea94c07b95
[hotfix] fix the bug that large tensor exceed the maximum capacity of TensorBucket (#5879) 5 months ago
pre-commit-ci[bot] 7c2f79fa98
[pre-commit.ci] pre-commit autoupdate (#5572) 5 months ago
Edenzzzz 936d0b0f7b
[doc] Update llama + sp compatibility; fix dist optim table 5 months ago
Jianghai 8ab46b4000
[Shardformer] change qwen2 modeling into gradient checkpointing style (#5874) 5 months ago
HangXu f5a52e1600
fp8 operators for compressed communication 5 months ago
YeAnbang ff535204fe update transformers version 5 months ago
Haze188 416580b314
[MoE/ZeRO] Moe refactor with zero refactor (#5821) 5 months ago
YeAnbang a8af6ccb73 fix torch colossalai version 5 months ago
flybird11111 773d9f964a
[shardformer]delete xformers (#5859) 5 months ago
YeAnbang e7527762a1 Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into rlhf_SimPO 5 months ago
Hongxin Liu eaea88cf9e
[release] update version (#5864) 5 months ago
Runyu Lu 3c7cda0c9a
[Inference]Lazy Init Support (#5785) 5 months ago
Guangyao Zhang d9d5e7ea1f
[shardformer] Support the T5ForTokenClassification model (#5816) 5 months ago
Hongxin Liu 5dfbcd7746
[zero] use bucket during allgather (#5860) 5 months ago
YeAnbang b117274074 fix colossalai, transformers version 5 months ago
YeAnbang afa53066ca fix colossalai, transformers version 5 months ago
YeAnbang 384c64057d fix colossalai, transformers version 5 months ago
YeAnbang 8aad064fe7 fix style 5 months ago
YeAnbang c8d1b4a968 add orpo 5 months ago
botbw 8e718a1421
[gemini] fixes for benchmarking (#5847) 5 months ago
Edenzzzz 2a25a2aff7
[Feature] optimize PP overlap (#5735) 5 months ago