Commit Graph

3676 Commits (8fd25d6e09069a8437c6ebee8dd83e1de4c9b83d)
 

Author SHA1 Message Date
Edenzzzz fbf33ecd01
[Feature] Enable PP + SP for llama (#5868)
5 months ago
Runyu Lu 66abf1c6e8
[HotFix] CI,import,requirements-test for #5838 (#5892)
5 months ago
Runyu Lu cba20525a8
[Feat] Diffusion Model(PixArtAlpha/StableDiffusion3) Support (#5838)
5 months ago
Edenzzzz 8ec24b6a4d
[Hoxfix] Fix CUDA_DEVICE_MAX_CONNECTIONS for comm overlap
5 months ago
Haze188 3420921101
[shardformer] DeepseekMoE support (#5871)
5 months ago
pre-commit-ci[bot] e17f835df7 [pre-commit.ci] auto fixes from pre-commit.com hooks
5 months ago
Hanks 6991819a97
Merge branch 'hpcaitech:main' into feature/fp8_comm
5 months ago
pre-commit-ci[bot] 7997683aac
[pre-commit.ci] pre-commit autoupdate (#5878)
5 months ago
Hongxin Liu 7afbc81d62
[quant] fix bitsandbytes version check (#5882)
5 months ago
Wang Binluo 6cd4c32be4
[shardformer] fix the moe (#5883)
5 months ago
Edenzzzz eb24fcd914
[Hotfix] Fix OPT gradient checkpointing forward
5 months ago
Haze188 ea94c07b95
[hotfix] fix the bug that large tensor exceed the maximum capacity of TensorBucket (#5879)
5 months ago
pre-commit-ci[bot] 7c2f79fa98
[pre-commit.ci] pre-commit autoupdate (#5572)
5 months ago
Edenzzzz 936d0b0f7b
[doc] Update llama + sp compatibility; fix dist optim table
5 months ago
Jianghai 8ab46b4000
[Shardformer] change qwen2 modeling into gradient checkpointing style (#5874)
5 months ago
HangXu f5a52e1600
fp8 operators for compressed communication
5 months ago
YeAnbang ff535204fe update transformers version
5 months ago
Haze188 416580b314
[MoE/ZeRO] Moe refactor with zero refactor (#5821)
5 months ago
YeAnbang a8af6ccb73 fix torch colossalai version
5 months ago
flybird11111 773d9f964a
[shardformer]delete xformers (#5859)
5 months ago
YeAnbang e7527762a1 Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into rlhf_SimPO
5 months ago
Hongxin Liu eaea88cf9e
[release] update version (#5864)
5 months ago
Runyu Lu 3c7cda0c9a
[Inference]Lazy Init Support (#5785)
5 months ago
Guangyao Zhang d9d5e7ea1f
[shardformer] Support the T5ForTokenClassification model (#5816)
5 months ago
Hongxin Liu 5dfbcd7746
[zero] use bucket during allgather (#5860)
5 months ago
YeAnbang b117274074 fix colossalai, transformers version
5 months ago
YeAnbang afa53066ca fix colossalai, transformers version
5 months ago
YeAnbang 384c64057d fix colossalai, transformers version
5 months ago
YeAnbang 8aad064fe7 fix style
5 months ago
YeAnbang c8d1b4a968 add orpo
5 months ago
botbw 8e718a1421
[gemini] fixes for benchmarking (#5847)
5 months ago
Edenzzzz 2a25a2aff7
[Feature] optimize PP overlap (#5735)
5 months ago
binmakeswell 4ccaaaab63
[doc] add GPU cloud playground (#5851)
5 months ago
YeAnbang f3de5a025c remove debug code
5 months ago
YeAnbang 0b2d6275c4 fix dataloader
5 months ago
YeAnbang 4b59d874df Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into main
5 months ago
YeAnbang 82aecd6374 add SimPO
5 months ago
binmakeswell 7266f82d03
[doc] fix open sora model weight link (#5848)
5 months ago
binmakeswell 8f445729a4
[doc] opensora v1.2 news (#5846)
5 months ago
botbw 8a5c86439a
[gemini] fix missing return (#5845)
5 months ago
Hongxin Liu bd3e34fef6
[release] update version (#5833)
5 months ago
Yuanheng Zhao 7b249c76e5
[Fix] Fix spec-dec Glide LlamaModel for compatibility with transformers (#5837)
5 months ago
Guangyao Zhang fd1dc417d8
[shardformer] Change atol in test command-r weight-check to pass pytest (#5835)
5 months ago
Guangyao Zhang 2014cce870
[devops] Remove building on PR when edited to avoid skip issue (#5836)
5 months ago
Kai Lv 0adca5b688
[launch] Support IPv4 host initialization in launch (#5822)
5 months ago
Guangyao Zhang 639394b0d4
Merge pull request #5818 from GuangyaoZhang/command-r
5 months ago
Edenzzzz 7f9ec599be
[misc] Add dist optim to doc sidebar (#5806)
5 months ago
GuangyaoZhang 4adbc36913 Merge branch 'command-r' of github.com:GuangyaoZhang/ColossalAI into command-r
5 months ago
GuangyaoZhang d84d68601a change 'xxx if xxx else None' to 'xxx or None'
5 months ago
pre-commit-ci[bot] 996c65077e [pre-commit.ci] auto fixes from pre-commit.com hooks
5 months ago