Commit Graph

3574 Commits (9179d4088e378c437178432168dea9f32fbf739f)
 

Author SHA1 Message Date
GuangyaoZhang 363cde6957 merge model and attention forward
5 months ago
GuangyaoZhang 7a2b08646f Remove CohereLayerNorm and use existing layernorm
5 months ago
GuangyaoZhang fe2e74c03a fix precommit
5 months ago
GuangyaoZhang 98da648a4a Fix Code Factor check
5 months ago
GuangyaoZhang f656d61778 change command
5 months ago
GuangyaoZhang 0b81163bc0 Copy llama to command
5 months ago
Edenzzzz 8795bb2e80
Support 4d parallel + flash attention (#5789)
5 months ago
GuangyaoZhang 3c7302ad0e merge model and attention forward
5 months ago
GuangyaoZhang 8c3f524660 Remove CohereLayerNorm and use existing layernorm
6 months ago
GuangyaoZhang c9025ebd7c Merge branch 'command-r' of github.com:GuangyaoZhang/ColossalAI into command-r
6 months ago
GuangyaoZhang 9a290ab013 fix precommit
6 months ago
pre-commit-ci[bot] 2a7fa2e7d0 [pre-commit.ci] auto fixes from pre-commit.com hooks
6 months ago
GuangyaoZhang 1016bb3257 Fix Code Factor check
6 months ago
GuangyaoZhang 94fbde6055 change command
6 months ago
GuangyaoZhang 431b7bcf8f Copy llama to command
6 months ago
flybird11111 2ddf624a86
[shardformer] upgrade transformers to 4.39.3 (#5815)
6 months ago
botbw 3bcbba9262
[gemini] quick fix on possible async operation (#5803)
6 months ago
Haze188 d9dddf574f
[Gemini] Use async stream to prefetch and h2d data moving (#5781)
6 months ago
Li Xingjian 8554585a5f
[Inference] Fix flash-attn import and add model test (#5794)
6 months ago
Guangyao Zhang aac941ef78
[test] fix qwen2 pytest distLarge (#5797)
6 months ago
Hongxin Liu aa125bcc91
[shardformer] fix modeling of bloom and falcon (#5796)
6 months ago
Hongxin Liu 587bbf4c6d
[test] fix chatglm test kit (#5793)
6 months ago
YeAnbang 74f4a29734
Merge pull request #5759 from hpcaitech/colossalchat_upgrade
6 months ago
Runyu Lu c0948aff97
[Inference]refactor baichuan (#5791)
6 months ago
YeAnbang 84eab13078 update sft trainning script
6 months ago
Li Xingjian 77a219a082
Merge pull request #5771 from char-1ee/refactor/modeling
6 months ago
char-1ee b303976a27 Fix test import
6 months ago
YeAnbang 2abdede1d7 fix readme
6 months ago
char-1ee f5981e808e Remove flash attention backend
6 months ago
YeAnbang 77db21610a replace the customized dataloader setup with the build-in one
6 months ago
YeAnbang 0d7ff10ea5 replace the customized dataloader setup with the build-in one
6 months ago
char-1ee ceba662d22 Clean up
6 months ago
char-1ee 5f398fc000 Pass inference model shard configs for module init
6 months ago
char-1ee eec77e5702 Fix tests and naming
6 months ago
char-1ee 04386d9eff Refactor modeling by adding attention backend
6 months ago
YeAnbang 790e1362a6 merge
6 months ago
YeAnbang ac1520cb8f remove baichuan from template test due to transformer version conflict
6 months ago
YeAnbang e16ccc272a update ci
6 months ago
YeAnbang 45195ac53d remove local data path
6 months ago
YeAnbang bf57b13dda remove models that require huggingface auth from ci
6 months ago
YeAnbang 0bbac158ed fix datasets version
6 months ago
YeAnbang 62eb28b929 remove duplicated test
6 months ago
YeAnbang b8b5cacf38 fix transformers version
6 months ago
pre-commit-ci[bot] 1b880ce095 [pre-commit.ci] auto fixes from pre-commit.com hooks
6 months ago
YeAnbang b1031f7244 fix ci
6 months ago
YeAnbang 7ae87b3159 fix training script
6 months ago
YeAnbang 0b4a33548c moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy
6 months ago
YeAnbang 7e65b71815 run pre-commit
6 months ago
YeAnbang 929e1e3da4 upgrade ppo dpo rm script
6 months ago
YeAnbang 7a7e86987d upgrade colossal-chat support tp_group>1, add sp for sft
6 months ago