3394 Commits (colossalchat_upgrade)
 

Author SHA1 Message Date
YeAnbang 82aecd6374 add SimPO 5 months ago
YeAnbang 84eab13078 update sft trainning script 5 months ago
YeAnbang 2abdede1d7 fix readme 5 months ago
YeAnbang 77db21610a replace the customized dataloader setup with the build-in one 6 months ago
YeAnbang 0d7ff10ea5 replace the customized dataloader setup with the build-in one 6 months ago
YeAnbang 790e1362a6 merge 6 months ago
YeAnbang ac1520cb8f remove baichuan from template test due to transformer version conflict 6 months ago
YeAnbang e16ccc272a update ci 6 months ago
YeAnbang 45195ac53d remove local data path 6 months ago
YeAnbang bf57b13dda remove models that require huggingface auth from ci 6 months ago
YeAnbang 0bbac158ed fix datasets version 6 months ago
YeAnbang 62eb28b929 remove duplicated test 6 months ago
YeAnbang b8b5cacf38 fix transformers version 6 months ago
pre-commit-ci[bot] 1b880ce095 [pre-commit.ci] auto fixes from pre-commit.com hooks 6 months ago
YeAnbang b1031f7244 fix ci 6 months ago
YeAnbang 7ae87b3159 fix training script 6 months ago
YeAnbang 0b4a33548c moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy 6 months ago
YeAnbang 7e65b71815 run pre-commit 6 months ago
YeAnbang 929e1e3da4 upgrade ppo dpo rm script 6 months ago
YeAnbang 7a7e86987d upgrade colossal-chat support tp_group>1, add sp for sft 6 months ago
Hongxin Liu 73e88a5553 [shardformer] fix import (#5788) 6 months ago
Hongxin Liu 5ead00ffc5 [misc] update requirements (#5787) 6 months ago
flybird11111 a1e39f4c0d [install]fix setup (#5786) 6 months ago
Hongxin Liu b9d646fe9e [misc] fix dist logger (#5782) 6 months ago
Charles Coulombe c46e09715c Allow building cuda extension without a device. (#5535) 6 months ago
botbw 3f7e3131d9 [gemini] optimize reduce scatter d2h copy (#5760) 6 months ago
duanjunwen 10a19e22c6 [hotfix] fix testcase in test_fx/test_tracer (#5779) 6 months ago
botbw 80c3c8789b [Test/CI] remove test cases to reduce CI duration (#5753) 6 months ago
Edenzzzz 79f7a7b211 [misc] Accelerate CI for zero and dist optim (#5758) 6 months ago
flybird11111 50b4c8e8cf [hotfix] fix llama flash attention forward (#5777) 6 months ago
yuehuayingxueluo b45000f839 [Inference]Add Streaming LLM (#5745) 6 months ago
Hongxin Liu ee6fd38373 [devops] fix docker ci (#5780) 6 months ago
Hongxin Liu 32f4187806 [misc] update dockerfile (#5776) 6 months ago
Haze188 e22b82755d [CI/tests] simplify some test case to reduce testing time (#5755) 6 months ago
Yuanheng Zhao 406443200f [Hotfix] Add missing init file in inference.executor (#5774) 6 months ago
duanjunwen 1b76564e16 [test] Fix/fix testcase (#5770) 6 months ago
flybird11111 3f2be80530 fix (#5765) 6 months ago
Hongxin Liu 68359ed1e1 [release] update version (#5752) 6 months ago
Yuanheng Zhao 677cbfacf8 [Fix/Example] Fix Llama Inference Loading Data Type (#5763) 6 months ago
botbw 023ea13cb5 Merge pull request #5749 from hpcaitech/prefetch 6 months ago
hxwang 154720ba6e [chore] refactor profiler utils 6 months ago
hxwang 8547562884 [chore] remove unnecessary assert since compute list might not be recorded 6 months ago
hxwang e5e3320948 [bug] continue fix 6 months ago
hxwang 936dd96dbb [bug] workaround for idx fix 6 months ago
botbw e0dde8fda5 Merge pull request #5754 from Hz188/prefetch 6 months ago
botbw 157b4cc357 Merge branch 'prefetch' into prefetch 6 months ago
genghaozhe 87665d7922 correct argument help message 6 months ago
Haze188 4d097def96 [Gemini] add some code for reduce-scatter overlap, chunk prefetch in llama benchmark. (#5751) 6 months ago
genghaozhe b9269d962d add args.prefetch_num for benchmark 6 months ago
genghaozhe fba04e857b [bugs] fix args.profile=False DummyProfiler errro 6 months ago