Commit Graph

135 Commits (997544c1f90b9a1549e91a6d97ee3902c2ac0ed4)

Author SHA1 Message Date
Hongxin Liu b5f0566363
[chat] add distributed PPO trainer (#3740)
1 year ago
Yuanchen 57a6d7685c
support evaluation for english (#3880)
1 year ago
Yuanchen 2506e275b8
[evaluation] improvement on evaluation (#3862)
2 years ago
digger yu e2d81eba0d
[nfc] fix typo colossalai/ applications/ (#3831)
2 years ago
Yuanchen 34966378e8
[evaluation] add automatic evaluation pipeline (#3821)
2 years ago
digger yu 9265f2d4d7
[NFC]fix typo colossalai/auto_parallel nn utils etc. (#3779)
2 years ago
github-actions[bot] 62c7e67f9f
[format] applied code formatting on changed files in pull request 3786 (#3787)
2 years ago
binmakeswell ad2cf58f50
[chat] add performance and tutorial (#3786)
2 years ago
Yuanchen 05759839bd
[chat] fix bugs in stage 3 training (#3759)
2 years ago
digger-yu ad6460cf2c
[NFC] fix typo applications/ and colossalai/ (#3735)
2 years ago
digger-yu b7141c36dd
[CI] fix some spelling errors (#3707)
2 years ago
MisterLin1995 f7361ee1bd
[chat] fix community example ray (#3719)
2 years ago
zhang-yi-chi 2da5d81dec
[chat] fix train_prompts.py gemini strategy bug (#3666)
2 years ago
digger-yu 65bdc3159f
fix some spelling error with applications/Chat/examples/ (#3692)
2 years ago
Tong Li b36e67cb2b
Merge pull request #3680 from digger-yu/digger-yu-patch-2
2 years ago
Camille Zhong 0f785cb1f3
[chat] PPO stage3 doc enhancement (#3679)
2 years ago
digger-yu 6650daeb0a
[doc] fix chat spelling error (#3671)
2 years ago
Hongxin Liu 7bd0bee8ea
[chat] add opt attn kernel (#3655)
2 years ago
digger-yu 8ba7858753
Update generate_gpt35_answers.py
2 years ago
digger-yu bfbf650588
fix spelling error
2 years ago
tanitna 1a60dc07a8
[chat] typo accimulation_steps -> accumulation_steps (#3662)
2 years ago
Tong Li 816add7e7f
Merge pull request #3656 from TongLi3701/chat/update_eval
2 years ago
binmakeswell 268b3cd80d
[chat] set default zero2 strategy (#3667)
2 years ago
Tong Li c1a355940e update readme
2 years ago
Tong Li ed3eaa6922 update documentation
2 years ago
Tong Li c419117329 update questions and readme
2 years ago
Tong Li aa77ddae33 remove unnecessary step and update readme
2 years ago
Hongxin Liu 842768a174
[chat] refactor model save/load logic (#3654)
2 years ago
Hongxin Liu 6ef7011462
[chat] remove lm model class (#3653)
2 years ago
Camille Zhong 8bccb72c8d
[Doc] enhancement on README.md for chat examples (#3646)
2 years ago
Hongxin Liu 2a951955ad
[chat] refactor trainer (#3648)
2 years ago
Hongxin Liu f8288315d9
[chat] polish performance evaluator (#3647)
2 years ago
Hongxin Liu 50793b35f4
[gemini] accelerate inference (#3641)
2 years ago
Tong Li e1b0a78afa
Merge pull request #3621 from zhang-yi-chi/fix/chat-train-prompts-single-gpu
2 years ago
ddobokki df309fc6ab
[Chat] Remove duplicate functions (#3625)
2 years ago
zhang-yi-chi 739cfe3360 [chat] fix enable single gpu training bug
2 years ago
digger-yu d7bf284706
[chat] polish code note typo (#3612)
2 years ago
Yuanchen c4709d34cf
Chat evaluate (#3608)
2 years ago
binmakeswell 5a79cffdfd
[coati] fix install cmd (#3592)
2 years ago
Yuanchen 1ec0d386a9
reconstruct chat trainer and fix training script (#3588)
2 years ago
Camille Zhong 36a519b49f Update test_ci.sh
2 years ago
tingfeng cao 7788e0b0a5
fix: fix sft (#3568)
2 years ago
Fazzie-Maqianli 6b1a39b17b
[coati] add costom model suppor tguide (#3579)
2 years ago
binmakeswell cc1eec2f53
[chat] update reward model sh (#3578)
2 years ago
csric e355144375
[chatgpt] Detached PPO Training (#3195)
2 years ago
MisterLin1995 1a809eddaa
[chat] ChatGPT train prompts on ray example (#3309)
2 years ago
binmakeswell 535b896435
[chat] polish tutorial doc (#3551)
2 years ago
Yuanchen 7182ac2a04
[chat]add examples of training with limited resources in chat readme (#3536)
2 years ago
zhang-yi-chi e6a132a449
[chat]: add vf_coef argument for PPOTrainer (#3318)
2 years ago
ver217 89fd10a1c9
[chat] add zero2 cpu strategy for sft training (#3520)
2 years ago