Commit Graph

254 Commits (1b76564e1607aa8cf24566c794977b260de44f6c)

Author SHA1 Message Date
Xu Kai 1ce997daaf [NFC] polish applications/Chat/examples/train_reward_model.py code style (#4271)
1 year ago
shenggan 798cb72907 [NFC] polish applications/Chat/coati/trainer/base.py code style (#4260)
1 year ago
Zheng Zangwei (Alex Zheng) b2debdc09b [NFC] polish applications/Chat/coati/dataset/sft_dataset.py code style (#4259)
1 year ago
CZYCW dee1c96344 [NFC] policy applications/Chat/examples/ray/mmmt_prompt.py code style (#4250)
1 year ago
Junming Wu 77c469e1ba [NFC] polish applications/Chat/coati/models/base/actor.py code style (#4248)
1 year ago
Camille Zhong 915ed8bed1 [NFC] polish applications/Chat/inference/requirements.txt code style (#4265)
1 year ago
Frank Lee f447ca1811 [chat] removed cache file (#4155)
1 year ago
wukong1992 c1c672d0f0 [shardformer] shardformer support t5 model (#3994)
1 year ago
Wenhao Chen 3d8d5d0d58
[chat] use official transformers and fix some issues (#4117)
1 year ago
Wenhao Chen edd75a59ea
[chat] remove naive strategy and split colossalai strategy (#4094)
1 year ago
Wenhao Chen b03d64d010
[chat] refactor trainer class (#4080)
1 year ago
Baizhou Zhang 4da324cd60
[hotfix]fix argument naming in docs and examples (#4083)
1 year ago
Michelle e89b127d8e
[chat]: fix chat evaluation possible bug (#4064)
1 year ago
Wenhao Chen 153b957a1b
[chat] refactor strategy class with booster api (#3987)
1 year ago
digger yu 727c4598a9
[nfc] fix dim not defined and fix typo (#3991)
1 year ago
digger yu d4fb7bfda7
fix typo applications/Chat/coati/ (#3947)
1 year ago
Yuanchen 2925f47399
[evaluate] support gpt evaluation with reference (#3972)
1 year ago
Wenhao Chen 9d02590c9a
[chat] refactor actor class (#3968)
1 year ago
Yuanchen 21c4c0b1a0
support UniEval and add CHRF metric (#3924)
1 year ago
Hongxin Liu b5f0566363
[chat] add distributed PPO trainer (#3740)
2 years ago
Yuanchen 57a6d7685c
support evaluation for english (#3880)
2 years ago
Yuanchen 2506e275b8
[evaluation] improvement on evaluation (#3862)
2 years ago
digger yu e2d81eba0d
[nfc] fix typo colossalai/ applications/ (#3831)
2 years ago
Yuanchen 34966378e8
[evaluation] add automatic evaluation pipeline (#3821)
2 years ago
digger yu 9265f2d4d7
[NFC]fix typo colossalai/auto_parallel nn utils etc. (#3779)
2 years ago
github-actions[bot] 62c7e67f9f
[format] applied code formatting on changed files in pull request 3786 (#3787)
2 years ago
binmakeswell ad2cf58f50
[chat] add performance and tutorial (#3786)
2 years ago
Yuanchen 05759839bd
[chat] fix bugs in stage 3 training (#3759)
2 years ago
digger-yu ad6460cf2c
[NFC] fix typo applications/ and colossalai/ (#3735)
2 years ago
digger-yu b7141c36dd
[CI] fix some spelling errors (#3707)
2 years ago
MisterLin1995 f7361ee1bd
[chat] fix community example ray (#3719)
2 years ago
zhang-yi-chi 2da5d81dec
[chat] fix train_prompts.py gemini strategy bug (#3666)
2 years ago
digger-yu 65bdc3159f
fix some spelling error with applications/Chat/examples/ (#3692)
2 years ago
Tong Li b36e67cb2b
Merge pull request #3680 from digger-yu/digger-yu-patch-2
2 years ago
Camille Zhong 0f785cb1f3
[chat] PPO stage3 doc enhancement (#3679)
2 years ago
digger-yu 6650daeb0a
[doc] fix chat spelling error (#3671)
2 years ago
Hongxin Liu 7bd0bee8ea
[chat] add opt attn kernel (#3655)
2 years ago
digger-yu 8ba7858753
Update generate_gpt35_answers.py
2 years ago
digger-yu bfbf650588
fix spelling error
2 years ago
tanitna 1a60dc07a8
[chat] typo accimulation_steps -> accumulation_steps (#3662)
2 years ago
Tong Li 816add7e7f
Merge pull request #3656 from TongLi3701/chat/update_eval
2 years ago
binmakeswell 268b3cd80d
[chat] set default zero2 strategy (#3667)
2 years ago
Tong Li c1a355940e update readme
2 years ago
Tong Li ed3eaa6922 update documentation
2 years ago
Tong Li c419117329 update questions and readme
2 years ago
Tong Li aa77ddae33 remove unnecessary step and update readme
2 years ago
Hongxin Liu 842768a174
[chat] refactor model save/load logic (#3654)
2 years ago
Hongxin Liu 6ef7011462
[chat] remove lm model class (#3653)
2 years ago
Camille Zhong 8bccb72c8d
[Doc] enhancement on README.md for chat examples (#3646)
2 years ago
Hongxin Liu 2a951955ad
[chat] refactor trainer (#3648)
2 years ago
Hongxin Liu f8288315d9
[chat] polish performance evaluator (#3647)
2 years ago
Hongxin Liu 50793b35f4
[gemini] accelerate inference (#3641)
2 years ago
Tong Li e1b0a78afa
Merge pull request #3621 from zhang-yi-chi/fix/chat-train-prompts-single-gpu
2 years ago
ddobokki df309fc6ab
[Chat] Remove duplicate functions (#3625)
2 years ago
zhang-yi-chi 739cfe3360 [chat] fix enable single gpu training bug
2 years ago
digger-yu d7bf284706
[chat] polish code note typo (#3612)
2 years ago
Yuanchen c4709d34cf
Chat evaluate (#3608)
2 years ago
binmakeswell 5a79cffdfd
[coati] fix install cmd (#3592)
2 years ago
Yuanchen 1ec0d386a9
reconstruct chat trainer and fix training script (#3588)
2 years ago
Camille Zhong 36a519b49f Update test_ci.sh
2 years ago
tingfeng cao 7788e0b0a5
fix: fix sft (#3568)
2 years ago
Fazzie-Maqianli 6b1a39b17b
[coati] add costom model suppor tguide (#3579)
2 years ago
binmakeswell cc1eec2f53
[chat] update reward model sh (#3578)
2 years ago
csric e355144375
[chatgpt] Detached PPO Training (#3195)
2 years ago
MisterLin1995 1a809eddaa
[chat] ChatGPT train prompts on ray example (#3309)
2 years ago
binmakeswell 535b896435
[chat] polish tutorial doc (#3551)
2 years ago
Yuanchen 7182ac2a04
[chat]add examples of training with limited resources in chat readme (#3536)
2 years ago
zhang-yi-chi e6a132a449
[chat]: add vf_coef argument for PPOTrainer (#3318)
2 years ago
ver217 89fd10a1c9
[chat] add zero2 cpu strategy for sft training (#3520)
2 years ago
binmakeswell 990d4c3e4e
[doc] hide diffusion in application path (#3519)
2 years ago
binmakeswell 0c0455700f
[doc] add requirement and highlight application (#3516)
2 years ago
NatalieC323 635d0a1baf
[Chat Community] Update README.md (fixed#3487) (#3506)
2 years ago
gongenlei a7ca297281
[coati] Fix LlamaCritic (#3475)
2 years ago
binmakeswell 891b8e7fac
[chat] fix stage3 PPO sample sh command (#3477)
2 years ago
Fazzie-Maqianli 6afeb1202a
add community example dictionary (#3465)
2 years ago
Frank Lee 80eba05b0a
[test] refactor tests with spawn (#3452)
2 years ago
YY Lin 62f4e2eb07
[Chat]Add Peft support & fix the ptx bug (#3433)
2 years ago
Dr-Corgi 73afb63594
[chat]fix save_model(#3377)
2 years ago
kingkingofall 57a3c4db6d
[chat]fix readme (#3429)
2 years ago
Camille Zhong 72cb4dd433
[Chat] fix the tokenizer "int too big to convert" error in SFT training (#3453)
2 years ago
Yuanchen b92313903f
fix save_model indent error in ppo trainer (#3450)
2 years ago
Yuanchen 773955abfa
fix save_model inin naive and ddp strategy (#3436)
2 years ago
ver217 26b7aac0be
[zero] reorganize zero/gemini folder structure (#3424)
2 years ago
Yuanchen b09adff724
[chat]fix sft training for bloom, gpt and opt (#3418)
2 years ago
Camille Zhong 30412866e0
[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223)
2 years ago
Andrew 82132f4e3d
[chat] correcting a few obvious typos and grammars errors (#3338)
2 years ago
Fazzie-Maqianli 0fbadce79c
[doc] added authors to the chat application (#3307)
2 years ago
BlueRum b512893637
Polish readme link (#3306)
2 years ago
github-actions[bot] cb413ccf28
[format] applied code formatting on changed files in pull request 3300 (#3302)
2 years ago
binmakeswell 31c78f2be3
[doc] add ColossalChat news (#3304)
2 years ago
Frank Lee e235a24673
[application] updated the README (#3301)
2 years ago
BlueRum 8257e1055d
[chat]polish prompts training (#3300)
2 years ago
ver217 62f7156131
[coati] fix inference profanity check (#3299)
2 years ago
github-actions[bot] 5134ad5d1a
[format] applied code formatting on changed files in pull request 3296 (#3298)
2 years ago
BlueRum c8b723d6c2
[chat]Update Readme (#3296)
2 years ago
ver217 73b542a124
[coati] inference supports profanity check (#3295)
2 years ago
ver217 ce2cafae76
[coati] add repetition_penalty for inference (#3294)
2 years ago
Fazzie-Maqianli a88ed0f83a
add limit (#3293)
2 years ago
Fazzie-Maqianli c5484281aa
[ColossalChat]add cite for datasets (#3292)
2 years ago
Fazzie-Maqianli ec7af22a43
fix image (#3288)
2 years ago
Fazzie-Maqianli 1f7d9afbf8
add example (#3286)
2 years ago
ver217 4905b21b94
[coati] fix inference output (#3285)
2 years ago
Fazzie-Maqianli bb6196e71a
remove chatgpt (#3284)
2 years ago
Fazzie-Maqianli b0ce5a1032
[Coati] first commit (#3283)
2 years ago
binmakeswell d32ef94ad9
[doc] fix typo (#3222)
2 years ago
ver217 78fd31f9c1
[chatgpt] add precision option for colossalai (#3233)
2 years ago
Fazzie-Maqianli bd39877da4
support instrcut training (#3230)
2 years ago
Camille Zhong 9bc702ab48
[doc] update chatgpt doc paper link (#3229)
2 years ago
Fazzie-Maqianli bbac6760e5
fix torch version (#3225)
2 years ago
Fazzie-Maqianli fa97a9cab4
[chatgpt] unnify datasets (#3218)
2 years ago
Fazzie-Maqianli 4fd4bd9d9a
[chatgpt] support instuct training (#3216)
2 years ago
Yuanchen 9998d5ef64
[chatgpt]add reward model code for deberta (#3199)
2 years ago
Fazzie-Maqianli 1e1b9d2fea
[chatgpt]support llama (#3070)
2 years ago
pgzhang b429529365
[chatgpt] add supervised learning fine-tune code (#3183)
2 years ago
BlueRum 7548ca5a54
[chatgpt]Reward Model Training Process update (#3133)
2 years ago
ver217 1e58d31bb7
[chatgpt] fix trainer generate kwargs (#3166)
2 years ago
ver217 c474fda282
[chatgpt] fix ppo training hanging problem with gemini (#3162)
2 years ago
binmakeswell 3c01280a56
[doc] add community contribution guide (#3153)
2 years ago
BlueRum 23cd5e2ccf
[chatgpt]update ci (#3087)
2 years ago
BlueRum 68577fbc43
[chatgpt]Fix examples (#3116)
2 years ago
BlueRum 0672b5afac
[chatgpt] fix lora support for gpt (#3113)
2 years ago
hiko2MSP 191daf7411
[chatgpt] type miss of kwargs (#3107)
2 years ago
BlueRum c9dd036592
[chatgpt] fix lora save bug (#3099)
2 years ago
Fazzie-Maqianli 02ae80bf9c
[chatgpt]add flag of action mask in critic(#3086)
2 years ago
wenjunyang b51bfec357
[chatgpt] change critic input as state (#3042)
2 years ago
Fazzie-Maqianli c21b11edce
change nn to models (#3032)
2 years ago
github-actions[bot] e86d9bb2e1
[format] applied code formatting on changed files in pull request 3025 (#3026)
2 years ago
BlueRum 55dcd3051a
[chatgpt] fix readme (#3025)
2 years ago
LuGY 287d60499e
[chatgpt] Add saving ckpt callback for PPO (#2880)
2 years ago
BlueRum e588703454
[chatgpt]fix inference model load (#2988)
2 years ago
ver217 0ff8406b00
[chatgpt] allow shard init and display warning (#2986)
2 years ago
BlueRum f5ca0397dd
[chatgpt] fix lora gemini conflict in RM training (#2984)
2 years ago
ver217 19ad49fb3b
[chatgpt] making experience support dp (#2971)
2 years ago
BlueRum c9e27f0d1b
[chatgpt]fix lora bug (#2974)
2 years ago
BlueRum 82149e9d1b
[chatgpt] fix inference demo loading bug (#2969)
2 years ago
Fazzie-Maqianli bbf9c827c3
[ChatGPT] fix README (#2966)
2 years ago
binmakeswell b0a8766381
[doc] fix chatgpt inference typo (#2964)
2 years ago
BlueRum 489a9566af
[chatgpt]add inference example (#2944)
2 years ago
binmakeswell 8264cd7ef1
[doc] add env scope (#2933)
2 years ago
BlueRum 2e16f842a9
[chatgpt]support opt & gpt for rm training (#2876)
2 years ago
BlueRum 34ca324b0d
[chatgpt] Support saving ckpt in examples (#2846)
2 years ago
BlueRum 3eebc4dff7
[chatgpt] fix rm eval (#2829)
2 years ago
ver217 b6a108cb91
[chatgpt] add test checkpoint (#2797)
2 years ago
ver217 a619a190df
[chatgpt] update readme about checkpoint (#2792)
2 years ago
ver217 4ee311c026
[chatgpt] startegy add prepare method (#2766)
2 years ago
ver217 a88bc828d5
[chatgpt] disable shard init for colossalai (#2767)
2 years ago
BlueRum 613efebc5c
[chatgpt] support colossalai strategy to train rm (#2742)
2 years ago
BlueRum 648183a960
[chatgpt]fix train_rm bug with lora (#2741)
2 years ago
CH.Li 7aacfad8af
fix typo (#2721)
2 years ago
ver217 9c0943ecdb
[chatgpt] optimize generation kwargs (#2717)
2 years ago