Commit Graph

55 Commits (1d96a562bb73d33424a8f91ac7463fa4e3b7dada)

Author SHA1 Message Date
Orion-Zheng 43ad0d9ef0 fix wrong EOS token in ColossalChat
1 year ago
flybird11111 be400a0936
[chat] fix gemini strategy (#4698)
1 year ago
Wenhao Chen 901ab1eedd
[chat]: add lora merge weights config (#4766)
1 year ago
Wenhao Chen 7b9b86441f
[chat]: update rm, add wandb and fix bugs (#4471)
1 year ago
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752)
1 year ago
digger yu e4fc57c3de
Optimized some syntax errors in the documentation and code under applications/ (#4127)
1 year ago
Ying Liu c648dc093f fix colossalai version in coati examples
1 year ago
yingliu-hpc 1467e3b41b
[coati] add chatglm model (#4539)
1 year ago
Wenhao Chen 6d41c3f2aa
[doc] update Coati README (#4405)
1 year ago
Wenhao Chen da4f7b855f
[chat] fix bugs and add unit tests (#4213)
1 year ago
Yuanchen 5187c96b7c
support session-based training (#4313)
1 year ago
Xu Kai 1ce997daaf [NFC] polish applications/Chat/examples/train_reward_model.py code style (#4271)
1 year ago
CZYCW dee1c96344 [NFC] policy applications/Chat/examples/ray/mmmt_prompt.py code style (#4250)
1 year ago
Wenhao Chen 3d8d5d0d58
[chat] use official transformers and fix some issues (#4117)
1 year ago
Wenhao Chen edd75a59ea
[chat] remove naive strategy and split colossalai strategy (#4094)
1 year ago
Wenhao Chen b03d64d010
[chat] refactor trainer class (#4080)
1 year ago
Wenhao Chen 153b957a1b
[chat] refactor strategy class with booster api (#3987)
1 year ago
Wenhao Chen 9d02590c9a
[chat] refactor actor class (#3968)
1 year ago
Hongxin Liu b5f0566363
[chat] add distributed PPO trainer (#3740)
1 year ago
binmakeswell ad2cf58f50
[chat] add performance and tutorial (#3786)
2 years ago
Yuanchen 05759839bd
[chat] fix bugs in stage 3 training (#3759)
2 years ago
digger-yu ad6460cf2c
[NFC] fix typo applications/ and colossalai/ (#3735)
2 years ago
MisterLin1995 f7361ee1bd
[chat] fix community example ray (#3719)
2 years ago
zhang-yi-chi 2da5d81dec
[chat] fix train_prompts.py gemini strategy bug (#3666)
2 years ago
digger-yu 65bdc3159f
fix some spelling error with applications/Chat/examples/ (#3692)
2 years ago
Camille Zhong 0f785cb1f3
[chat] PPO stage3 doc enhancement (#3679)
2 years ago
digger-yu 6650daeb0a
[doc] fix chat spelling error (#3671)
2 years ago
tanitna 1a60dc07a8
[chat] typo accimulation_steps -> accumulation_steps (#3662)
2 years ago
binmakeswell 268b3cd80d
[chat] set default zero2 strategy (#3667)
2 years ago
Hongxin Liu 842768a174
[chat] refactor model save/load logic (#3654)
2 years ago
Hongxin Liu 6ef7011462
[chat] remove lm model class (#3653)
2 years ago
Camille Zhong 8bccb72c8d
[Doc] enhancement on README.md for chat examples (#3646)
2 years ago
Hongxin Liu 2a951955ad
[chat] refactor trainer (#3648)
2 years ago
zhang-yi-chi 739cfe3360 [chat] fix enable single gpu training bug
2 years ago
digger-yu d7bf284706
[chat] polish code note typo (#3612)
2 years ago
Yuanchen 1ec0d386a9
reconstruct chat trainer and fix training script (#3588)
2 years ago
Camille Zhong 36a519b49f Update test_ci.sh
2 years ago
tingfeng cao 7788e0b0a5
fix: fix sft (#3568)
2 years ago
Fazzie-Maqianli 6b1a39b17b
[coati] add costom model suppor tguide (#3579)
2 years ago
binmakeswell cc1eec2f53
[chat] update reward model sh (#3578)
2 years ago
csric e355144375
[chatgpt] Detached PPO Training (#3195)
2 years ago
MisterLin1995 1a809eddaa
[chat] ChatGPT train prompts on ray example (#3309)
2 years ago
ver217 89fd10a1c9
[chat] add zero2 cpu strategy for sft training (#3520)
2 years ago
NatalieC323 635d0a1baf
[Chat Community] Update README.md (fixed#3487) (#3506)
2 years ago
binmakeswell 891b8e7fac
[chat] fix stage3 PPO sample sh command (#3477)
2 years ago
Fazzie-Maqianli 6afeb1202a
add community example dictionary (#3465)
2 years ago
YY Lin 62f4e2eb07
[Chat]Add Peft support & fix the ptx bug (#3433)
2 years ago
kingkingofall 57a3c4db6d
[chat]fix readme (#3429)
2 years ago
Camille Zhong 72cb4dd433
[Chat] fix the tokenizer "int too big to convert" error in SFT training (#3453)
2 years ago
Camille Zhong 30412866e0
[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223)
2 years ago