Commit Graph

52 Commits (07c2e3d09cd6bf42f280f20f0cc2ba2eb47677cc)

Author SHA1 Message Date
Hongxin Liu 079bf3cb26
[misc] update pre-commit and run all files (#4752)
1 year ago
digger yu e4fc57c3de
Optimized some syntax errors in the documentation and code under applications/ (#4127)
1 year ago
yingliu-hpc 1467e3b41b
[coati] add chatglm model (#4539)
1 year ago
Wenhao Chen 6d41c3f2aa
[doc] update Coati README (#4405)
1 year ago
Wenhao Chen da4f7b855f
[chat] fix bugs and add unit tests (#4213)
1 year ago
Wenhao Chen 75c5389037
[chat] fix compute_approx_kl (#4338)
1 year ago
Yuanchen 5187c96b7c
support session-based training (#4313)
1 year ago
yuxuan-lou 0991405361 [NFC] polish applications/Chat/coati/models/utils.py codestyle (#4277)
1 year ago
Zirui Zhu 9e512938f6 [NFC] polish applications/Chat/coati/trainer/strategies/base.py code style (#4278)
1 year ago
RichardoLuo 709e121cd5 [NFC] polish applications/Chat/coati/models/generation.py code style (#4275)
1 year ago
アマデウス caa4433072 [NFC] fix format of application/Chat/coati/trainer/utils.py (#4273)
1 year ago
shenggan 798cb72907 [NFC] polish applications/Chat/coati/trainer/base.py code style (#4260)
1 year ago
Zheng Zangwei (Alex Zheng) b2debdc09b [NFC] polish applications/Chat/coati/dataset/sft_dataset.py code style (#4259)
1 year ago
Junming Wu 77c469e1ba [NFC] polish applications/Chat/coati/models/base/actor.py code style (#4248)
1 year ago
Frank Lee f447ca1811 [chat] removed cache file (#4155)
1 year ago
wukong1992 c1c672d0f0 [shardformer] shardformer support t5 model (#3994)
1 year ago
Wenhao Chen 3d8d5d0d58
[chat] use official transformers and fix some issues (#4117)
1 year ago
Wenhao Chen edd75a59ea
[chat] remove naive strategy and split colossalai strategy (#4094)
1 year ago
Wenhao Chen b03d64d010
[chat] refactor trainer class (#4080)
1 year ago
Baizhou Zhang 4da324cd60
[hotfix]fix argument naming in docs and examples (#4083)
1 year ago
Wenhao Chen 153b957a1b
[chat] refactor strategy class with booster api (#3987)
1 year ago
digger yu d4fb7bfda7
fix typo applications/Chat/coati/ (#3947)
1 year ago
Wenhao Chen 9d02590c9a
[chat] refactor actor class (#3968)
1 year ago
Hongxin Liu b5f0566363
[chat] add distributed PPO trainer (#3740)
2 years ago
digger yu e2d81eba0d
[nfc] fix typo colossalai/ applications/ (#3831)
2 years ago
digger yu 9265f2d4d7
[NFC]fix typo colossalai/auto_parallel nn utils etc. (#3779)
2 years ago
Yuanchen 05759839bd
[chat] fix bugs in stage 3 training (#3759)
2 years ago
digger-yu b7141c36dd
[CI] fix some spelling errors (#3707)
2 years ago
Hongxin Liu 7bd0bee8ea
[chat] add opt attn kernel (#3655)
2 years ago
tanitna 1a60dc07a8
[chat] typo accimulation_steps -> accumulation_steps (#3662)
2 years ago
Hongxin Liu 842768a174
[chat] refactor model save/load logic (#3654)
2 years ago
Hongxin Liu 6ef7011462
[chat] remove lm model class (#3653)
2 years ago
Hongxin Liu 2a951955ad
[chat] refactor trainer (#3648)
2 years ago
Hongxin Liu f8288315d9
[chat] polish performance evaluator (#3647)
2 years ago
Hongxin Liu 50793b35f4
[gemini] accelerate inference (#3641)
2 years ago
ddobokki df309fc6ab
[Chat] Remove duplicate functions (#3625)
2 years ago
digger-yu d7bf284706
[chat] polish code note typo (#3612)
2 years ago
Yuanchen 1ec0d386a9
reconstruct chat trainer and fix training script (#3588)
2 years ago
Camille Zhong 36a519b49f Update test_ci.sh
2 years ago
tingfeng cao 7788e0b0a5
fix: fix sft (#3568)
2 years ago
csric e355144375
[chatgpt] Detached PPO Training (#3195)
2 years ago
zhang-yi-chi e6a132a449
[chat]: add vf_coef argument for PPOTrainer (#3318)
2 years ago
gongenlei a7ca297281
[coati] Fix LlamaCritic (#3475)
2 years ago
YY Lin 62f4e2eb07
[Chat]Add Peft support & fix the ptx bug (#3433)
2 years ago
Dr-Corgi 73afb63594
[chat]fix save_model(#3377)
2 years ago
Camille Zhong 72cb4dd433
[Chat] fix the tokenizer "int too big to convert" error in SFT training (#3453)
2 years ago
Yuanchen b92313903f
fix save_model indent error in ppo trainer (#3450)
2 years ago
Yuanchen 773955abfa
fix save_model inin naive and ddp strategy (#3436)
2 years ago
ver217 26b7aac0be
[zero] reorganize zero/gemini folder structure (#3424)
2 years ago
Yuanchen b09adff724
[chat]fix sft training for bloom, gpt and opt (#3418)
2 years ago