Commit Graph

91 Commits (b03d64d010cb6803b66230a0386bc62d989e6ef6)

Author SHA1 Message Date
tingfeng cao 7788e0b0a5
fix: fix sft (#3568)
2 years ago
Fazzie-Maqianli 6b1a39b17b
[coati] add costom model suppor tguide (#3579)
2 years ago
binmakeswell cc1eec2f53
[chat] update reward model sh (#3578)
2 years ago
csric e355144375
[chatgpt] Detached PPO Training (#3195)
2 years ago
MisterLin1995 1a809eddaa
[chat] ChatGPT train prompts on ray example (#3309)
2 years ago
binmakeswell 535b896435
[chat] polish tutorial doc (#3551)
2 years ago
Yuanchen 7182ac2a04
[chat]add examples of training with limited resources in chat readme (#3536)
2 years ago
zhang-yi-chi e6a132a449
[chat]: add vf_coef argument for PPOTrainer (#3318)
2 years ago
ver217 89fd10a1c9
[chat] add zero2 cpu strategy for sft training (#3520)
2 years ago
NatalieC323 635d0a1baf
[Chat Community] Update README.md (fixed#3487) (#3506)
2 years ago
gongenlei a7ca297281
[coati] Fix LlamaCritic (#3475)
2 years ago
binmakeswell 891b8e7fac
[chat] fix stage3 PPO sample sh command (#3477)
2 years ago
Fazzie-Maqianli 6afeb1202a
add community example dictionary (#3465)
2 years ago
Frank Lee 80eba05b0a
[test] refactor tests with spawn (#3452)
2 years ago
YY Lin 62f4e2eb07
[Chat]Add Peft support & fix the ptx bug (#3433)
2 years ago
Dr-Corgi 73afb63594
[chat]fix save_model(#3377)
2 years ago
kingkingofall 57a3c4db6d
[chat]fix readme (#3429)
2 years ago
Camille Zhong 72cb4dd433
[Chat] fix the tokenizer "int too big to convert" error in SFT training (#3453)
2 years ago
Yuanchen b92313903f
fix save_model indent error in ppo trainer (#3450)
2 years ago
Yuanchen 773955abfa
fix save_model inin naive and ddp strategy (#3436)
2 years ago
ver217 26b7aac0be
[zero] reorganize zero/gemini folder structure (#3424)
2 years ago
Yuanchen b09adff724
[chat]fix sft training for bloom, gpt and opt (#3418)
2 years ago
Camille Zhong 30412866e0
[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223)
2 years ago
Andrew 82132f4e3d
[chat] correcting a few obvious typos and grammars errors (#3338)
2 years ago
Fazzie-Maqianli 0fbadce79c
[doc] added authors to the chat application (#3307)
2 years ago
BlueRum b512893637
Polish readme link (#3306)
2 years ago
github-actions[bot] cb413ccf28
[format] applied code formatting on changed files in pull request 3300 (#3302)
2 years ago
binmakeswell 31c78f2be3
[doc] add ColossalChat news (#3304)
2 years ago
Frank Lee e235a24673
[application] updated the README (#3301)
2 years ago
BlueRum 8257e1055d
[chat]polish prompts training (#3300)
2 years ago
ver217 62f7156131
[coati] fix inference profanity check (#3299)
2 years ago
github-actions[bot] 5134ad5d1a
[format] applied code formatting on changed files in pull request 3296 (#3298)
2 years ago
BlueRum c8b723d6c2
[chat]Update Readme (#3296)
2 years ago
ver217 73b542a124
[coati] inference supports profanity check (#3295)
2 years ago
ver217 ce2cafae76
[coati] add repetition_penalty for inference (#3294)
2 years ago
Fazzie-Maqianli a88ed0f83a
add limit (#3293)
2 years ago
Fazzie-Maqianli c5484281aa
[ColossalChat]add cite for datasets (#3292)
2 years ago
Fazzie-Maqianli ec7af22a43
fix image (#3288)
2 years ago
Fazzie-Maqianli 1f7d9afbf8
add example (#3286)
2 years ago
ver217 4905b21b94
[coati] fix inference output (#3285)
2 years ago
Fazzie-Maqianli b0ce5a1032
[Coati] first commit (#3283)
2 years ago