Yuanchen
b92313903f
fix save_model indent error in ppo trainer ( #3450 )
...
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
2 years ago
Yuanchen
773955abfa
fix save_model inin naive and ddp strategy ( #3436 )
...
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
2 years ago
ver217
26b7aac0be
[zero] reorganize zero/gemini folder structure ( #3424 )
...
* [zero] refactor low-level zero folder structure
* [zero] fix legacy zero import path
* [zero] fix legacy zero import path
* [zero] remove useless import
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor legacy zero import path
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor legacy zero import path
* [zero] fix test import path
* [zero] fix test
* [zero] fix circular import
* [zero] update import
2 years ago
Yuanchen
b09adff724
[chat]fix sft training for bloom, gpt and opt ( #3418 )
...
fix sft training for bloom, gpt and opt
2 years ago
Camille Zhong
30412866e0
[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 ( #3223 )
...
* Add RoBERTa for RLHF Stage 2 & 3 (test)
RoBERTa for RLHF Stage 2 & 3 (still in testing)
* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
This reverts commit 06741d894d
.
* Add RoBERTa for RLHF stage 2 & 3
1. add roberta folder under model folder
2. add roberta option in train_reward_model.py
3. add some test in testci
* add test for reward model training
* Update test_ci.sh
* Revert "Update test_ci.sh"
This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.
* Add RoBERTa for RLHF Stage 2 & 3 (test)
RoBERTa for RLHF Stage 2 & 3 (still in testing)
* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
This reverts commit 06741d894d
.
* Add RoBERTa for RLHF stage 2 & 3
1. add roberta folder under model folder
2. add roberta option in train_reward_model.py
3. add some test in testci
* Update test_ci.sh
* Revert "Update test_ci.sh"
This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.
* update roberta with coati
2 years ago
Andrew
82132f4e3d
[chat] correcting a few obvious typos and grammars errors ( #3338 )
2 years ago
Fazzie-Maqianli
0fbadce79c
[doc] added authors to the chat application ( #3307 )
2 years ago
BlueRum
b512893637
Polish readme link ( #3306 )
2 years ago
github-actions[bot]
cb413ccf28
[format] applied code formatting on changed files in pull request 3300 ( #3302 )
...
Co-authored-by: github-actions <github-actions@github.com>
2 years ago
binmakeswell
31c78f2be3
[doc] add ColossalChat news ( #3304 )
...
* [doc] add ColossalChat news
* [doc] add ColossalChat news
2 years ago
Frank Lee
e235a24673
[application] updated the README ( #3301 )
...
* [application] updated the README
* polish code
2 years ago
BlueRum
8257e1055d
[chat]polish prompts training ( #3300 )
...
* polish train_prompts
* polish readme
2 years ago
ver217
62f7156131
[coati] fix inference profanity check ( #3299 )
2 years ago
github-actions[bot]
5134ad5d1a
[format] applied code formatting on changed files in pull request 3296 ( #3298 )
...
Co-authored-by: github-actions <github-actions@github.com>
2 years ago
BlueRum
c8b723d6c2
[chat]Update Readme ( #3296 )
...
* Update README.md
* Update README.md
* Update README.md
* update example readme
2 years ago
ver217
73b542a124
[coati] inference supports profanity check ( #3295 )
2 years ago
ver217
ce2cafae76
[coati] add repetition_penalty for inference ( #3294 )
2 years ago
Fazzie-Maqianli
a88ed0f83a
add limit ( #3293 )
2 years ago
Fazzie-Maqianli
c5484281aa
[ColossalChat]add cite for datasets ( #3292 )
2 years ago
Fazzie-Maqianli
ec7af22a43
fix image ( #3288 )
2 years ago
Fazzie-Maqianli
1f7d9afbf8
add example ( #3286 )
2 years ago
ver217
4905b21b94
[coati] fix inference output ( #3285 )
...
* [coati] fix inference requirements
* [coati] add output postprocess
* [coati] update inference readme
* [coati] fix inference requirements
2 years ago
Fazzie-Maqianli
b0ce5a1032
[Coati] first commit ( #3283 )
2 years ago