ColossalAI/applications/Chat/coati
Camille Zhong 30412866e0
[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223)
* Add RoBERTa for RLHF Stage 2 & 3 (test)

RoBERTa for RLHF Stage 2 & 3 (still in testing)

* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"

This reverts commit 06741d894d.

* Add RoBERTa for RLHF stage 2 & 3

1. add roberta folder under model folder
2. add  roberta option in train_reward_model.py
3. add some test in testci

* add test for reward model training

* Update test_ci.sh

* Revert "Update test_ci.sh"

This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.

* Add RoBERTa for RLHF Stage 2 & 3 (test)

RoBERTa for RLHF Stage 2 & 3 (still in testing)

* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"

This reverts commit 06741d894d.

* Add RoBERTa for RLHF stage 2 & 3

1. add roberta folder under model folder
2. add  roberta option in train_reward_model.py
3. add some test in testci

* Update test_ci.sh

* Revert "Update test_ci.sh"

This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.

* update roberta with coati
2023-04-03 10:11:03 +08:00
..
dataset [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
experience_maker [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
models [chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223) 2023-04-03 10:11:03 +08:00
replay_buffer [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
trainer [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
utils [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00
__init__.py [Coati] first commit (#3283) 2023-03-28 20:25:36 +08:00