ColossalAI

Camille Zhong 30412866e0 [chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223 )		2023-04-03 10:11:03 +08:00
* Add RoBERTa for RLHF Stage 2 & 3 (test)
  RoBERTa for RLHF Stage 2 & 3 (still in testing)
* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
  This reverts commit `06741d894d`.
* Add RoBERTa for RLHF stage 2 & 3
  1. add roberta folder under model folder
  2. add roberta option in train_reward_model.py
  3. add some test in testci
* add test for reward model training
* Update test_ci.sh
* Revert "Update test_ci.sh"
  This reverts commit `9c7352b81766f3177d31eeec0ec178a301df966a`.
* Add RoBERTa for RLHF Stage 2 & 3 (test)
  RoBERTa for RLHF Stage 2 & 3 (still in testing)
* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
  This reverts commit `06741d894d`.
* Add RoBERTa for RLHF stage 2 & 3
  1. add roberta folder under model folder
  2. add roberta option in train_reward_model.py
  3. add some test in testci
* Update test_ci.sh
* Revert "Update test_ci.sh"
  This reverts commit `9c7352b81766f3177d31eeec0ec178a301df966a`.
* update roberta with coati
__init__.py	[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223 )	2023-04-03 10:11:03 +08:00
roberta_actor.py	[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223 )	2023-04-03 10:11:03 +08:00
roberta_critic.py	[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223 )	2023-04-03 10:11:03 +08:00
roberta_rm.py	[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223 )	2023-04-03 10:11:03 +08:00