ColossalAI/applications/Chat/examples/train_rm.sh


2023-03-28 12:25:36 +00:00
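# Expose only the single least-used GPU to the training process.
# NOTE: set_n_least_used_CUDA_VISIBLE_DEVICES is not defined in this script;
# it has to be defined or sourced before this line runs.
set_n_least_used_CUDA_VISIBLE_DEVICES 1

# Train the reward model, starting from the DeBERTa-v3-large checkpoint.
python train_reward_model.py --pretrain 'microsoft/deberta-v3-large' \
    --model 'deberta' \
    --strategy naive \
    --loss_fn 'log_exp' \
    --save_path 'rmstatic.pt' \
    --test True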
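
The script calls `set_n_least_used_CUDA_VISIBLE_DEVICES`, which is not defined in the lines shown here. As a rough illustration only, a helper with that name could be sketched as below. The body is an assumption, not the project's confirmed implementation: it assumes `nvidia-smi` is on `PATH` and simply ranks GPUs by memory currently in use.

```shell
# Hypothetical sketch: select the n GPUs with the least memory in use and
# export them via CUDA_VISIBLE_DEVICES. Assumes nvidia-smi is available.
set_n_least_used_CUDA_VISIBLE_DEVICES() {
    local n="${1:-1}"   # number of GPUs to expose (defaults to 1)
    local ids
    # Each output line looks like "<index>, <memory used in MiB>".
    # Sort numerically by the memory column, keep the first n rows,
    # keep only the index column, and join the indices with commas.
    ids=$(nvidia-smi --query-gpu=index,memory.used --format=csv,noheader,nounits \
        | sort -t, -k2 -n \
        | head -n "$n" \
        | cut -d, -f1 \
        | paste -sd, -)
    export CUDA_VISIBLE_DEVICES="$ids"
    echo "CUDA_VISIBLE_DEVICES=$CUDA_VISIBLE_DEVICES"
}
```

Because the function exports a variable, it has to run in the same shell as the `python` command, for example by defining it at the top of the script or sourcing a file that contains it.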