diff --git a/applications/ChatGPT/examples/README.md b/applications/ChatGPT/examples/README.md index 39a769110..3876d20f0 100644 --- a/applications/ChatGPT/examples/README.md +++ b/applications/ChatGPT/examples/README.md @@ -15,9 +15,9 @@ Use these code to train your reward model. ```shell # Naive reward model training -python train_reward_model.py --pretrain --model --strategy naive +python train_reward_model.py --pretrain --model --strategy naive # use colossalai_zero2 -torchrun --standalone --nproc_per_node=2 train_reward_model.py --pretrain --model --strategy colossalai_zero2 +torchrun --standalone --nproc_per_node=2 train_reward_model.py --pretrain --model --strategy colossalai_zero2 ``` ## Train with dummy prompt data (Stage 3)