ColossalAI/applications/ColossalChat/examples/training_scripts
YeAnbang 7ae87b3159 fix training script 2024-06-07 07:01:31 +00:00
hostfile upgrade colossal-chat support tp_group>1, add sp for sft 2024-06-07 07:01:30 +00:00
train_dpo.py moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy 2024-06-07 07:01:31 +00:00
train_dpo.sh [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
train_ppo.py moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy 2024-06-07 07:01:31 +00:00
train_ppo.sh [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
train_rm.py moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy 2024-06-07 07:01:31 +00:00
train_rm.sh [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
train_sft.py moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy 2024-06-07 07:01:31 +00:00
train_sft.sh fix training script 2024-06-07 07:01:31 +00:00