ColossalAI/applications/ColossalChat/coati/models
YeAnbang 929e1e3da4 upgrade ppo dpo rm script 2024-06-07 07:01:30 +00:00
..
__init__.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
base.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
critic.py upgrade ppo dpo rm script 2024-06-07 07:01:30 +00:00
generation.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
lora.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
loss.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
reward_model.py upgrade ppo dpo rm script 2024-06-07 07:01:30 +00:00
utils.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00