ColossalAI/applications/ColossalChat/coati
YeAnbang 0b4a33548c moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy 2024-06-07 07:01:31 +00:00
..
dataset moupdate ci tests, st ci test cases passed, tp failed in generation for ppo, sp is buggy 2024-06-07 07:01:31 +00:00
experience_buffer [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
experience_maker [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
models upgrade ppo dpo rm script 2024-06-07 07:01:30 +00:00
quant [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
ray [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
trainer [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
utils [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
__init__.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00