ColossalAI/applications/ColossalChat/coati
YeAnbang d888c3787c add benchmark for sft, dpo, simpo, orpo. Add benchmarking result. Support lora with gradient checkpoint 2024-07-10 10:17:08 +00:00
dataset Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into rlhf_SimPO 2024-07-10 02:32:07 +00:00
experience_buffer [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
experience_maker [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
models Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into rlhf_SimPO 2024-07-10 02:32:07 +00:00
quant [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
ray [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
trainer add benchmark for sft, dpo, simpo, orpo. Add benchmarking result. Support lora with gradient checkpoint 2024-07-10 10:17:08 +00:00
utils [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
__init__.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00