Commit Graph

13 Commits (8a3ff4f3153e4887587c9128c48a3f79c8727394)

Author SHA1 Message Date
YeAnbang 8a3ff4f315 fix style
4 months ago
Tong Li d08c99be0d
Merge branch 'main' into kto
4 months ago
Tong Li f585d4e38e
[ColossalChat] Hotfix for ColossalChat (#5910)
4 months ago
YeAnbang 544b7a38a1 fix style, add kto data sample
4 months ago
YeAnbang 09d5ffca1a add kto
4 months ago
YeAnbang b3594d4d68 fix orpo cross entropy loss
5 months ago
YeAnbang e7a8634636 fix eval
5 months ago
YeAnbang d888c3787c add benchmark for sft, dpo, simpo, orpo. Add benchmarking result. Support lora with gradient checkpoint
5 months ago
YeAnbang 16f3451fe2 Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into rlhf_SimPO
5 months ago
pre-commit-ci[bot] 7c2f79fa98
[pre-commit.ci] pre-commit autoupdate (#5572)
5 months ago
YeAnbang c8d1b4a968 add orpo
5 months ago
YeAnbang 82aecd6374 add SimPO
5 months ago
YeAnbang df5e9c53cf
[ColossalChat] Update RLHF V2 (#5286)
8 months ago