Commit Graph

11 Commits (62cdac6b7b655e11626382d64e56503146a516ee)

Author SHA1 Message Date
YeAnbang 30f4e31a33 [Chat] Fix lora (#5946)
4 months ago
YeAnbang 544b7a38a1 fix style, add kto data sample
4 months ago
YeAnbang 09d5ffca1a add kto
4 months ago
YeAnbang 16f3451fe2 Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into rlhf_SimPO
5 months ago
pre-commit-ci[bot] 7c2f79fa98 [pre-commit.ci] pre-commit autoupdate (#5572)
5 months ago
YeAnbang c8d1b4a968 add orpo
5 months ago
YeAnbang 0b2d6275c4 fix dataloader
5 months ago
YeAnbang 82aecd6374 add SimPO
5 months ago
pre-commit-ci[bot] 1b880ce095 [pre-commit.ci] auto fixes from pre-commit.com hooks
6 months ago
YeAnbang 929e1e3da4 upgrade ppo dpo rm script
6 months ago
YeAnbang df5e9c53cf [ColossalChat] Update RLHF V2 (#5286)
8 months ago