3 Commits (ab856fd308d1747c56aebaee125ec06f773f6b6c)

Author | SHA1 | Message | Date
pre-commit-ci[bot] | 1b880ce095 | [pre-commit.ci] auto fixes from pre-commit.com hooks | 6 months ago
YeAnbang | 929e1e3da4 | upgrade ppo dpo rm script | 6 months ago
YeAnbang | df5e9c53cf | [ColossalChat] Update RLHF V2 (#5286) | 8 months ago