Commit Graph

6 Commits (09c5f72595228ad5f8e82005b8e442292bc063d1)

Author SHA1 Message Date
YeAnbang 12fe8b5858 refactor evaluation 2024-07-22 05:57:39 +00:00
YeAnbang b3594d4d68 fix orpo cross entropy loss 2024-07-15 02:12:05 +00:00
YeAnbang e7a8634636 fix eval 2024-07-11 03:35:03 +00:00
pre-commit-ci[bot] 8a9721bafe [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2024-07-10 10:44:32 +00:00
YeAnbang d888c3787c add benchmark for sft, dpo, simpo, orpo. Add benchmarking result. Support lora with gradient checkpoint 2024-07-10 10:17:08 +00:00
YeAnbang c8d1b4a968 add orpo 2024-06-27 07:20:28 +00:00