Commit Graph

4 Commits (544b7a38a167cb05cdc7590cfc100e23c0ed5ab7)

Author | SHA1 | Message | Date
YeAnbang | 544b7a38a1 | fix style, add kto data sample | 2024-07-18 08:38:56 +00:00
YeAnbang | 09d5ffca1a | add kto | 2024-07-18 07:54:11 +00:00
YeAnbang | f6ef5c3609 | fix style | 2024-07-10 10:37:17 +00:00
YeAnbang | d888c3787c | add benchmark for sft, dpo, simpo, orpo. Add benchmarking result. Support lora with gradient checkpoint | 2024-07-10 10:17:08 +00:00