Commit Graph

10 Commits (5caad13055e802e2665f1d70593116103a72395a)

Author SHA1 Message Date
Tong Li 7a60161035
update readme (#6116)
3 weeks ago
Tong Li 89a9a600bc
[MCTS] Add self-refined MCTS (#6098)
1 month ago
Tong Li c650a906db
[Hotfix] Remove deprecated install (#6042)
3 months ago
Tong Li ad3fa4f49c
[Hotfix] README link (#5966)
4 months ago
Tong Li 1aeb5e8847
[hotfix] Remove unused plan section (#5957)
4 months ago
YeAnbang 09d5ffca1a add kto
4 months ago
YeAnbang 115c4cc5a4 hotfix citation
5 months ago
YeAnbang c8d1b4a968 add orpo
5 months ago
YeAnbang 82aecd6374 add SimPO
5 months ago
YeAnbang df5e9c53cf
[ColossalChat] Update RLHF V2 (#5286)
8 months ago