Commit Graph

5 Commits (489f215ad9ed8c5800f6076f7029e9404f8d8c06)

Author SHA1 Message Date
YeAnbang 0472f44163 fix logprob, add filtering, temperature annealing, lr descent 2025-03-21 10:24:24 +08:00
YeAnbang 7ee4452f8c fix vllm 2025-03-19 17:11:10 +08:00
Tong Li 0f566cc2d4 add algo selection 2025-03-06 14:29:22 +08:00
Tong Li ffd3878a1e add simple grpo 2025-02-23 22:54:26 +08:00
Hongxin Liu 43c9b5fb44 [chat] add distributed impl (#6210) 2025-02-21 15:24:23 +08:00