dataset | support session-based training (#4313) | 2023-07-28 11:29:55 +08:00 |
experience_maker | [chat] refactor actor class (#3968) | 2023-06-13 13:31:56 +08:00 |
kernels | [CI] fix some spelling errors (#3707) | 2023-05-10 17:12:03 +08:00 |
models | [chat] fix compute_approx_kl (#4338) | 2023-08-01 10:21:45 +08:00 |
quant | [chat] add distributed PPO trainer (#3740) | 2023-06-07 10:41:16 +08:00 |
replay_buffer | [chat] polish code note typo (#3612) | 2023-04-20 17:22:15 +08:00 |
__init__.py | [Coati] first commit (#3283) | 2023-03-28 20:25:36 +08:00 |