dataset
|
refactor tokenization
|
2024-07-19 10:10:48 +00:00 |
experience_buffer
|
[ColossalChat] Update RLHF V2 (#5286)
|
2024-03-29 14:12:29 +08:00 |
experience_maker
|
[ColossalChat] Update RLHF V2 (#5286)
|
2024-03-29 14:12:29 +08:00 |
models
|
fix style, add kto data sample
|
2024-07-18 08:38:56 +00:00 |
quant
|
[ColossalChat] Update RLHF V2 (#5286)
|
2024-03-29 14:12:29 +08:00 |
ray
|
[ColossalChat] Update RLHF V2 (#5286)
|
2024-03-29 14:12:29 +08:00 |
trainer
|
Merge branch 'main' into kto
|
2024-07-19 15:23:31 +08:00 |
utils
|
[ColossalChat] Update RLHF V2 (#5286)
|
2024-03-29 14:12:29 +08:00 |
__init__.py
|
[ColossalChat] Update RLHF V2 (#5286)
|
2024-03-29 14:12:29 +08:00 |