callbacks | [ColossalChat] Update RLHF V2 (#5286) | 2024-03-29 14:12:29 +08:00
__init__.py | add kto | 2024-07-18 07:54:11 +00:00
base.py | [ColossalChat] Update RLHF V2 (#5286) | 2024-03-29 14:12:29 +08:00
dpo.py | fix eval | 2024-07-11 03:35:03 +00:00
kto.py | fix style, add kto data sample | 2024-07-18 08:38:56 +00:00
orpo.py | fix orpo cross entropy loss | 2024-07-15 02:12:05 +00:00
ppo.py | [ColossalChat] Update RLHF V2 (#5286) | 2024-03-29 14:12:29 +08:00
rm.py | fix eval | 2024-07-11 03:35:03 +00:00
sft.py | fix eval | 2024-07-11 03:35:03 +00:00
utils.py | [pre-commit.ci] pre-commit autoupdate (#5572) | 2024-07-01 17:16:41 +08:00