Yuanchen
|
34966378e8
|
[evaluation] add automatic evaluation pipeline (#3821)
* add functions for gpt evaluation
* add automatic eval
Update eval.py
* using jload and modify the type of answers1 and answers2
* Update eval.py
Update eval.py
* Update evaluator.py
* support gpt evaluation
* update readme.md
update README.md
update READNE.md
modify readme.md
* add Chinese example for config, battle prompt and evaluation prompt file
* remove GPT-4 config
* remove sample folder
---------
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>
|
2023-05-24 11:18:23 +08:00 |
Tong Li
|
c1a355940e
|
update readme
|
2023-04-28 11:56:35 +08:00 |
Tong Li
|
ed3eaa6922
|
update documentation
|
2023-04-28 11:49:21 +08:00 |
Tong Li
|
c419117329
|
update questions and readme
|
2023-04-27 19:04:26 +08:00 |
Tong Li
|
aa77ddae33
|
remove unnecessary step and update readme
|
2023-04-27 18:51:58 +08:00 |
Yuanchen
|
c4709d34cf
|
Chat evaluate (#3608)
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
|
2023-04-20 11:12:24 +08:00 |