Michelle
|
e89b127d8e
|
[chat]: fix chat evaluation possible bug (#4064)
* fix chat eval
* fix utils
* fix utils
* add comment
---------
Co-authored-by: Qianran Ma <qianranm@luchentech.com>
|
2023-06-26 15:26:07 +08:00 |
Yuanchen
|
21c4c0b1a0
|
support UniEval and add CHRF metric (#3924)
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
|
2023-06-08 17:38:47 +08:00 |
Yuanchen
|
57a6d7685c
|
support evaluation for english (#3880)
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
|
2023-06-05 21:24:21 +08:00 |
digger yu
|
e2d81eba0d
|
[nfc] fix typo colossalai/ applications/ (#3831)
* fix typo colossalai/autochunk auto_parallel amp
* fix typo colossalai/auto_parallel nn utils etc.
* fix typo colossalai/auto_parallel autochunk fx/passes etc.
* fix typo docs/
* change placememt_policy to placement_policy in docs/ and examples/
* fix typo colossalai/ applications/
|
2023-05-25 16:19:41 +08:00 |
Yuanchen
|
34966378e8
|
[evaluation] add automatic evaluation pipeline (#3821)
* add functions for gpt evaluation
* add automatic eval
Update eval.py
* using jload and modify the type of answers1 and answers2
* Update eval.py
Update eval.py
* Update evaluator.py
* support gpt evaluation
* update readme.md
update README.md
update READNE.md
modify readme.md
* add Chinese example for config, battle prompt and evaluation prompt file
* remove GPT-4 config
* remove sample folder
---------
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>
|
2023-05-24 11:18:23 +08:00 |