ColossalAI

Commit Graph

Author	SHA1	Message	Date
Yuanchen	2925f47399	[evaluate] support gpt evaluation with reference (#3972 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-06-13 15:12:29 +08:00
Yuanchen	21c4c0b1a0	support UniEval and add CHRF metric (#3924 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-06-08 17:38:47 +08:00
Yuanchen	57a6d7685c	support evaluation for english (#3880 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-06-05 21:24:21 +08:00
Yuanchen	2506e275b8	[evaluation] improvement on evaluation (#3862 ) * fix a bug when the config file contains one category but the answer file doesn't contains that category * fix Chinese prompt file * support gpt-3.5-turbo and gpt-4 evaluation * polish and update README * resolve pr comments --------- Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-05-30 11:48:41 +08:00
Yuanchen	34966378e8	[evaluation] add automatic evaluation pipeline (#3821 ) * add functions for gpt evaluation * add automatic eval Update eval.py * using jload and modify the type of answers1 and answers2 * Update eval.py Update eval.py * Update evaluator.py * support gpt evaluation * update readme.md update README.md update READNE.md modify readme.md * add Chinese example for config, battle prompt and evaluation prompt file * remove GPT-4 config * remove sample folder --------- Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com> Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>	2023-05-24 11:18:23 +08:00
Tong Li	c1a355940e	update readme	2023-04-28 11:56:35 +08:00
Tong Li	ed3eaa6922	update documentation	2023-04-28 11:49:21 +08:00
Tong Li	c419117329	update questions and readme	2023-04-27 19:04:26 +08:00
Tong Li	aa77ddae33	remove unnecessary step and update readme	2023-04-27 18:51:58 +08:00
Yuanchen	c4709d34cf	Chat evaluate (#3608 ) Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>	2023-04-20 11:12:24 +08:00

10 Commits (abe4f971e0e316e8558569bf30faca77772367b6)