ColossalAI/applications/ColossalEval/colossal_eval/dataset
Zian(Andy) Zheng 7b789f4dd2 [FEATURE] Add Safety Eval Datasets to ColossalEval (#5095)
* add safetybench and cvalues(responsibility) eval dataset

* Modify code according to review suggestions

---------

Co-authored-by: Orion-Zheng <zhengzian@u.nus.edu>
2023-11-28 11:15:04 +08:00
..
__init__.py [FEATURE] Add Safety Eval Datasets to ColossalEval (#5095) 2023-11-28 11:15:04 +08:00
agieval.py [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 2023-09-24 23:14:11 +08:00
base.py [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 2023-09-24 23:14:11 +08:00
ceval.py [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 2023-09-24 23:14:11 +08:00
cmmlu.py [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 2023-09-24 23:14:11 +08:00
colossalai.py [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 2023-09-24 23:14:11 +08:00
cvalues.py [FEATURE] Add Safety Eval Datasets to ColossalEval (#5095) 2023-11-28 11:15:04 +08:00
gaokaobench.py [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 2023-09-24 23:14:11 +08:00
longbench.py [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 2023-09-24 23:14:11 +08:00
mmlu.py [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 2023-09-24 23:14:11 +08:00
mtbench.py Support mtbench (#5025) 2023-11-09 13:41:50 +08:00
safetybench_en.py [FEATURE] Add Safety Eval Datasets to ColossalEval (#5095) 2023-11-28 11:15:04 +08:00
safetybench_zh.py [FEATURE] Add Safety Eval Datasets to ColossalEval (#5095) 2023-11-28 11:15:04 +08:00