ColossalAI/applications/ColossalEval/examples/dataset_evaluation
Yuanchen eae01b6740
Improve logic for selecting metrics (#5196)
Co-authored-by: Xu <yuanchen.xu00@gmail.com>
2023-12-22 14:52:50 +08:00
Name             Last commit                                                                      Date
config           [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)                     2023-09-24 23:14:11 +08:00
eval_dataset.py  Support mtbench (#5025)                                                          2023-11-09 13:41:50 +08:00
eval_dataset.sh  [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)                     2023-09-24 23:14:11 +08:00
inference.py     Improve logic for selecting metrics (#5196)                                      2023-12-22 14:52:50 +08:00
inference.sh     [ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169)  2023-12-12 14:47:35 +08:00