ColossalAI/applications/ColossalEval/examples/dataset_evaluation
Camille Zhong a5756a8720 [eval] update llama npu eval (#5366) 10 months ago
config [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 1 year ago
eval_dataset.py Support mtbench (#5025) 1 year ago
eval_dataset.sh [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 1 year ago
inference.py [eval] update llama npu eval (#5366) 10 months ago
inference.sh [ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169) 12 months ago