ColossalAI/applications/ColossalEval/examples/gpt_evaluation
Yuanchen eae01b6740
Improve logic for selecting metrics (#5196)
11 months ago
config        [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)                     1 year ago
eval.py       [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)                     1 year ago
eval.sh       [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)                     1 year ago
inference.py  Improve logic for selecting metrics (#5196)                                      11 months ago
inference.sh  [ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169)  12 months ago