14 Commits (da885ed5405c4472f18825f80c98bb505bfad23b)

Author SHA1 Message Date
Dongruixuan Li a7ae2b5b4c
[eval-hotfix] set few_shot_data to None when few shot is disabled (#5422)
9 months ago
Camille Zhong a5756a8720
[eval] update llama npu eval (#5366)
10 months ago
digger yu 756c400ad2
fix typo in applications/ColossalEval/README.md (#5250)
11 months ago
Tong Li d992b55968
[Colossal-LLaMA-2] Release Colossal-LLaMA-2-13b-base model (#5224)
11 months ago
Yuanchen eae01b6740
Improve logic for selecting metrics (#5196)
11 months ago
Yuanchen 3ff60d13b0
Fix ColossalEval (#5186)
12 months ago
Yuanchen cefdc32615
[ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169)
12 months ago
github-actions[bot] f6731db67c
[format] applied code formatting on changed files in pull request 5115 (#5118)
1 year ago
digger yu 9110406a47
fix typo change JOSNL TO JSONL etc. (#5116)
1 year ago
Zian(Andy) Zheng 7b789f4dd2
[FEATURE] Add Safety Eval Datasets to ColossalEval (#5095)
1 year ago
Yuanchen 239cd92eff
Support mtbench (#5025)
1 year ago
Yuanchen abe071b663
fix ColossalEval (#4992)
1 year ago
Yuanchen 1fa8c5e09f
Update Qwen-7B results (#4821)
1 year ago
Yuanchen ce777853ae
[feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)
1 year ago