16 Commits (641b1ee71a19e2337f3363620b228dd355835b04)

Author SHA1 Message Date
Hongxin Liu 641b1ee71a
[devops] remove post commit ci (#5566) 8 months ago
digger yu a799ca343b
[fix] fix typo s/muiti-node /multi-node etc. (#5448) 8 months ago
Dongruixuan Li a7ae2b5b4c
[eval-hotfix] set few_shot_data to None when few shot is disabled (#5422) 9 months ago
Camille Zhong a5756a8720
[eval] update llama npu eval (#5366) 10 months ago
digger yu 756c400ad2
fix typo in applications/ColossalEval/README.md (#5250) 11 months ago
Tong Li d992b55968
[Colossal-LLaMA-2] Release Colossal-LLaMA-2-13b-base model (#5224) 11 months ago
Yuanchen eae01b6740
Improve logic for selecting metrics (#5196) 11 months ago
Yuanchen 3ff60d13b0
Fix ColossalEval (#5186) 11 months ago
Yuanchen cefdc32615
[ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169) 12 months ago
github-actions[bot] f6731db67c
[format] applied code formatting on changed files in pull request 5115 (#5118) 12 months ago
digger yu 9110406a47
fix typo change JOSNL TO JSONL etc. (#5116) 12 months ago
Zian(Andy) Zheng 7b789f4dd2 [FEATURE] Add Safety Eval Datasets to ColossalEval (#5095) 1 year ago
Yuanchen 239cd92eff
Support mtbench (#5025) 1 year ago
Yuanchen abe071b663
fix ColossalEval (#4992) 1 year ago
Yuanchen 1fa8c5e09f
Update Qwen-7B results (#4821) 1 year ago
Yuanchen ce777853ae
[feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) 1 year ago