Commit Graph

13 Commits (f5c84af0b01bcd2e993d38dc628793f7f0a8ba64)

Author SHA1 Message Date
Tong Li 19d1510ea2
[feat] Dist Loader for Eval (#5950)
4 months ago
pre-commit-ci[bot] 7c2f79fa98
[pre-commit.ci] pre-commit autoupdate (#5572)
5 months ago
digger yu a799ca343b
[fix] fix typo s/muiti-node /multi-node etc. (#5448)
8 months ago
Dongruixuan Li a7ae2b5b4c
[eval-hotfix] set few_shot_data to None when few shot is disabled (#5422)
9 months ago
Camille Zhong a5756a8720
[eval] update llama npu eval (#5366)
10 months ago
Yuanchen eae01b6740
Improve logic for selecting metrics (#5196)
12 months ago
Yuanchen 3ff60d13b0
Fix ColossalEval (#5186)
12 months ago
Yuanchen cefdc32615
[ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169)
12 months ago
github-actions[bot] f6731db67c
[format] applied code formatting on changed files in pull request 5115 (#5118)
1 year ago
Zian(Andy) Zheng 7b789f4dd2 [FEATURE] Add Safety Eval Datasets to ColossalEval (#5095)
1 year ago
Yuanchen 239cd92eff
Support mtbench (#5025)
1 year ago
Yuanchen abe071b663
fix ColossalEval (#4992)
1 year ago
Yuanchen ce777853ae
[feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)
1 year ago