Commit Graph

13 Commits (597b2060013045cf0d0f0f8fddfc1b77ef716818)

Author SHA1 Message Date
flybird11111 0c10afd372
[FP8] rebase main (#5963)
4 months ago
pre-commit-ci[bot] 7c2f79fa98
[pre-commit.ci] pre-commit autoupdate (#5572)
5 months ago
digger yu a799ca343b
[fix] fix typo s/muiti-node /multi-node etc. (#5448)
8 months ago
Dongruixuan Li a7ae2b5b4c
[eval-hotfix] set few_shot_data to None when few shot is disabled (#5422)
9 months ago
Camille Zhong a5756a8720
[eval] update llama npu eval (#5366)
10 months ago
Yuanchen eae01b6740
Improve logic for selecting metrics (#5196)
11 months ago
Yuanchen 3ff60d13b0
Fix ColossalEval (#5186)
12 months ago
Yuanchen cefdc32615
[ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169)
12 months ago
github-actions[bot] f6731db67c
[format] applied code formatting on changed files in pull request 5115 (#5118)
1 year ago
Zian(Andy) Zheng 7b789f4dd2 [FEATURE] Add Safety Eval Datasets to ColossalEval (#5095)
1 year ago
Yuanchen 239cd92eff
Support mtbench (#5025)
1 year ago
Yuanchen abe071b663
fix ColossalEval (#4992)
1 year ago
Yuanchen ce777853ae
[feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)
1 year ago