Commit Graph

4 Commits (ea088b5f75e9c9a79d67b370286da2a1508688c8)

Author SHA1 Message Date
Yuanchen eae01b6740
Improve logic for selecting metrics (#5196)
11 months ago
Yuanchen cefdc32615
[ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169)
12 months ago
Yuanchen 239cd92eff
Support mtbench (#5025)
1 year ago
Yuanchen ce777853ae
[feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)
1 year ago