digger yu
|
9110406a47
|
fix typo change JOSNL TO JSONL etc. (#5116)
|
12 months ago |
Zian(Andy) Zheng
|
7b789f4dd2
|
[FEATURE] Add Safety Eval Datasets to ColossalEval (#5095)
* add safetybench and cvalues(responsibility) eval dataset
* Modify code according to review suggestions
---------
Co-authored-by: Orion-Zheng <zhengzian@u.nus.edu>
|
1 year ago |
Yuanchen
|
239cd92eff
|
Support mtbench (#5025)
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
|
1 year ago |
Yuanchen
|
abe071b663
|
fix ColossalEval (#4992)
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
|
1 year ago |
Yuanchen
|
1fa8c5e09f
|
Update Qwen-7B results (#4821)
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
|
1 year ago |
Yuanchen
|
ce777853ae
|
[feature] ColossalEval: Evaluation Pipeline for LLMs (#4786)
* Add ColossalEval
* Delete evaluate in Chat
---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
Co-authored-by: Tong Li <tong.li352711588@gmail.com>
|
1 year ago |