Commit Graph

71 Commits (401bf3a8a7dd8a26fba189551dccfc61a7079b4e)

Author SHA1 Message Date
Guoqiang QI 12e3670b48 Fix typo: removing duplicate imports of AutoTokenizer in Trarnsformer. 2023-05-17 15:09:31 +08:00
邓钢清 486edfed81 Update main.py
fix: fix a bug where all prediction results were empty when the combined input and output length exceeded 512 during prediction
2023-05-08 10:58:24 +08:00
duzx16 c95b6b9d0d Update english README 2023-04-29 18:21:48 +08:00
duzx16 2a9119bc96 Update train_chat.sh 2023-04-27 18:32:33 +08:00
duzx16 3956b8b326 Update conversation data instruction 2023-04-27 15:35:54 +08:00
duzx16 8e83083d7f Update README.md 2023-04-19 23:00:29 +08:00
is 8e85119d7c web_demo adapted for ptuning. 2023-04-19 21:34:56 +08:00
duzx16 e182e13cf5 Update instructions 2023-04-15 11:17:41 +08:00
duzx16 c34ad40092 Add instructions 2023-04-15 11:05:12 +08:00
duzx16 8633db1503 Merge branch 'main' of github.com:THUDM/ChatGLM-6B 2023-04-13 16:10:25 +08:00
duzx16 5ade1e4055 Update loading instructions 2023-04-13 16:10:10 +08:00
rainatam fbd9c75d46 Merge branch 'main' of https://github.com/THUDM/ChatGLM-6B into main 2023-04-13 16:01:54 +08:00
rainatam 9ac7187cc1 Add finetune evaluation script 2023-04-13 15:58:19 +08:00
duzx16 63b1e4e804 Add line instruction 2023-04-13 15:47:56 +08:00
duzx16 8123b59215 Add length instruction 2023-04-13 15:24:59 +08:00
duzx16 c6294ab3fd Merge branch 'main' of github.com:THUDM/ChatGLM-6B 2023-04-13 15:09:40 +08:00
duzx16 60275ccec5 Update README 2023-04-13 15:07:58 +08:00
rainatam 6fb0380847 Merge branch 'main' of https://github.com/THUDM/ChatGLM-6B into main 2023-04-13 13:53:01 +08:00
rainatam 5fb705cd5b Add option for saving checkpoint 2023-04-13 13:52:35 +08:00
duzx16 7607cfe585 Fix turn_idx in eval 2023-04-12 23:41:37 +08:00
duzx16 f06df225dd Fix turn_idx 2023-04-12 23:34:44 +08:00
duzx16 da626f8b23 Add instruction for pre_seq_len 2023-04-12 22:42:36 +08:00
rainatam a1d9dcc517 Update README 2023-04-12 21:11:29 +08:00
rainatam 1a368afd26 Update README 2023-04-12 16:43:34 +08:00
rainatam 3736c1ae98 Update README 2023-04-12 12:28:25 +08:00
duzx16 57e9da3822 Merge branch 'main' of github.com:THUDM/ChatGLM-6B 2023-04-12 09:49:20 +08:00
duzx16 79e4d8ba8a Remove todo 2023-04-12 09:49:14 +08:00
rainatam 0c2806fea8 Fix typo 2023-04-11 13:48:54 +08:00
rainatam 75aa887c20 Update LoRA evaluation results 2023-04-11 01:06:11 +08:00
rainatam 166a6e70f1 Update LoRA evaluation results 2023-04-11 01:04:03 +08:00
rainatam ec5f258de9 Update evaluation results 2023-04-10 22:31:08 +08:00
rainatam 173ccd8d27 Update trainer 2023-04-10 19:40:21 +08:00
rainatam 2073ac75d4 Update README 2023-04-10 18:43:17 +08:00
rainatam 2a5250ffcb Update trainer 2023-04-10 18:32:40 +08:00
rainatam 47a5ec121e Add deepspeed finetuning scripts 2023-04-10 17:28:27 +08:00
rainatam cbb9f44e30 Save PrefixEncoder params only 2023-04-10 17:26:17 +08:00
Vinlic科技 d694a0087e Fix typos 2023-04-07 14:27:02 +08:00
duzx16 fd3d40fa9f Merge branch 'main' of github.com:THUDM/ChatGLM-6B 2023-04-06 22:43:10 +08:00
duzx16 ea682a6f51 Update default hyperparameters
Remove hardcode token id
2023-04-06 22:42:41 +08:00
rainatam 0cf3d08841 Update README 2023-04-06 22:30:30 +08:00
rainatam a1ecafd91f Update README 2023-04-06 20:47:32 +08:00
rainatam ed79244725 Update README 2023-04-06 20:46:26 +08:00
rainatam 83ef59b146 Merge branch 'main' of https://github.com/THUDM/ChatGLM-6B into main 2023-04-06 20:23:46 +08:00
rainatam 1cbe2d1981 Remove logging 2023-04-06 20:22:56 +08:00
rainatam 5865924cc6 Add training for chat data 2023-04-06 20:21:29 +08:00
duzx16 6792ca6805 Add English readme 2023-04-06 17:55:31 +08:00
duzx16 7131d29f2d Add English readme 2023-04-06 17:51:20 +08:00
rainatam a9fc018444 Update evaluation results and bleu score function 2023-04-06 15:16:36 +08:00
Qingsong Lv 5de0055408 fix finetune pad bug and add sat readme 2023-04-03 11:27:28 +00:00
duzx16 ed9631a96b Add deploement for ptuning model 2023-04-03 15:23:32 +08:00