Guoqiang QI
|
12e3670b48
|
Fix typo: removing duplicate imports of AutoTokenizer in Trarnsformer.
|
2023-05-17 15:09:31 +08:00 |
邓钢清
|
486edfed81
|
Update main.py
fix:修复预测时输入输出长度和超过512时,导致所有预测结果为空的bug
|
2023-05-08 10:58:24 +08:00 |
duzx16
|
c95b6b9d0d
|
Update english README
|
2023-04-29 18:21:48 +08:00 |
duzx16
|
2a9119bc96
|
Update train_chat.sh
|
2023-04-27 18:32:33 +08:00 |
duzx16
|
3956b8b326
|
Update conversation data instruction
|
2023-04-27 15:35:54 +08:00 |
duzx16
|
8e83083d7f
|
Update README.md
|
2023-04-19 23:00:29 +08:00 |
is
|
8e85119d7c
|
ptuning适配的web_demo.
|
2023-04-19 21:34:56 +08:00 |
duzx16
|
e182e13cf5
|
Update instructions
|
2023-04-15 11:17:41 +08:00 |
duzx16
|
c34ad40092
|
Add instructions
|
2023-04-15 11:05:12 +08:00 |
duzx16
|
8633db1503
|
Merge branch 'main' of github.com:THUDM/ChatGLM-6B
|
2023-04-13 16:10:25 +08:00 |
duzx16
|
5ade1e4055
|
Update loading instructions
|
2023-04-13 16:10:10 +08:00 |
rainatam
|
fbd9c75d46
|
Merge branch 'main' of https://github.com/THUDM/ChatGLM-6B into main
|
2023-04-13 16:01:54 +08:00 |
rainatam
|
9ac7187cc1
|
Add finetune evaluation script
|
2023-04-13 15:58:19 +08:00 |
duzx16
|
63b1e4e804
|
Add line instruction
|
2023-04-13 15:47:56 +08:00 |
duzx16
|
8123b59215
|
Add length instruction
|
2023-04-13 15:24:59 +08:00 |
duzx16
|
c6294ab3fd
|
Merge branch 'main' of github.com:THUDM/ChatGLM-6B
|
2023-04-13 15:09:40 +08:00 |
duzx16
|
60275ccec5
|
Update README
|
2023-04-13 15:07:58 +08:00 |
rainatam
|
6fb0380847
|
Merge branch 'main' of https://github.com/THUDM/ChatGLM-6B into main
|
2023-04-13 13:53:01 +08:00 |
rainatam
|
5fb705cd5b
|
Add option for saving checkpoint
|
2023-04-13 13:52:35 +08:00 |
duzx16
|
7607cfe585
|
Fix turn_idx in eval
|
2023-04-12 23:41:37 +08:00 |
duzx16
|
f06df225dd
|
Fix turn_idx
|
2023-04-12 23:34:44 +08:00 |
duzx16
|
da626f8b23
|
Add instruction for pre_seq_len
|
2023-04-12 22:42:36 +08:00 |
rainatam
|
a1d9dcc517
|
Update README
|
2023-04-12 21:11:29 +08:00 |
rainatam
|
1a368afd26
|
Update README
|
2023-04-12 16:43:34 +08:00 |
rainatam
|
3736c1ae98
|
Update README
|
2023-04-12 12:28:25 +08:00 |
duzx16
|
57e9da3822
|
Merge branch 'main' of github.com:THUDM/ChatGLM-6B
|
2023-04-12 09:49:20 +08:00 |
duzx16
|
79e4d8ba8a
|
Remove todo
|
2023-04-12 09:49:14 +08:00 |
rainatam
|
0c2806fea8
|
Fix typo
|
2023-04-11 13:48:54 +08:00 |
rainatam
|
75aa887c20
|
Update LoRA evaluation results
|
2023-04-11 01:06:11 +08:00 |
rainatam
|
166a6e70f1
|
Update LoRA evaluation results
|
2023-04-11 01:04:03 +08:00 |
rainatam
|
ec5f258de9
|
Update evaluation results
|
2023-04-10 22:31:08 +08:00 |
rainatam
|
173ccd8d27
|
Update trainer
|
2023-04-10 19:40:21 +08:00 |
rainatam
|
2073ac75d4
|
Update README
|
2023-04-10 18:43:17 +08:00 |
rainatam
|
2a5250ffcb
|
Update trainer
|
2023-04-10 18:32:40 +08:00 |
rainatam
|
47a5ec121e
|
Add deepspeed finetuning scripts
|
2023-04-10 17:28:27 +08:00 |
rainatam
|
cbb9f44e30
|
Save PrefixEncoder params only
|
2023-04-10 17:26:17 +08:00 |
Vinlic科技
|
d694a0087e
|
错别字修正
|
2023-04-07 14:27:02 +08:00 |
duzx16
|
fd3d40fa9f
|
Merge branch 'main' of github.com:THUDM/ChatGLM-6B
|
2023-04-06 22:43:10 +08:00 |
duzx16
|
ea682a6f51
|
Update default hyperparameters
Remove hardcode token id
|
2023-04-06 22:42:41 +08:00 |
rainatam
|
0cf3d08841
|
Update README
|
2023-04-06 22:30:30 +08:00 |
rainatam
|
a1ecafd91f
|
Update README
|
2023-04-06 20:47:32 +08:00 |
rainatam
|
ed79244725
|
Update README
|
2023-04-06 20:46:26 +08:00 |
rainatam
|
83ef59b146
|
Merge branch 'main' of https://github.com/THUDM/ChatGLM-6B into main
|
2023-04-06 20:23:46 +08:00 |
rainatam
|
1cbe2d1981
|
Remove logging
|
2023-04-06 20:22:56 +08:00 |
rainatam
|
5865924cc6
|
Add training for chat data
|
2023-04-06 20:21:29 +08:00 |
duzx16
|
6792ca6805
|
Add English readme
|
2023-04-06 17:55:31 +08:00 |
duzx16
|
7131d29f2d
|
Add English readme
|
2023-04-06 17:51:20 +08:00 |
rainatam
|
a9fc018444
|
Update evaluation results and bleu score function
|
2023-04-06 15:16:36 +08:00 |
Qingsong Lv
|
5de0055408
|
fix finetune pad bug and add sat readme
|
2023-04-03 11:27:28 +00:00 |
duzx16
|
ed9631a96b
|
Add deploement for ptuning model
|
2023-04-03 15:23:32 +08:00 |