duzx16
|
4227999d4c
|
No padding in collator
|
2023-04-02 02:05:03 +08:00 |
duzx16
|
c508f62b70
|
Fix position_ids in prediction
|
2023-04-02 01:59:07 +08:00 |
duzx16
|
ca43864f39
|
Change quantization instruction
|
2023-04-02 00:36:14 +08:00 |
duzx16
|
4371f7a572
|
Add padding for evaluation data
|
2023-04-01 23:09:26 +08:00 |
duzx16
|
41a511ea06
|
Merge branch 'main' of github.com:THUDM/ChatGLM-6B
|
2023-04-01 19:49:06 +08:00 |
maybeluo
|
7a67ddd61f
|
write generated result with utf-8
|
2023-04-01 00:34:19 +08:00 |
duzx16
|
7436f0840f
|
Add todo
|
2023-03-31 22:55:36 +08:00 |
duzx16
|
ff3761fc1a
|
Merge branch 'main' of github.com:THUDM/ChatGLM-6B
|
2023-03-31 20:15:51 +08:00 |
duzx16
|
73f4fe1ffe
|
Add validation file name
Use full prediction
|
2023-03-31 20:15:35 +08:00 |
rainatam
|
893706a82d
|
Update train script
|
2023-03-31 18:12:04 +08:00 |
duzx16
|
08d880141d
|
Fix revision for loading model
|
2023-03-31 16:32:34 +08:00 |
duzx16
|
08f0731e94
|
Merge branch 'main' of github.com:THUDM/ChatGLM-6B
|
2023-03-31 15:19:16 +08:00 |
duzx16
|
c206e7d9ad
|
Update requirements.txt
|
2023-03-31 15:18:21 +08:00 |
Aohan Zeng
|
9853cd2c97
|
Update README.md
|
2023-03-31 12:26:09 +08:00 |
Aohan Zeng
|
99875468dd
|
Update README.md
|
2023-03-31 11:46:21 +08:00 |
duzx16
|
24e24d5d6c
|
Fix model path
|
2023-03-31 11:30:36 +08:00 |
duzx16
|
5e818065e4
|
Update memory requirement
|
2023-03-31 11:29:34 +08:00 |
duzx16
|
d2645d8816
|
Update batch size
|
2023-03-31 11:28:13 +08:00 |
duzx16
|
971a6fbb20
|
Update ADGEN link
|
2023-03-31 11:27:29 +08:00 |
duzx16
|
77da046839
|
Update model path
|
2023-03-31 10:49:21 +08:00 |
duzx16
|
968a30672a
|
Add P-Tuning v2
|
2023-03-31 10:43:55 +08:00 |