Merge branch 'main' of github.com:THUDM/ChatGLM-6B

2023-04-01 19:49:06 +08:00 · 2023-04-01 19:49:06 +08:00 · 41a511ea06
parent 7436f0840f e4fe9aaf08
commit 41a511ea06
3 changed files with 9 additions and 5 deletions
--- a/README.md
+++ b/README.md
@ -169,6 +169,8 @@ model = AutoModel.from_pretrained("your local path", trust_remote_code=True).hal
 ## 高效参数微调
 基于 [P-tuning v2](https://github.com/THUDM/P-tuning-v2) 的高效参数微调。具体使用方法详见 [ptuning/README.md](ptuning/README.md)。

+
+
 ## ChatGLM-6B 示例

 以下是一些使用 `web_demo.py` 得到的示例截图。更多 ChatGLM-6B 的可能，等待你来探索发现！
@ -261,6 +263,7 @@ model = AutoModel.from_pretrained("your local path", trust_remote_code=True).hal
 以下是部分基于本仓库开发的开源项目：
 * [ChatGLM-MNN](https://github.com/wangzhaode/ChatGLM-MNN): 一个基于 MNN 的 ChatGLM-6B C++ 推理实现，支持根据显存大小自动分配计算任务给 GPU 和 CPU
 * [ChatGLM-Tuning](https://github.com/mymusise/ChatGLM-Tuning): 基于 LoRA 对 ChatGLM-6B 进行微调
+* [Humanable ChatGLM/GPT Fine-tuning | ChatGLM 微调](https://github.com/hscspring/hcgf)：基于 LoRA 进行微调

 以下是部分针对本项目的教程/文档：
 * [Windows部署文档](https://github.com/ZhangErling/ChatGLM-6B/blob/main/deployment_windows.md)
--- a/cli_demo.py
+++ b/cli_demo.py
@ -31,9 +31,9 @@ def main():
    print("欢迎使用 ChatGLM-6B 模型，输入内容即可进行对话，clear 清空对话历史，stop 终止程序")
    while True:
        query = input("\n用户：")
-        if query == "stop":
+        if query.strip() == "stop":
            break
-        if query == "clear":
+        if query.strip() == "clear":
            history = []
            os.system(clear_command)
            print("欢迎使用 ChatGLM-6B 模型，输入内容即可进行对话，clear 清空对话历史，stop 终止程序")
--- a/ptuning/main.py
+++ b/ptuning/main.py
@ -374,9 +374,10 @@ def main():
                )
                labels = [label.strip() for label in labels]
                output_prediction_file = os.path.join(training_args.output_dir, "generated_predictions.txt")
-                with open(output_prediction_file, "w") as writer:
+                with open(output_prediction_file, "w", encoding="utf-8") as writer:
                    for p, l in zip(predictions, labels):
-                        writer.write(json.dumps({"labels": l, "predict": p}, ensure_ascii=False))
+                        res = json.dumps({"labels": l, "predict": p}, ensure_ascii=False)
+                        writer.write(f"{res}\n")
    return results


@ -386,4 +387,4 @@ def _mp_fn(index):


 if __name__ == "__main__":
-    main()
+    main()