ChatGLM-6B/README.md

# ChatGLM-6B

## 介绍

ChatGLM-6B 是一个开源的、支持中英双语的对话语言模型，基于 [General Language Model (GLM)](https://github.com/THUDM/GLM) 架构，具有 62 亿参数。结合模型量化技术，用户可以在消费级的显卡上进行本地部署（INT4 量化级别下最低只需 6GB 显存）。
ChatGLM-6B 使用了和 ChatGPT 相似的技术，针对中文问答和对话进行了优化。经过约 1T 标识符的中英双语训练，辅以监督微调、反馈自助、人类反馈强化学习等技术的加持，62 亿参数的 ChatGLM-6B 已经能生成相当符合人类偏好的回答。更多信息请参考我们的[博客](https://chatglm.cn/blog)。

不过，由于ChatGLM-6B的规模较小，目前已知其具有相当多的[**局限性**](#局限性)，如事实性/数学逻辑错误，可能生成有害/有偏见内容，较弱的上下文能力，自我认知混乱，以及对英文指示生成与中文指示完全矛盾的内容。请大家在使用前了解这些问题，以免产生误解。

*Read this in [English](README_en.md).*

## 更新信息
**[2023/03/19]** 增加流式输出接口`stream_chat`，已更新到网页版和命令行demo。修复输出中的中文标点。增加量化后的模型 [ChatGLM-6B-INT4](https://huggingface.co/THUDM/chatglm-6b-int4)

## 使用方式

### 硬件需求

| **量化等级**    | **最低 GPU 显存** |
| -------------- | ----------------- |
| FP16（无量化）   | 13 GB             |
| INT8           | 10 GB              |
| INT4           | 6 GB               |

### 环境安装

使用 pip 安装依赖：`pip install -r requirements.txt`，其中 `transformers` 库版本推荐为 `4.26.1`，但理论上不低于 `4.23.1` 即可。

### 代码调用 

可以通过如下代码调用 ChatGLM-6B 模型来生成对话：

```python
>>> from transformers import AutoTokenizer, AutoModel
>>> tokenizer = AutoTokenizer.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True)
>>> model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True).half().cuda()
>>> model = model.eval()
>>> response, history = model.chat(tokenizer, "你好", history=[])
>>> print(response)
你好👋!我是人工智能助手 ChatGLM-6B,很高兴见到你,欢迎问我任何问题。
>>> response, history = model.chat(tokenizer, "晚上睡不着应该怎么办", history=history)
>>> print(response)
晚上睡不着可能会让你感到焦虑或不舒服,但以下是一些可以帮助你入睡的方法:

1. 制定规律的睡眠时间表:保持规律的睡眠时间表可以帮助你建立健康的睡眠习惯,使你更容易入睡。尽量在每天的相同时间上床,并在同一时间起床。
2. 创造一个舒适的睡眠环境:确保睡眠环境舒适,安静,黑暗且温度适宜。可以使用舒适的床上用品,并保持房间通风。
3. 放松身心:在睡前做些放松的活动,例如泡个热水澡,听些轻柔的音乐,阅读一些有趣的书籍等,有助于缓解紧张和焦虑,使你更容易入睡。
4. 避免饮用含有咖啡因的饮料:咖啡因是一种刺激性物质,会影响你的睡眠质量。尽量避免在睡前饮用含有咖啡因的饮料,例如咖啡,茶和可乐。
5. 避免在床上做与睡眠无关的事情:在床上做些与睡眠无关的事情,例如看电影,玩游戏或工作等,可能会干扰你的睡眠。
6. 尝试呼吸技巧:深呼吸是一种放松技巧,可以帮助你缓解紧张和焦虑,使你更容易入睡。试着慢慢吸气,保持几秒钟,然后缓慢呼气。

如果这些方法无法帮助你入睡,你可以考虑咨询医生或睡眠专家,寻求进一步的建议。
```
完整的模型实现可以在 [Hugging Face Hub](https://huggingface.co/THUDM/chatglm-6b) 上查看。如果你从Hugging Face Hub上下载checkpoint的速度较慢，也可以从[这里](https://cloud.tsinghua.edu.cn/d/fb9f16d6dc8f482596c2/)手动下载。

### Demo

我们提供了一个基于 [Gradio](https://gradio.app) 的网页版 Demo 和一个命令行 Demo。使用时首先需要下载本仓库：

```shell
git clone https://github.com/THUDM/ChatGLM-6B
cd ChatGLM-6B
```

#### 网页版 Demo

![web-demo](resources/web-demo.png)

首先安装 Gradio：`pip install gradio`，然后运行仓库中的 [web_demo.py](web_demo.py)： 

```shell
python web_demo.py
```

程序会运行一个 Web Server，并输出地址。在浏览器中打开输出的地址即可使用。

感谢[@AdamBear](https://github.com/AdamBear) 实现了基于Streamlit的网页版demo，运行方式见[#117](https://github.com/THUDM/ChatGLM-6B/pull/117).

#### 命令行 Demo

![cli-demo](resources/cli-demo.png)

运行仓库中 [cli_demo.py](cli_demo.py)：

```shell
python cli_demo.py
```

程序会在命令行中进行交互式的对话，在命令行中输入指示并回车即可生成回复，输入`clear`可以清空对话历史，输入`stop`终止程序。

## 低成本部署
### 模型量化
默认情况下，模型以 FP16 精度加载，运行上述代码需要大概 13GB 显存。如果你的 GPU 显存有限，可以尝试以量化方式加载模型，使用方法如下：

```python
# 按需修改，目前只支持 4/8 bit 量化
model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True).half().quantize(4).cuda()
```

进行 2 至 3 轮对话后，8-bit 量化下 GPU 显存占用约为 10GB，4-bit 量化下仅需 6GB 占用。随着对话轮数的增多，对应消耗显存也随之增长，由于采用了相对位置编码，理论上 ChatGLM-6B 支持无限长的 context-length，但总长度超过 2048（训练长度）后性能会逐渐下降。

模型量化会带来一定的性能损失，经过测试，ChatGLM-6B 在 4-bit 量化下仍然能够进行自然流畅的生成。使用 [GPT-Q](https://arxiv.org/abs/2210.17323) 等量化方案可以进一步压缩量化精度/提升相同量化精度下的模型性能，欢迎大家提出对应的 Pull Request。

**[2023/03/19]** 量化过程需要在内存中首先加载fp16格式的模型，消耗大概13GB的内存。如果你的内存不足的话，可以直接加载量化后的模型，仅需大概5.2GB的内存：
```python
model = AutoModel.from_pretrained("THUDM/chatglm-6b-int4", trust_remote_code=True).half().cuda()
```

### CPU部署
如果你没有GPU硬件的话，也可以在CPU上进行推理，但是推理速度会更慢。使用方法如下（需要大概32GB内存）
```python
model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True).float()
```

**[2023/03/19]** 如果你的内存不足，可以直接加载量化后的模型：
```python
model = AutoModel.from_pretrained("THUDM/chatglm-6b-int4",trust_remote_code=True).float()
```

如果遇到了报错 `Could not find module 'nvcuda.dll'` 或者 `RuntimeError: Unknown platform: darwin` (MacOS) 的话请参考这个[Issue](https://github.com/THUDM/ChatGLM-6B/issues/6#issuecomment-1470060041).

## ChatGLM-6B示例

以下是一些使用`web_demo.py`得到的示例截图。更多ChatGLM-6B的可能，等待你来探索发现！

<details><summary><b>自我认知</b></summary>

![](examples/self-introduction.png)

</details>

<details><summary><b>提纲写作</b></summary>

![](examples/blog-outline.png)

</details>

<details><summary><b>文案写作</b></summary>

![](examples/ad-writing-2.png)

![](examples/comments-writing.png)

</details>

<details><summary><b>邮件写作助手</b></summary>

![](examples/email-writing-1.png)

![](examples/email-writing-2.png)

</details>

<details><summary><b>信息抽取</b></summary>

![](examples/information-extraction.png)

</details>

<details><summary><b>角色扮演</b></summary>

![](examples/role-play.png)

</details>

<details><summary><b>评论比较</b></summary>

![](examples/sport.png)

</details>

<details><summary><b>旅游向导</b></summary>

![](examples/tour-guide.png)

</details>

## 局限性

由于ChatGLM-6B的小规模，其能力仍然有许多局限性。以下是我们目前发现的一些问题：

- 模型容量较小：6B的小容量，决定了其相对较弱的模型记忆和语言能力。在面对许多事实性知识任务时，ChatGLM-6B可能会生成不正确的信息；它也不擅长逻辑类问题（如数学、编程）的解答。
    <details><summary><b>点击查看例子</b></summary>
    
    ![](limitations/factual_error.png)
    
    ![](limitations/math_error.png)
    
    </details>
  
- 产生有害说明或有偏见的内容：ChatGLM-6B只是一个初步与人类意图对齐的语言模型，可能会生成有害、有偏见的内容。（内容可能具有冒犯性，此处不展示）

- 英文能力不足：ChatGLM-6B 训练时使用的指示/回答大部分都是中文的，仅有极小一部分英文内容。因此，如果输入英文指示，回复的质量远不如中文，甚至与中文指示下的内容矛盾，并且出现中英夹杂的情况。

- 易被误导，对话能力较弱：ChatGLM-6B 对话能力还比较弱，而且 “自我认知” 存在问题，并很容易被误导并产生错误的言论。例如当前版本的模型在被误导的情况下，会在自我认知上发生偏差。
    <details><summary><b>点击查看例子</b></summary>

    ![](limitations/self-confusion_google.jpg)
    
    ![](limitations/self-confusion_openai.jpg)
    
    ![](limitations/self-confusion_tencent.jpg)
    
    </details>

## 协议

本仓库的代码依照 [Apache-2.0](LICENSE) 协议开源，ChatGLM-6B 模型的权重的使用则需要遵循 [Model License](MODEL_LICENSE)。

## 引用

如果你觉得我们的工作有帮助的话，请考虑引用下列论文

```
@inproceedings{
  zeng2023glm-130b,
  title={{GLM}-130B: An Open Bilingual Pre-trained Model},
  author={Aohan Zeng and Xiao Liu and Zhengxiao Du and Zihan Wang and Hanyu Lai and Ming Ding and Zhuoyi Yang and Yifan Xu and Wendi Zheng and Xiao Xia and Weng Lam Tam and Zixuan Ma and Yufei Xue and Jidong Zhai and Wenguang Chen and Zhiyuan Liu and Peng Zhang and Yuxiao Dong and Jie Tang},
  booktitle={The Eleventh International Conference on Learning Representations (ICLR)},
  year={2023},
  url={https://openreview.net/forum?id=-Aw0rrrPUF}
}
```
```
@inproceedings{du2022glm,
  title={GLM: General Language Model Pretraining with Autoregressive Blank Infilling},
  author={Du, Zhengxiao and Qian, Yujie and Liu, Xiao and Ding, Ming and Qiu, Jiezhong and Yang, Zhilin and Tang, Jie},
  booktitle={Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
  pages={320--335},
  year={2022}
}
```
-												Init commit

											
										
										
											2 years ago
+								# ChatGLM-6B
-												Update README.md
											
										
										
											2 years ago
-												Init commit

											
										
										
											2 years ago
+								## 介绍
-												Update README.md
											
										
										
											2 years ago
-												Update limitations

											
										
										
											2 years ago
+								ChatGLM-6B 是一个开源的、支持中英双语的对话语言模型，基于 [General Language Model (GLM)](https://github.com/THUDM/GLM) 架构，具有 62 亿参数。结合模型量化技术，用户可以在消费级的显卡上进行本地部署（INT4 量化级别下最低只需 6GB 显存）。
 								ChatGLM-6B 使用了和 ChatGPT 相似的技术，针对中文问答和对话进行了优化。经过约 1T 标识符的中英双语训练，辅以监督微调、反馈自助、人类反馈强化学习等技术的加持，62 亿参数的 ChatGLM-6B 已经能生成相当符合人类偏好的回答。更多信息请参考我们的[博客](https://chatglm.cn/blog)。
-												Update limitations

											
										
										
											2 years ago
+								不过，由于ChatGLM-6B的规模较小，目前已知其具有相当多的[**局限性**](#局限性)，如事实性/数学逻辑错误，可能生成有害/有偏见内容，较弱的上下文能力，自我认知混乱，以及对英文指示生成与中文指示完全矛盾的内容。请大家在使用前了解这些问题，以免产生误解。
-												Update README

											
										
										
											2 years ago
-												Add link to English README

											
										
										
											2 years ago
+								*Read this in [English](README_en.md).*
-												Update README

											
										
										
											2 years ago
+								## 更新信息
-												Update README

											
										
										
											2 years ago
+								**[2023/03/19]** 增加流式输出接口`stream_chat`，已更新到网页版和命令行demo。修复输出中的中文标点。增加量化后的模型 [ChatGLM-6B-INT4](https://huggingface.co/THUDM/chatglm-6b-int4)
-												Update README

											
										
										
											2 years ago
 								## 使用方式
 								### 硬件需求
-												Update README.md
											
										
										
											2 years ago
 								| **量化等级**    | **最低 GPU 显存** |
-												Update README.md
											
										
										
											2 years ago
+								| -------------- | ----------------- |
 								| FP16（无量化）   | 13 GB             |
 								| INT8           | 10 GB              |
 								| INT4           | 6 GB               |
-												Init commit

											
										
										
											2 years ago
-												Update README.md
											
										
										
											2 years ago
+								### 环境安装
 								使用 pip 安装依赖：`pip install -r requirements.txt`，其中 `transformers` 库版本推荐为 `4.26.1`，但理论上不低于 `4.23.1` 即可。
-												Init commit

											
										
										
											2 years ago
 								### 代码调用
-												Update README.md
											
										
										
											2 years ago
+								可以通过如下代码调用 ChatGLM-6B 模型来生成对话：
-												Init commit

											
										
										
											2 years ago
-												Update README.md
											
										
										
											2 years ago
+								```python
-												Update README.md
											
										
										
											2 years ago
+								>>> from transformers import AutoTokenizer, AutoModel
 								>>> tokenizer = AutoTokenizer.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True)
 								>>> model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True).half().cuda()
-												Update README

											
										
										
											2 years ago
+								>>> model = model.eval()
-												Update README.md
											
										
										
											2 years ago
+								>>> response, history = model.chat(tokenizer, "你好", history=[])
 								>>> print(response)
 								你好👋!我是人工智能助手 ChatGLM-6B,很高兴见到你,欢迎问我任何问题。
 								>>> response, history = model.chat(tokenizer, "晚上睡不着应该怎么办", history=history)
 								>>> print(response)
 								晚上睡不着可能会让你感到焦虑或不舒服,但以下是一些可以帮助你入睡的方法:
 . 制定规律的睡眠时间表:保持规律的睡眠时间表可以帮助你建立健康的睡眠习惯,使你更容易入睡。尽量在每天的相同时间上床,并在同一时间起床。
 . 创造一个舒适的睡眠环境:确保睡眠环境舒适,安静,黑暗且温度适宜。可以使用舒适的床上用品,并保持房间通风。
 . 放松身心:在睡前做些放松的活动,例如泡个热水澡,听些轻柔的音乐,阅读一些有趣的书籍等,有助于缓解紧张和焦虑,使你更容易入睡。
 . 避免饮用含有咖啡因的饮料:咖啡因是一种刺激性物质,会影响你的睡眠质量。尽量避免在睡前饮用含有咖啡因的饮料,例如咖啡,茶和可乐。
 . 避免在床上做与睡眠无关的事情:在床上做些与睡眠无关的事情,例如看电影,玩游戏或工作等,可能会干扰你的睡眠。
 . 尝试呼吸技巧:深呼吸是一种放松技巧,可以帮助你缓解紧张和焦虑,使你更容易入睡。试着慢慢吸气,保持几秒钟,然后缓慢呼气。
 								如果这些方法无法帮助你入睡,你可以考虑咨询医生或睡眠专家,寻求进一步的建议。
-												Init commit

											
										
										
											2 years ago
+								```
-												Add Cloud link

											
										
										
											2 years ago
+								完整的模型实现可以在 [Hugging Face Hub](https://huggingface.co/THUDM/chatglm-6b) 上查看。如果你从Hugging Face Hub上下载checkpoint的速度较慢，也可以从[这里](https://cloud.tsinghua.edu.cn/d/fb9f16d6dc8f482596c2/)手动下载。
-												Init commit

											
										
										
											2 years ago
 								### Demo
-												Update README

											
										
										
											2 years ago
+								我们提供了一个基于 [Gradio](https://gradio.app) 的网页版 Demo 和一个命令行 Demo。使用时首先需要下载本仓库：
-												Update README.md
											
										
										
											2 years ago
-												Update README

											
										
										
											2 years ago
+								```shell
 								git clone https://github.com/THUDM/ChatGLM-6B
 								cd ChatGLM-6B
 								```
-												Init commit

											
										
										
											2 years ago
 								#### 网页版 Demo
 								![web-demo](resources/web-demo.png)
-												Update README.md
											
										
										
											2 years ago
+								首先安装 Gradio：`pip install gradio`，然后运行仓库中的 [web_demo.py](web_demo.py)：
-												Init commit

											
										
										
											2 years ago
 								```shell
 								python web_demo.py
 								```
 								程序会运行一个 Web Server，并输出地址。在浏览器中打开输出的地址即可使用。
-												Update README.md

											
										
										
											2 years ago
+								感谢[@AdamBear](https://github.com/AdamBear) 实现了基于Streamlit的网页版demo，运行方式见[#117](https://github.com/THUDM/ChatGLM-6B/pull/117).
-												Init commit

											
										
										
											2 years ago
+								#### 命令行 Demo
 								![cli-demo](resources/cli-demo.png)
-												Update README

											
										
										
											2 years ago
+								运行仓库中 [cli_demo.py](cli_demo.py)：
-												Init commit

											
										
										
											2 years ago
 								```shell
 								python cli_demo.py
 								```
 								程序会在命令行中进行交互式的对话，在命令行中输入指示并回车即可生成回复，输入`clear`可以清空对话历史，输入`stop`终止程序。
-												Add CPU support

											
										
										
											2 years ago
+								## 低成本部署
 								### 模型量化
-												Update README.md
											
										
										
											2 years ago
+								默认情况下，模型以 FP16 精度加载，运行上述代码需要大概 13GB 显存。如果你的 GPU 显存有限，可以尝试以量化方式加载模型，使用方法如下：
-												Init commit

											
										
										
											2 years ago
 								```python
-												Update README.md
											
										
										
											2 years ago
+								# 按需修改，目前只支持 4/8 bit 量化
-												Update README.md

											
										
										
											2 years ago
+								model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True).half().quantize(4).cuda()
 								```
-												Update README.md
											
										
										
											2 years ago
+								进行 2 至 3 轮对话后，8-bit 量化下 GPU 显存占用约为 10GB，4-bit 量化下仅需 6GB 占用。随着对话轮数的增多，对应消耗显存也随之增长，由于采用了相对位置编码，理论上 ChatGLM-6B 支持无限长的 context-length，但总长度超过 2048（训练长度）后性能会逐渐下降。
-												Add README_en

											
										
										
											2 years ago
+								模型量化会带来一定的性能损失，经过测试，ChatGLM-6B 在 4-bit 量化下仍然能够进行自然流畅的生成。使用 [GPT-Q](https://arxiv.org/abs/2210.17323) 等量化方案可以进一步压缩量化精度/提升相同量化精度下的模型性能，欢迎大家提出对应的 Pull Request。
-												Add CPU support

											
										
										
											2 years ago
-												Update README

											
										
										
											2 years ago
+								**[2023/03/19]** 量化过程需要在内存中首先加载fp16格式的模型，消耗大概13GB的内存。如果你的内存不足的话，可以直接加载量化后的模型，仅需大概5.2GB的内存：
 								```python
 								model = AutoModel.from_pretrained("THUDM/chatglm-6b-int4", trust_remote_code=True).half().cuda()
 								```
-												Add CPU support

											
										
										
											2 years ago
+								### CPU部署
-												Update README

											
										
										
											2 years ago
+								如果你没有GPU硬件的话，也可以在CPU上进行推理，但是推理速度会更慢。使用方法如下（需要大概32GB内存）
-												Add CPU support

											
										
										
											2 years ago
+								```python
 								model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True).float()
 								```
-												Init commit

											
										
										
											2 years ago
-												Update README

											
										
										
											2 years ago
+								**[2023/03/19]** 如果你的内存不足，可以直接加载量化后的模型：
-												Add CPU support with less than 16G memory

											
										
										
											2 years ago
+								```python
-												Update README

											
										
										
											2 years ago
+								model = AutoModel.from_pretrained("THUDM/chatglm-6b-int4",trust_remote_code=True).float()
-												Add CPU support with less than 16G memory

											
										
										
											2 years ago
+								```
-												Update README.md

											
										
										
											2 years ago
+								如果遇到了报错 `Could not find module 'nvcuda.dll'` 或者 `RuntimeError: Unknown platform: darwin` (MacOS) 的话请参考这个[Issue](https://github.com/THUDM/ChatGLM-6B/issues/6#issuecomment-1470060041).
-												Add MacOS solution

											
										
										
											2 years ago
-												Add examples to README.md

											
										
										
											2 years ago
+								## ChatGLM-6B示例
-												Update example description

											
										
										
											2 years ago
+								以下是一些使用`web_demo.py`得到的示例截图。更多ChatGLM-6B的可能，等待你来探索发现！
 								<details><summary><b>自我认知</b></summary>
-												Add examples to README.md

											
										
										
											2 years ago
 								![](examples/self-introduction.png)
-												Update example description

											
										
										
											2 years ago
+								</details>
 								<details><summary><b>提纲写作</b></summary>
-												Add examples to README.md

											
										
										
											2 years ago
+								![](examples/blog-outline.png)
-												Update example description

											
										
										
											2 years ago
+								</details>
 								<details><summary><b>文案写作</b></summary>
-												Add examples to README.md

											
										
										
											2 years ago
+								![](examples/ad-writing-2.png)
 								![](examples/comments-writing.png)
-												Update example description

											
										
										
											2 years ago
+								</details>
 								<details><summary><b>邮件写作助手</b></summary>
-												Add examples to README.md

											
										
										
											2 years ago
+								![](examples/email-writing-1.png)
 								![](examples/email-writing-2.png)
-												Update example description

											
										
										
											2 years ago
+								</details>
 								<details><summary><b>信息抽取</b></summary>
-												Add examples to README.md

											
										
										
											2 years ago
+								![](examples/information-extraction.png)
-												Update example description

											
										
										
											2 years ago
+								</details>
 								<details><summary><b>角色扮演</b></summary>
-												Add examples to README.md

											
										
										
											2 years ago
+								![](examples/role-play.png)
-												Update example description

											
										
										
											2 years ago
+								</details>
-												Update example description

											
										
										
											2 years ago
+								<details><summary><b>评论比较</b></summary>
-												Update example description

											
										
										
											2 years ago
-												Add examples to README.md

											
										
										
											2 years ago
+								![](examples/sport.png)
-												Update example description

											
										
										
											2 years ago
+								</details>
 								<details><summary><b>旅游向导</b></summary>
-												Add examples to README.md

											
										
										
											2 years ago
+								![](examples/tour-guide.png)
 								</details>
-												Update limitations

											
										
										
											2 years ago
+								## 局限性
 								由于ChatGLM-6B的小规模，其能力仍然有许多局限性。以下是我们目前发现的一些问题：
 								- 模型容量较小：6B的小容量，决定了其相对较弱的模型记忆和语言能力。在面对许多事实性知识任务时，ChatGLM-6B可能会生成不正确的信息；它也不擅长逻辑类问题（如数学、编程）的解答。
 								    <details><summary><b>点击查看例子</b></summary>
 								    ![](limitations/factual_error.png)
 								    ![](limitations/math_error.png)
 								    </details>
 								- 产生有害说明或有偏见的内容：ChatGLM-6B只是一个初步与人类意图对齐的语言模型，可能会生成有害、有偏见的内容。（内容可能具有冒犯性，此处不展示）
-												Update limitations

											
										
										
											2 years ago
+								- 英文能力不足：ChatGLM-6B 训练时使用的指示/回答大部分都是中文的，仅有极小一部分英文内容。因此，如果输入英文指示，回复的质量远不如中文，甚至与中文指示下的内容矛盾，并且出现中英夹杂的情况。
-												Update limitations

											
										
										
											2 years ago
+								- 易被误导，对话能力较弱：ChatGLM-6B 对话能力还比较弱，而且 “自我认知” 存在问题，并很容易被误导并产生错误的言论。例如当前版本的模型在被误导的情况下，会在自我认知上发生偏差。
 								    <details><summary><b>点击查看例子</b></summary>
 								    ![](limitations/self-confusion_google.jpg)
 								    ![](limitations/self-confusion_openai.jpg)
 								    ![](limitations/self-confusion_tencent.jpg)
 								    </details>
-												Update README.md
											
										
										
											2 years ago
+								## 协议
 								本仓库的代码依照 [Apache-2.0](LICENSE) 协议开源，ChatGLM-6B 模型的权重的使用则需要遵循 [Model License](MODEL_LICENSE)。
-												Init commit

											
										
										
											2 years ago
+								## 引用
 								如果你觉得我们的工作有帮助的话，请考虑引用下列论文
 								```
 								@inproceedings{
 								  zeng2023glm-130b,
 								  title={{GLM}-130B: An Open Bilingual Pre-trained Model},
 								  author={Aohan Zeng and Xiao Liu and Zhengxiao Du and Zihan Wang and Hanyu Lai and Ming Ding and Zhuoyi Yang and Yifan Xu and Wendi Zheng and Xiao Xia and Weng Lam Tam and Zixuan Ma and Yufei Xue and Jidong Zhai and Wenguang Chen and Zhiyuan Liu and Peng Zhang and Yuxiao Dong and Jie Tang},
 								  booktitle={The Eleventh International Conference on Learning Representations (ICLR)},
 								  year={2023},
 								  url={https://openreview.net/forum?id=-Aw0rrrPUF}
 								}
 								```
 								```
 								@inproceedings{du2022glm,
 								  title={GLM: General Language Model Pretraining with Autoregressive Blank Infilling},
 								  author={Du, Zhengxiao and Qian, Yujie and Liu, Xiao and Ding, Ming and Qiu, Jiezhong and Yang, Zhilin and Tang, Jie},
 								  booktitle={Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
 								  pages={320--335},
 								  year={2022}
 								}
-												Update README.md
											
										
										
											2 years ago
+								```