Merge pull request #1095 from chenqy4933/main

增加 InferLLM 的支持
2023-05-28 11:57:42 +08:00 · 2023-05-28 11:57:42 +08:00 · 9c71c05b3b
parent 8a8cb0aed4 0de717ba9a
commit 9c71c05b3b
1 changed files with 1 additions and 1 deletions
--- a/README.md
+++ b/README.md
@ -63,7 +63,7 @@ ChatGLM-6B 开源模型旨在与开源社区一起推动大模型技术发展，
 对 ChatGLM 进行加速的开源项目：
 * [ChatGLM-MNN](https://github.com/wangzhaode/ChatGLM-MNN): 一个基于 MNN 的 ChatGLM-6B C++ 推理实现，支持根据显存大小自动分配计算任务给 GPU 和 CPU
 * [JittorLLMs](https://github.com/Jittor/JittorLLMs)：最低3G显存或者没有显卡都可运行 ChatGLM-6B FP16， 支持Linux、windows、Mac部署
-
+* [InferLLM](https://github.com/MegEngine/InferLLM)：轻量级 C++ 推理，可以实现本地 x86，Arm 处理器上实时聊天，手机上也同样可以实时运行，运行内存只需要 4G
 基于或使用了 ChatGLM-6B 的开源项目：
 * [langchain-ChatGLM](https://github.com/imClumsyPanda/langchain-ChatGLM)：基于 langchain 的 ChatGLM 应用，实现基于可扩展知识库的问答
 * [闻达](https://github.com/l15y/wenda)：大型语言模型调用平台，基于 ChatGLM-6B 实现了类 ChatPDF 功能