Commit Graph

18 Commits (c9a7b2af800827f879110609669746dd65107161)

Author SHA1 Message Date
duzx16 4f95c09a3b Add multi-gpu deployment 2023-04-16 21:37:33 +08:00
duzx16 90f2e47f54 Merge branch 'dev' into dev_multi_gpu (Conflicts: README.md, api.py, cli_demo.py, requirements.txt) 2023-04-16 21:14:02 +08:00
duzx16 f1407bec5f Set share=False 2023-04-09 20:41:18 +08:00
duzx16 2833546339 Use chatbot web demo 2023-04-06 17:01:24 +08:00
saber 6a5267aef7 bugfix: in multi-GPU deployment on Linux, weight and input are not on the same device, causing a RuntimeError 2023-03-27 20:51:05 +08:00
saber 8101d75ab8 fix typo 2023-03-26 15:29:15 +08:00
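saber 8826b947c3 Multi-GPU support: if the model folder has no index.json, the model is automatically saved to multi_gpu_model_cache_dir to support multiple GPUs 2023-03-26 15:13:44 +08:00
saber 4ee042a8e6 Extract the device_map configuration logic; configure device_map automatically based on the number of GPUs and adapt automatically to all models 2023-03-26 13:44:10 +08:00
lichuang de9f26c201 Initial commit; supports multi-GPU deployment. 2023-03-23 10:11:52 +08:00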
Aohan Zeng c2ff5be358 Update web_demo.py 2023-03-20 15:34:36 +08:00
duzx16 2ed89f3898 Add support for streaming output 2023-03-19 14:33:05 +08:00
GanymedeNil 702c2ca2a8 Merge branch 'main' into main 2023-03-16 12:01:31 +08:00
duzx16 e565185c29 Add inbrower for web demo 2023-03-16 00:44:07 +08:00
GanymedeNil d11eb5213e Add parameter support for Maximum length, Top P and Temperature in web demo 2023-03-15 18:07:17 +08:00
duzx16 584be64161 Fix default arguments of web demo 2023-03-14 20:30:25 +08:00
duzx16 6113d4c02f Add markdown for web demo 2023-03-14 14:55:49 +08:00
duzx16 a2f8bec32b Update web_demo.py 2023-03-13 23:38:20 +08:00
duzx16 6ff2b4b832 Init commit 2023-03-13 20:06:14 +08:00