ColossalAI/colossalai/inference/tensor_parallel/policies
Jianghai ce7ade3882
[inference] chatglm2 infer demo (#4724)
* add chatglm2

* add

* gather needed kernels

* fix some bugs

* finish context forward

* finish context stage

* fix

* add

* pause

* add

* fix bugs

* finish chatglm

* fix bug

* change some logic

* fix bugs

* change some logics

* add

* add

* add

* fix

* fix tests

* fix
2023-09-22 11:12:50 +08:00
..
__init__.py [inference] chatglm2 infer demo (#4724) 2023-09-22 11:12:50 +08:00
bloom.py [feature] add gptq for inference (#4754) 2023-09-22 11:02:50 +08:00
chatglm2.py [inference] chatglm2 infer demo (#4724) 2023-09-22 11:12:50 +08:00
llama.py [feature] add gptq for inference (#4754) 2023-09-22 11:02:50 +08:00