ColossalAI/colossalai/inference/tensor_parallel/policies
Zhongkai Zhao 70885d707d
[hotfix] Support extra_kwargs in ShardConfig (#5031)
* [refactor]: replace inference args with extra_kwargs in ShardConfig

* modify shardconfig

* polish code

* fix policy bug in llama

* fix bug in auto policy

* remove setattr in ShardConfig
2023-11-10 10:49:50 +08:00
__init__.py [inference] chatglm2 infer demo (#4724) 2023-09-22 11:12:50 +08:00
bloom.py [hotfix] Support extra_kwargs in ShardConfig (#5031) 2023-11-10 10:49:50 +08:00
chatglm2.py [Inference] Fix bug in ChatGLM2 Tensor Parallelism (#5014) 2023-11-07 15:01:50 +08:00
llama.py [hotfix] Support extra_kwargs in ShardConfig (#5031) 2023-11-10 10:49:50 +08:00