ColossalAI/colossalai/inference/modeling/policy
yuehuayingxueluo 2a718c8be8
Optimized the execution interval time between cuda kernels caused by view and memcopy (#5390)
* opt_view_and_memcopy

* fix bugs in ci

* fix ci bugs

* update benchmark scripts

* fix ci bugs
2024-02-21 13:23:57 +08:00
__init__.py [Fix/Infer] Remove unused deps and revise requirements (#5341) 2024-02-06 17:27:45 +08:00
nopadding_llama.py Optimized the execution interval time between cuda kernels caused by view and memcopy (#5390) 2024-02-21 13:23:57 +08:00