ColossalAI/tests/test_infer_ops/triton
Xu Kai c6295c3381
[Refactor] remove useless inference code (#5022)
* remove useless code

* fix quant model

* fix test import bug

* mv original inference legacy

* fix chatglm2
2023-11-10 14:47:06 +08:00
..
kernel_utils.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_bloom_context_attention.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_copy_kv_dest.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_layernorm_triton.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_llama_act_combine.py [moe] merge moe into main (#4978) 2023-11-02 02:21:24 +00:00
test_llama_context_attention.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_self_attention_nonfusion.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_softmax.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
test_token_attn_fwd.py [Refactor] remove useless inference code (#5022) 2023-11-10 14:47:06 +08:00
test_token_softmax.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00