ColossalAI/colossalai/kernel/cuda_native
Xuanlei Zhao dd2c28a323
[npu] use extension for op builder (#5172)
* update extension

* update cpu adam

* update is

* add doc for cpu adam

* update kernel

* update commit

* update flash

* update memory efficient

* update flash attn

* update flash attention loader

* update api

* fix

* update doc

* update example time limit

* reverse change

* fix doc

* remove useless kernel

* fix

* not use warning

* update

* update
2024-01-08 11:39:16 +08:00
..
csrc fix thrust-transform-reduce error (#5078) 2023-11-21 15:09:35 +08:00
__init__.py [npu] use extension for op builder (#5172) 2024-01-08 11:39:16 +08:00
layer_norm.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
multihead_attention.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
scaled_softmax.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00