ColossalAI/extensions/csrc
Steve Luo ed431de4e4
fix rmsnorm template function invocation problem(template function partial specialization is not allowed in Cpp) and luckily pass e2e precision test (#5454)
2024-03-13 16:00:55 +08:00
Name | Last commit | Date
arm | [feat] refactored extension module (#5298) | 2024-01-25 17:01:48 +08:00
common | optimize rmsnorm: add vectorized elementwise op, feat loop unrolling (#5441) | 2024-03-12 17:48:02 +08:00
cuda | fix rmsnorm template function invocation problem(template function partial specialization is not allowed in Cpp) and luckily pass e2e precision test (#5454) | 2024-03-13 16:00:55 +08:00
x86 | refactor code | 2024-03-11 17:06:57 +08:00
__init__.py | [feat] refactored extension module (#5298) | 2024-01-25 17:01:48 +08:00
scaled_softmax.py | [feat] refactored extension module (#5298) | 2024-01-25 17:01:48 +08:00