ColossalAI/colossalai/kernel/cuda_native/csrc
Jun Gao dce05da535 fix thrust-transform-reduce error (#5078) 1 year ago
gptq [NFC] polish code style (#4799) 1 year ago
kernels fix thrust-transform-reduce error (#5078) 1 year ago
smoothquant [inference] Add smmoothquant for llama (#4904) 1 year ago
colossal_C_frontend.cpp
compat.h [misc] update pre-commit and run all files (#4752) 1 year ago
cpu_adam.cpp [kernel] support pure fp16 for cpu adam and update gemini optim tests (#4921) 1 year ago
cpu_adam.h [npu] add npu support for gemini and zero (#5067) 1 year ago
cpu_adam_arm.cpp [npu] add npu support for gemini and zero (#5067) 1 year ago
cpu_adam_arm.h [npu] add npu support for gemini and zero (#5067) 1 year ago
layer_norm_cuda.cpp [misc] update pre-commit and run all files (#4752) 1 year ago
layer_norm_cuda_kernel.cu [misc] update pre-commit and run all files (#4752) 1 year ago
moe_cuda.cpp [misc] update pre-commit and run all files (#4752) 1 year ago
moe_cuda_kernel.cu [misc] update pre-commit and run all files (#4752) 1 year ago
multi_tensor_adam.cu
multi_tensor_apply.cuh
multi_tensor_l2norm_kernel.cu [misc] update pre-commit and run all files (#4752) 1 year ago
multi_tensor_lamb.cu [misc] update pre-commit and run all files (#4752) 1 year ago
multi_tensor_scale_kernel.cu [misc] update pre-commit and run all files (#4752) 1 year ago
multi_tensor_sgd_kernel.cu [misc] update pre-commit and run all files (#4752) 1 year ago
multihead_attention_1d.cpp
multihead_attention_1d.h
scaled_masked_softmax.cpp [misc] update pre-commit and run all files (#4752) 1 year ago
scaled_masked_softmax.h [misc] update pre-commit and run all files (#4752) 1 year ago
scaled_masked_softmax_cuda.cu
scaled_upper_triang_masked_softmax.cpp
scaled_upper_triang_masked_softmax.h [misc] update pre-commit and run all files (#4752) 1 year ago
scaled_upper_triang_masked_softmax_cuda.cu
type_shim.h