Making large AI models cheaper, faster and more accessible
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
zbian 7bc0afc901 updated flash attention usage 2 years ago
..
cuda_native updated flash attention usage 2 years ago
jit [misc] add reference (#2930) 2 years ago
__init__.py [setup] support pre-build and jit-build of cuda kernels (#2374) 2 years ago
op_builder