8 Commits (ddcf58cacf9581d9c59a18f8276d52a061818fab)

Author SHA1 Message Date
Frank Lee 80eba05b0a [test] refactor tests with spawn (#3452) 2 years ago
zbian 7bc0afc901 updated flash attention usage 2 years ago
アマデウス 077a66dd81 updated attention kernel (#2133) 2 years ago
zbian 6877121377 updated flash attention api 2 years ago
oahzxl 9639ea88fc [kernel] more flexible flashatt interface (#1804) 2 years ago
oahzxl 501a9e9cd2 [hotfix] polish flash attention (#1802) 2 years ago
Jiarui Fang c248800359 [kernel] skip tests of flash_attn and triton when they are not available (#1798) 2 years ago
oahzxl 25952b67d7 [feat] add flash attention (#1762) 2 years ago