Commit Graph

9 Commits (17cfa5714083a81a505c097f1c411cd28162d922)

Author SHA1 Message Date
Cuiqing Li 7d7ea2ef41
[Kernels] add necessary kernels (llama & bloom) for attention forward and kv-cache manager (#4485)
* added _vllm_rms_norm

* change place

* added tests

* added tests

* modify

* adding kernels

* added tests:

* adding kernels

* modify

* added

* updating kernels

* adding tests

* added tests

* kernel change

* submit

* modify

* added

* edit comments

* change name

* change commnets and fix import

* add

* added
2023-08-24 16:30:02 +08:00
zbian 7bc0afc901 updated flash attention usage 2023-03-20 17:57:04 +08:00
ver217 090f14fd6b
[misc] add reference (#2930)
* [misc] add reference

* [misc] add license
2023-02-28 18:07:24 +08:00
Frank Lee 918bc94b6b
[triton] added copyright information for flash attention (#2835)
* [triton] added copyright information for flash attention

* polish code
2023-02-21 11:25:57 +08:00
YuliangLiu0306 2059fdd6b0
[hotfix] add copyright for solver and device mesh (#2803)
* [hotfix] add copyright for solver and device mesh

* add readme

* add alpa license

* polish
2023-02-18 21:14:38 +08:00
binmakeswell d00d905b86
[NFC] polish license (#1999) 2022-11-22 16:26:47 +08:00
binmakeswell 8a29ce5443
polish license (#1522) 2022-09-01 15:31:58 +08:00
Jiarui Fang 8f74fbd9c9 polish license (#300)
* init shard param from shape tuple

* add more unitest for shard param
2022-03-11 15:50:28 +08:00
アマデウス 2ebaefc542
Initial commit 2021-10-29 00:19:45 +08:00