58 Commits (8754abae24dbcc492d2992d1091428592b615285)

Author SHA1 Message Date
linsj20 91fa553775 [Feature] qlora support (#5586) 7 months ago
flybird11111 8954a0c2e2 [LowLevelZero] low level zero support lora (#5153) 7 months ago
Baizhou Zhang 14b0d4c7e5 [lora] add lora APIs for booster, support lora for TorchDDP (#4981) 7 months ago
Wang Binluo 0d0a582033
[shardformer] update transformers (#5583) 7 months ago
Hongxin Liu 3788fefc7a
[zero] support multiple (partial) backward passes (#5596) 7 months ago
Yuanheng Zhao 1dedb57747
[Fix/Infer] Remove unused deps and revise requirements (#5341) 10 months ago
Frank Lee 027aa1043f
[doc] updated inference readme (#5343) 10 months ago
yuehuayingxueluo 8daee26989 [Inference] Add the logic of the inference engine (#5173) 11 months ago
Zhongkai Zhao 64519eb830
[doc] Update required third-party library list for testing and torch comptibility checking (#5207) 11 months ago
Hongxin Liu 1cd7efc520
[inference] refactor examples and fix schedule (#5077) 1 year ago
Xu Kai fb103cfd6e
[inference] update examples and engine (#5073) 1 year ago
Cuiqing Li (李崔卿) bce919708f
[Kernels]added flash-decoidng of triton (#5063) 1 year ago
Xu Kai fd6482ad8c
[inference] Refactor inference architecture (#5057) 1 year ago
Cuiqing Li (李崔卿) 28052a71fb
[Kernels]Update triton kernels into 2.1.0 (#5046) 1 year ago
Jianghai cf579ff46d
[Inference] Dynamic Batching Inference, online and offline (#4953) 1 year ago
Cuiqing Li 3a41e8304e
[Refactor] Integrated some lightllm kernels into token-attention (#4946) 1 year ago
Xu Kai 946ab56c48
[feature] add gptq for inference (#4754) 1 year ago
flybird11111 7486ed7d3a
[shardformer] update llama2/opt finetune example and fix llama2 policy (#4645) 1 year ago
Ying Liu 9f852f2489 keep requirements same with main branch 1 year ago
yingliu-hpc 1467e3b41b
[coati] add chatglm model (#4539) 1 year ago
ver217 922302263b [misc] update requirements 1 year ago
flybird1111 d2cd48e0be [shardformer] test all optimizations (#4399) 1 year ago
flybird1111 906426cb44 [Shardformer] Merge flash attention branch to pipeline branch (#4362) 1 year ago
Hongxin Liu d921ce8391 [shardformer] support inplace sharding (#4251) 1 year ago
flybird1111 458ae331ad
[kernel] updated unittests for coloattention (#4389) 1 year ago
binmakeswell 089c365fa0
[doc] add Series A Funding and NeurIPS news (#4377) 1 year ago
Hongxin Liu fc5cef2c79
[lazy] support init on cuda (#4269) 1 year ago
wukong1992 c1c672d0f0 [shardformer] shardformer support t5 model (#3994) 1 year ago
Frank Lee 84500b7799
[workflow] fixed testmon cache in build CI (#3806) 2 years ago
Hongxin Liu afb239bbf8
[devops] update torch version of CI (#3725) 2 years ago
Frank Lee 80eba05b0a
[test] refactor tests with spawn (#3452) 2 years ago
Frank Lee 1beb85cc25
[checkpoint] refactored the API and added safetensors support (#3427) 2 years ago
アマデウス e78a1e949a
fix torch 2.0 compatibility (#3346) 2 years ago
CsRic 052b03e83f
limit torch version (#3213) 2 years ago
HELSON 1216d1e7bd
[tests] diffuser models in model zoo (#3136) 2 years ago
Frank Lee 93fdd35b5e
[build] fixed the doc build process (#2618) 2 years ago
Frank Lee 8518263b80
[test] fixed the triton version for testing (#2608) 2 years ago
Frank Lee 53bb8682a2
[worfklow] added coverage test (#2399) 2 years ago
Jiarui Fang bc0e271e71
[buider] use builder() for cpu adam and fused optim in setup.py (#2187) 2 years ago
Frank Lee 81e0da7fa8
[setup] supported conda-installed torch (#2048) 2 years ago
Jiarui Fang 6fa71d65d3
[fx] skip diffusers unitest if it is not installed (#1799) 2 years ago
Super Daniel 5ea89f6456
[CI] downgrade fbgemm. (#1778) 2 years ago
oahzxl 25952b67d7
[feat] add flash attention (#1762) 2 years ago
Super Daniel b893342f95
[fx] test tracer on diffuser modules. (#1750) 2 years ago
Jiarui Fang 504419d261
[FAW] add cache manager for the cached embedding (#1419) 2 years ago
Super Daniel be229217ce
[fx] add torchaudio test (#1369) 2 years ago
Boyuan Yao bb640ec728
[fx] Add colotracer compatibility test on torchrec (#1370) 2 years ago
Frank Lee b2475d8c5c
[fx] fixed unit tests for torch 1.12 (#1327) 2 years ago
YuliangLiu0306 9feff0f760
[titans]remove model zoo (#1042) 3 years ago
Frank Lee cf6d1c9284
[CLI] refactored the launch CLI and fixed bugs in multi-node launching (#844) 3 years ago