Making large AI models cheaper, faster and more accessible
傅剑寒 bfad39357b
[Inference/Feat] Add quant kvcache interface (#5700)
7 months ago
_C
_analyzer
accelerator
amp
auto_parallel [misc] refactor launch API and tensor constructor (#5666) 7 months ago
autochunk
booster [misc] refactor launch API and tensor constructor (#5666) 7 months ago
checkpoint_io [lora] add lora APIs for booster, support lora for TorchDDP (#4981) 7 months ago
cli
cluster
context [Fix]: implement thread-safety singleton to avoid deadlock for very large-scale training scenarios (#5625) 7 months ago
device
fx
inference [Inference/Feat] Add quant kvcache interface (#5700) 7 months ago
interface
kernel resolve rebase conflicts on Branch feat/online-serving 7 months ago
lazy
legacy [sync] resolve conflicts of merging main 7 months ago
logging
moe
nn [misc] refactor launch API and tensor constructor (#5666) 7 months ago
pipeline [LowLevelZero] low level zero support lora (#5153) 7 months ago
quantization [Feature] qlora support (#5586) 7 months ago
shardformer [Inference] Fix bugs and docs for feat/online-server (#5598) 7 months ago
tensor [misc] refactor launch API and tensor constructor (#5666) 7 months ago
testing
utils
zero [Feature] qlora support (#5586) 7 months ago
__init__.py
initialize.py [misc] refactor launch API and tensor constructor (#5666) 7 months ago