33 Commits (5b5fbcff09092ccecf54dde05dc6ee25235d98b2)

Author SHA1 Message Date
flybird11111 295dd2d9fe
[zerobubble] rebase main (#6075) 2 months ago
Wenxuan Tan d383449fc4
[CI] Remove triton version for compatibility bug; update req torch >=2.2 (#6018) 3 months ago
Wang Binluo eea37da6fa
[fp8] Merge feature/fp8_comm to main branch of Colossalai (#6016) 3 months ago
Hongxin Liu 26493b97d3
[misc] update compatibility (#6008) 3 months ago
flybird11111 0c10afd372
[FP8] rebase main (#5963) 4 months ago
Hongxin Liu 27a72f0de1 [misc] support torch2.3 (#5893) 4 months ago
Runyu Lu cba20525a8
[Feat] Diffusion Model(PixArtAlpha/StableDiffusion3) Support (#5838) 5 months ago
flybird11111 2ddf624a86
[shardformer] upgrade transformers to 4.39.3 (#5815) 5 months ago
Hongxin Liu 5ead00ffc5
[misc] update requirements (#5787) 6 months ago
Jianghai 85946d4236
[Inference]Fix readme and example for API server (#5742) 6 months ago
Edenzzzz 43995ee436
[Feature] Distributed optimizers: Lamb, Galore, CAME and Adafactor (#5694) 6 months ago
Runyu Lu 18d67d0e8e
[Feat]Inference RPC Server Support (#5705) 6 months ago
linsj20 91fa553775 [Feature] qlora support (#5586) 7 months ago
flybird11111 8954a0c2e2 [LowLevelZero] low level zero support lora (#5153) 7 months ago
Wang Binluo 0d0a582033
[shardformer] update transformers (#5583) 7 months ago
Hongxin Liu 3788fefc7a
[zero] support multiple (partial) backward passes (#5596) 7 months ago
Frank Lee 027aa1043f
[doc] updated inference readme (#5343) 10 months ago
Jianghai cf579ff46d
[Inference] Dynamic Batching Inference, online and offline (#4953) 1 year ago
Cuiqing Li 3a41e8304e
[Refactor] Integrated some lightllm kernels into token-attention (#4946) 1 year ago
ver217 922302263b [misc] update requirements 1 year ago
flybird1111 458ae331ad
[kernel] updated unittests for coloattention (#4389) 1 year ago
binmakeswell 089c365fa0
[doc] add Series A Funding and NeurIPS news (#4377) 1 year ago
Frank Lee 1beb85cc25
[checkpoint] refactored the API and added safetensors support (#3427) 2 years ago
アマデウス e78a1e949a
fix torch 2.0 compatibility (#3346) 2 years ago
CsRic 052b03e83f
limit torch version (#3213) 2 years ago
Frank Lee 93fdd35b5e
[build] fixed the doc build process (#2618) 2 years ago
Jiarui Fang bc0e271e71
[buider] use builder() for cpu adam and fused optim in setup.py (#2187) 2 years ago
Frank Lee 81e0da7fa8
[setup] supported conda-installed torch (#2048) 2 years ago
Jiarui Fang 504419d261
[FAW] add cache manager for the cached embedding (#1419) 2 years ago
Frank Lee cf6d1c9284
[CLI] refactored the launch CLI and fixed bugs in multi-node launching (#844) 3 years ago
Frank Lee 01e9f834f5
[dependency] removed torchvision (#833) 3 years ago
Frank Lee 05d9ae5999
[cli] add missing requirement (#805) 3 years ago
Jiarui Fang 54229cd33e
[log] better logging display with rich (#426) 3 years ago
BoxiangW a2f1565672
Update GitHub action and pre-commit settings (#196) 3 years ago
Frank Lee 3defa32aee
Support TP-compatible Torch AMP and Update trainer API (#27) 3 years ago
zbian 404ecbdcc6 Migrated project 3 years ago