179 Commits (363cde695709d8126334f2a6acbe26bcdfbfbdcd)

Author SHA1 Message Date
Edenzzzz 5f8c0a0ac3
[Feature] auto-cast optimizers to distributed version (#5746) 6 months ago
binmakeswell 4647ec28c8
[inference] release (#5747) 6 months ago
binmakeswell 2011b1356a
[misc] Update PyTorch version in docs (#5724) 6 months ago
Edenzzzz 43995ee436
[Feature] Distributed optimizers: Lamb, Galore, CAME and Adafactor (#5694) 6 months ago
Edenzzzz 785cd9a9c9
[misc] Update PyTorch version in docs (#5711) 6 months ago
Hongxin Liu 7f8b16635b
[misc] refactor launch API and tensor constructor (#5666) 7 months ago
binmakeswell b8a711aa2d
[news] llama3 and open-sora v1.1 (#5655) 7 months ago
Hongxin Liu bbb2c21f16
[shardformer] fix chatglm implementation (#5644) 7 months ago
binmakeswell f4c5aafe29
[example] llama3 (#5631) 7 months ago
Hongxin Liu 641b1ee71a
[devops] remove post commit ci (#5566) 8 months ago
binmakeswell 34e909256c
[release] grok-1 inference benchmark (#5500) 8 months ago
Wenhao Chen bb0a668fee
[hotfix] set return_outputs=False in examples and polish code (#5404) 8 months ago
binmakeswell 6df844b8c4
[release] grok-1 314b inference (#5490) 8 months ago
binmakeswell d158fc0e64
[doc] update open-sora demo (#5479) 8 months ago
binmakeswell bd998ced03
[doc] release Open-Sora 1.0 with model weights (#5468) 8 months ago
digger yu 70cce5cbed
[doc] update some translations with README-zh-Hans.md (#5382) 9 months ago
Hongxin Liu 070df689e6
[devops] fix extention building (#5427) 9 months ago
binmakeswell 822241a99c
[doc] sora release (#5425) 9 months ago
binmakeswell a1c6cdb189 [doc] fix blog link 9 months ago
Frank Lee 705a62a565
[doc] updated installation command (#5389) 9 months ago
yixiaoer 69e3ad01ed
[doc] Fix typo (#5361) 9 months ago
digger yu bce9499ed3
fix some typo (#5307) 10 months ago
Hongxin Liu d202cc28c0
[npu] change device to accelerator api (#5239) 11 months ago
binmakeswell 7bc6969ce6
[doc] SwiftInfer release (#5236) 11 months ago
binmakeswell b9b32b15e6
[doc] add Colossal-LLaMA-2-13B (#5234) 11 months ago
flybird11111 681d9b12ef
[doc] update pytorch version in documents. (#5177) 11 months ago
binmakeswell 177c79f2d1
[doc] add moe news (#5128) 1 year ago
Wenhao Chen 7172459e74
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 1 year ago
digger yu d5661f0f25
[nfc] fix typo change directoty to directory (#5111) 1 year ago
digger yu 2bdf76f1f2
fix typo change lazy_iniy to lazy_init (#5099) 1 year ago
digger yu 0d482302a1
[nfc] fix typo and author name (#5089) 1 year ago
digger yu fd3567e089
[nfc] fix typo in docs/ (#4972) 1 year ago
ppt0011 335cb105e2
[doc] add supported feature diagram for hybrid parallel plugin (#4996) 1 year ago
digger yu 11009103be
[nfc] fix some typo with colossalai/ docs/ etc. (#4920) 1 year ago
Baizhou Zhang 21ba89cab6
[gemini] support gradient accumulation (#4869) 1 year ago
flybird11111 6a21f96a87
[doc] update advanced tutorials, training gpt with hybrid parallelism (#4866) 1 year ago
Zhongkai Zhao db40e086c8 [test] modify model supporting part of low_level_zero plugin (including correspoding docs) 1 year ago
binmakeswell 822051d888
[doc] update slack link (#4823) 1 year ago
Hongxin Liu da15fdb9ca
[doc] add lazy init docs (#4808) 1 year ago
Baizhou Zhang 64a08b2dc3
[checkpointio] support unsharded checkpointIO for hybrid parallel (#4774) 1 year ago
Baizhou Zhang a2db75546d
[doc] polish shardformer doc (#4779) 1 year ago
binmakeswell d512a4d38d
[doc] add llama2 domain-specific solution news (#4789) 1 year ago
Baizhou Zhang 493a5efeab
[doc] add shardformer doc to sidebar (#4768) 1 year ago
Hongxin Liu 66f3926019
[doc] clean up outdated docs (#4765) 1 year ago
Pengtai Xu 4d7537ba25 [doc] put native colossalai plugins first in description section 1 year ago
Pengtai Xu e10d9f087e [doc] add model examples for each plugin 1 year ago
Pengtai Xu a04337bfc3 [doc] put individual plugin explanation in front 1 year ago
Pengtai Xu 10513f203c [doc] explain suitable use case for each plugin 1 year ago
Hongxin Liu b5f9e37c70
[legacy] clean up legacy code (#4743) 1 year ago
Baizhou Zhang d151dcab74
[doc] explaination of loading large pretrained models (#4741) 1 year ago