Commit Graph

192 Commits (main)

Author SHA1 Message Date
flybird11111 6a21f96a87
[doc] update advanced tutorials, training gpt with hybrid parallelism (#4866)
1 year ago
Zhongkai Zhao db40e086c8 [test] modify model supporting part of low_level_zero plugin (including correspoding docs)
1 year ago
binmakeswell 822051d888
[doc] update slack link (#4823)
1 year ago
Hongxin Liu da15fdb9ca
[doc] add lazy init docs (#4808)
1 year ago
Baizhou Zhang 64a08b2dc3
[checkpointio] support unsharded checkpointIO for hybrid parallel (#4774)
1 year ago
Baizhou Zhang a2db75546d
[doc] polish shardformer doc (#4779)
1 year ago
binmakeswell d512a4d38d
[doc] add llama2 domain-specific solution news (#4789)
1 year ago
Baizhou Zhang 493a5efeab
[doc] add shardformer doc to sidebar (#4768)
1 year ago
Hongxin Liu 66f3926019
[doc] clean up outdated docs (#4765)
1 year ago
Pengtai Xu 4d7537ba25 [doc] put native colossalai plugins first in description section
1 year ago
Pengtai Xu e10d9f087e [doc] add model examples for each plugin
1 year ago
Pengtai Xu a04337bfc3 [doc] put individual plugin explanation in front
1 year ago
Pengtai Xu 10513f203c [doc] explain suitable use case for each plugin
1 year ago
Hongxin Liu b5f9e37c70
[legacy] clean up legacy code (#4743)
1 year ago
Baizhou Zhang d151dcab74
[doc] explaination of loading large pretrained models (#4741)
1 year ago
Baizhou Zhang 451c3465fb
[doc] polish shardformer doc (#4735)
1 year ago
Bin Jia 6a03c933a0
[shardformer] update seq parallel document (#4730)
1 year ago
flybird11111 46162632e5
[shardformer] update pipeline parallel document (#4725)
1 year ago
Baizhou Zhang 50e5602c2d
[doc] add shardformer support matrix/update tensor parallel documents (#4728)
1 year ago
github-actions[bot] 8c2dda7410
[format] applied code formatting on changed files in pull request 4726 (#4727)
1 year ago
Baizhou Zhang f911d5b09d
[doc] Add user document for Shardformer (#4702)
1 year ago
binmakeswell ce97790ed7
[doc] fix llama2 code link (#4726)
1 year ago
Baizhou Zhang 1d454733c4
[doc] Update booster user documents. (#4669)
1 year ago
Hongxin Liu 554aa9592e
[legacy] move communication and nn to legacy and refactor logger (#4671)
1 year ago
Hongxin Liu ac178ca5c1 [legacy] move builder and registry to legacy (#4603)
1 year ago
Hongxin Liu 8accecd55b [legacy] move engine to legacy (#4560)
1 year ago
Hongxin Liu 89fe027787 [legacy] move trainer to legacy (#4545)
1 year ago
binmakeswell 7a978eb3d0
[DOC] hotfix/llama2news (#4595)
1 year ago
Hongxin Liu 27061426f7
[gemini] improve compatibility and add static placement policy (#4479)
1 year ago
binmakeswell 089c365fa0
[doc] add Series A Funding and NeurIPS news (#4377)
1 year ago
flybird1111 f40b718959
[doc] Fix gradient accumulation doc. (#4349)
1 year ago
Baizhou Zhang c6f6005990
[checkpointio] Sharded Optimizer Checkpoint for Gemini Plugin (#4302)
1 year ago
binmakeswell 7ff11b5537
[example] add llama pretraining (#4257)
1 year ago
Jianghai 711e2b4c00
[doc] update and revise some typos and errs in docs (#4107)
1 year ago
digger yu 769cddcb2c
fix typo docs/ (#4033)
1 year ago
Baizhou Zhang 4da324cd60
[hotfix]fix argument naming in docs and examples (#4083)
1 year ago
Frank Lee ddcf58cacf
Revert "[sync] sync feature/shardformer with develop"
1 year ago
FoolPlayer 24651fdd4f
Merge pull request #3931 from FrankLeeeee/sync/develop-to-shardformer
1 year ago
digger yu 33eef714db
fix typo examples and docs (#3932)
1 year ago
Hongxin Liu 12c90db3f3
[doc] add lazy init tutorial (#3922)
1 year ago
Baizhou Zhang c1535ccbba
[doc] fix docs about booster api usage (#3898)
1 year ago
jiangmingyan 07cb21142f
[doc]update moe chinese document. (#3890)
1 year ago
jiangmingyan 281b33f362
[doc] update document of zero with chunk. (#3855)
2 years ago
jiangmingyan b0474878bf
[doc] update nvme offload documents. (#3850)
2 years ago
jiangmingyan a64df3fa97
[doc] update document of gemini instruction. (#3842)
2 years ago
Frank Lee 54e97ed7ea
[workflow] supported test on CUDA 10.2 (#3841)
2 years ago
wukong1992 3229f93e30
[booster] add warning for torch fsdp plugin doc (#3833)
2 years ago
digger yu 518b31c059
[docs] change placememt_policy to placement_policy (#3829)
2 years ago
digger yu e90fdb1000 fix typo docs/
2 years ago
jiangmingyan 725365f297
Merge pull request #3810 from jiangmingyan/amp
2 years ago
jiangmingyan 278fcbc444 [doc]fix
2 years ago
jiangmingyan 8aa1fb2c7f [doc]fix
2 years ago
Hongxin Liu 19d153057e
[doc] add warning about fsdp plugin (#3813)
2 years ago
jiangmingyan c425a69d52 [doc] add removed change of config.py
2 years ago
jiangmingyan 75272ef37b [doc] add removed warning
2 years ago
Mingyan Jiang a520610bd9 [doc] update amp document
2 years ago
Mingyan Jiang 1167bf5b10 [doc] update amp document
2 years ago
Mingyan Jiang 8c62e50dbb [doc] update amp document
2 years ago
jiangmingyan ef02d7ef6d
[doc] update gradient accumulation (#3771)
2 years ago
github-actions[bot] 62c7e67f9f
[format] applied code formatting on changed files in pull request 3786 (#3787)
2 years ago
jiangmingyan fe1561a884
[doc] update gradient cliping document (#3778)
2 years ago
Yanjia0 d9393b85f1
[doc] add deprecated warning on doc Basics section (#3754)
2 years ago
Hongxin Liu 72688adb2f
[doc] add booster docstring and fix autodoc (#3789)
2 years ago
Hongxin Liu 60e6a154bc
[doc] add tutorial for booster checkpoint (#3785)
2 years ago
binmakeswell ad2cf58f50
[chat] add performance and tutorial (#3786)
2 years ago
Hongxin Liu 21e29e2212
[doc] add tutorial for booster plugins (#3758)
2 years ago
Hongxin Liu 5ce6c9d86f
[doc] add tutorial for cluster utils (#3763)
2 years ago
jiangmingyan 48bd056761
[doc] update hybrid parallelism doc (#3770)
2 years ago
jiangmingyan d449525acf
[doc] update booster tutorials (#3718)
2 years ago
Hongxin Liu 5dd573c6b6
[devops] fix ci for document check (#3751)
2 years ago
digger-yu b9a8dff7e5
[doc] Fix typo under colossalai and doc(#3618)
2 years ago
digger-yu 9edeadfb24
[doc] Update 1D_tensor_parallel.md (#3573)
2 years ago
digger-yu 1c7734bc94
[doc] Update 1D_tensor_parallel.md (#3563)
2 years ago
digger-yu a3ac48ef3d
[doc] Update README-zh-Hans.md (#3541)
2 years ago
binmakeswell 0c0455700f
[doc] add requirement and highlight application (#3516)
2 years ago
Frank Lee 4e9989344d
[doc] updated contributor list (#3474)
2 years ago
Frank Lee 80eba05b0a
[test] refactor tests with spawn (#3452)
2 years ago
ver217 26b7aac0be
[zero] reorganize zero/gemini folder structure (#3424)
2 years ago
binmakeswell 15a74da79c
[doc] add Intel cooperation news (#3333)
2 years ago
binmakeswell 31c78f2be3
[doc] add ColossalChat news (#3304)
2 years ago
binmakeswell 682af61396
[doc] add ColossalChat (#3297)
2 years ago
Saurav Maheshkar 20d1c99444
[refactor] update docs (#3174)
2 years ago
Frank Lee 3213347b49
[doc] fixed typos in docs/README.md (#3082)
2 years ago
Frank Lee 416a50dbd7
[doc] moved doc test command to bottom (#3075)
2 years ago
Frank Lee ea0b52c12e
[doc] specified operating system requirement (#3019)
2 years ago
ver217 378d827c6b
[doc] update nvme offload doc (#3014)
2 years ago
Frank Lee 8fedc8766a
[workflow] supported conda package installation in doc test (#3028)
2 years ago
Frank Lee e0a1c1321c
[doc] added reference to related works (#2994)
2 years ago
github-actions[bot] dca98937f8
[format] applied code formatting on changed files in pull request 2933 (#2939)
2 years ago
binmakeswell 8264cd7ef1
[doc] add env scope (#2933)
2 years ago
Frank Lee b8804aa60c
[doc] added readme for documentation (#2935)
2 years ago
Frank Lee 9e3b8b7aff
[doc] removed read-the-docs (#2932)
2 years ago
Frank Lee 77b88a3849
[workflow] added auto doc test on PR (#2929)
2 years ago
binmakeswell 0afb55fc5b
[doc] add os scope, update tutorial install and tips (#2914)
2 years ago
YuliangLiu0306 cf6409dd40
Hotfix/auto parallel zh doc (#2820)
2 years ago
YuliangLiu0306 2059fdd6b0
[hotfix] add copyright for solver and device mesh (#2803)
2 years ago
Frank Lee e376954305
[doc] add opt service doc (#2747)
2 years ago
Frank Lee 5479fdd5b8
[doc] updated documentation version list (#2730)
2 years ago
Frank Lee 2045d45ab7
[doc] updated documentation version list (#2715)
2 years ago
Frank Lee 0966008839
[dooc] fixed the sidebar itemm key (#2672)
2 years ago