Commit Graph

48 Commits (10a19e22c63aa9963a889874b63c47ccd0e6db42)

Author SHA1 Message Date
Edenzzzz 5f8c0a0ac3
[Feature] auto-cast optimizers to distributed version (#5746)
6 months ago
Edenzzzz 43995ee436
[Feature] Distributed optimizers: Lamb, Galore, CAME and Adafactor (#5694)
7 months ago
Hongxin Liu 7f8b16635b
[misc] refactor launch API and tensor constructor (#5666)
7 months ago
Hongxin Liu bbb2c21f16
[shardformer] fix chatglm implementation (#5644)
7 months ago
Wenhao Chen bb0a668fee
[hotfix] set return_outputs=False in examples and polish code (#5404)
8 months ago
Wenhao Chen 7172459e74
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088)
1 year ago
digger yu 2bdf76f1f2
fix typo change lazy_iniy to lazy_init (#5099)
1 year ago
digger yu 0d482302a1
[nfc] fix typo and author name (#5089)
1 year ago
Baizhou Zhang 21ba89cab6
[gemini] support gradient accumulation (#4869)
1 year ago
Hongxin Liu da15fdb9ca
[doc] add lazy init docs (#4808)
1 year ago
Baizhou Zhang a2db75546d
[doc] polish shardformer doc (#4779)
1 year ago
Hongxin Liu 66f3926019
[doc] clean up outdated docs (#4765)
1 year ago
Baizhou Zhang 451c3465fb
[doc] polish shardformer doc (#4735)
1 year ago
Bin Jia 6a03c933a0
[shardformer] update seq parallel document (#4730)
1 year ago
flybird11111 46162632e5
[shardformer] update pipeline parallel document (#4725)
1 year ago
Baizhou Zhang 50e5602c2d
[doc] add shardformer support matrix/update tensor parallel documents (#4728)
1 year ago
Baizhou Zhang f911d5b09d
[doc] Add user document for Shardformer (#4702)
1 year ago
Hongxin Liu ac178ca5c1 [legacy] move builder and registry to legacy (#4603)
1 year ago
Hongxin Liu 8accecd55b [legacy] move engine to legacy (#4560)
1 year ago
Hongxin Liu 89fe027787 [legacy] move trainer to legacy (#4545)
1 year ago
Hongxin Liu 27061426f7
[gemini] improve compatibility and add static placement policy (#4479)
1 year ago
flybird1111 f40b718959
[doc] Fix gradient accumulation doc. (#4349)
1 year ago
Jianghai 711e2b4c00
[doc] update and revise some typos and errs in docs (#4107)
1 year ago
Baizhou Zhang 4da324cd60
[hotfix]fix argument naming in docs and examples (#4083)
1 year ago
Frank Lee ddcf58cacf
Revert "[sync] sync feature/shardformer with develop"
1 year ago
Hongxin Liu 12c90db3f3
[doc] add lazy init tutorial (#3922)
2 years ago
Baizhou Zhang c1535ccbba
[doc] fix docs about booster api usage (#3898)
2 years ago
jiangmingyan 281b33f362
[doc] update document of zero with chunk. (#3855)
2 years ago
jiangmingyan b0474878bf
[doc] update nvme offload documents. (#3850)
2 years ago
digger yu 518b31c059
[docs] change placememt_policy to placement_policy (#3829)
2 years ago
digger yu e90fdb1000 fix typo docs/
2 years ago
jiangmingyan 278fcbc444 [doc]fix
2 years ago
jiangmingyan 8aa1fb2c7f [doc]fix
2 years ago
jiangmingyan 75272ef37b [doc] add removed warning
2 years ago
Mingyan Jiang a520610bd9 [doc] update amp document
2 years ago
Mingyan Jiang 8c62e50dbb [doc] update amp document
2 years ago
jiangmingyan ef02d7ef6d
[doc] update gradient accumulation (#3771)
2 years ago
jiangmingyan fe1561a884
[doc] update gradient cliping document (#3778)
2 years ago
Hongxin Liu 72688adb2f
[doc] add booster docstring and fix autodoc (#3789)
2 years ago
Hongxin Liu 5ce6c9d86f
[doc] add tutorial for cluster utils (#3763)
2 years ago
jiangmingyan 48bd056761
[doc] update hybrid parallelism doc (#3770)
2 years ago
digger-yu b9a8dff7e5
[doc] Fix typo under colossalai and doc(#3618)
2 years ago
digger-yu 9edeadfb24
[doc] Update 1D_tensor_parallel.md (#3573)
2 years ago
ver217 26b7aac0be
[zero] reorganize zero/gemini folder structure (#3424)
2 years ago
Frank Lee 416a50dbd7
[doc] moved doc test command to bottom (#3075)
2 years ago
ver217 378d827c6b
[doc] update nvme offload doc (#3014)
2 years ago
Frank Lee e0a1c1321c
[doc] added reference to related works (#2994)
2 years ago
Frank Lee 85b2303b55
[doc] migrate the markdown files (#2652)
2 years ago