Mirror of https://github.com/hpcaitech/ColossalAI
Latest commit: f911d5b09d

* create shardformer doc files
* add docstring for seq-parallel
* update ShardConfig docstring
* add links to llama example
* add outdated message
* finish introduction & supporting information
* finish 'how shardformer works'
* finish shardformer.md English doc
* fix doctest fail
* add Chinese document
- 1D_tensor_parallel.md
- 2D_tensor_parallel.md
- 2p5D_tensor_parallel.md
- 3D_tensor_parallel.md
- cluster_utils.md
- gradient_accumulation.md
- gradient_accumulation_with_booster.md
- gradient_clipping.md
- gradient_clipping_with_booster.md
- gradient_handler.md
- mixed_precision_training.md
- mixed_precision_training_with_booster.md
- nvme_offload.md
- pipeline_parallel.md
- shardformer.md
- zero_with_chunk.md