..
1D_tensor_parallel.md | [doc] add shardformer support matrix/update tensor parallel documents (#4728) | 2023-09-15 13:52:30 +08:00
2D_tensor_parallel.md | [doc] add shardformer support matrix/update tensor parallel documents (#4728) | 2023-09-15 13:52:30 +08:00
2p5D_tensor_parallel.md | [doc] add shardformer support matrix/update tensor parallel documents (#4728) | 2023-09-15 13:52:30 +08:00
3D_tensor_parallel.md | [doc] add shardformer support matrix/update tensor parallel documents (#4728) | 2023-09-15 13:52:30 +08:00
cluster_utils.md | [doc] add booster docstring and fix autodoc (#3789) | 2023-05-22 10:56:47 +08:00
gradient_accumulation.md | [doc] update gradient accumulation (#3771) | 2023-05-23 10:52:30 +08:00
gradient_accumulation_with_booster.md | [doc] Fix gradient accumulation doc. (#4349) | 2023-08-04 17:24:35 +08:00
gradient_clipping.md | [doc] update gradient cliping document (#3778) | 2023-05-22 14:13:15 +08:00
gradient_clipping_with_booster.md | fix typo docs/ | 2023-05-24 13:57:43 +08:00
gradient_handler.md | [legacy] move builder and registry to legacy (#4603) | 2023-09-05 21:53:10 +08:00
mixed_precision_training.md | [legacy] move trainer to legacy (#4545) | 2023-09-05 21:53:10 +08:00
mixed_precision_training_with_booster.md | [doc] update and revise some typos and errs in docs (#4107) | 2023-06-28 19:30:37 +08:00
nvme_offload.md | [doc] update nvme offload documents. (#3850) | 2023-05-26 01:22:01 +08:00
pipeline_parallel.md | [shardformer] update pipeline parallel document (#4725) | 2023-09-15 14:32:04 +08:00
shardformer.md | [doc] polish shardformer doc (#4735) | 2023-09-15 17:39:10 +08:00
zero_with_chunk.md | [gemini] improve compatibility and add static placement policy (#4479) | 2023-08-24 09:29:25 +08:00