..
1D_tensor_parallel.md
[doc] clean up outdated docs ( #4765 )
2023-09-21 11:36:20 +08:00
2D_tensor_parallel.md
[doc] clean up outdated docs ( #4765 )
2023-09-21 11:36:20 +08:00
2p5D_tensor_parallel.md
[doc] clean up outdated docs ( #4765 )
2023-09-21 11:36:20 +08:00
3D_tensor_parallel.md
[doc] clean up outdated docs ( #4765 )
2023-09-21 11:36:20 +08:00
cluster_utils.md
[doc] add booster docstring and fix autodoc ( #3789 )
2023-05-22 10:56:47 +08:00
gradient_accumulation_with_booster.md
[gemini] support gradient accumulation ( #4869 )
2023-10-17 14:07:21 +08:00
gradient_clipping_with_booster.md
[doc] clean up outdated docs ( #4765 )
2023-09-21 11:36:20 +08:00
lazy_init.md
fix typo change lazy_iniy to lazy_init ( #5099 )
2023-11-24 19:15:59 +08:00
mixed_precision_training_with_booster.md
[doc] clean up outdated docs ( #4765 )
2023-09-21 11:36:20 +08:00
nvme_offload.md
[doc] update nvme offload documents. ( #3850 )
2023-05-26 01:22:01 +08:00
pipeline_parallel.md
[hotfix] set return_outputs=False in examples and polish code ( #5404 )
2024-03-25 12:31:09 +08:00
shardformer.md
[hotfix] set return_outputs=False in examples and polish code ( #5404 )
2024-03-25 12:31:09 +08:00
zero_with_chunk.md
[nfc] fix typo and author name ( #5089 )
2023-11-22 10:39:01 +08:00