1D_tensor_parallel.md | [doc] clean up outdated docs (#4765) | 2023-09-21 11:36:20 +08:00
2D_tensor_parallel.md | [doc] clean up outdated docs (#4765) | 2023-09-21 11:36:20 +08:00
2p5D_tensor_parallel.md | [doc] clean up outdated docs (#4765) | 2023-09-21 11:36:20 +08:00
3D_tensor_parallel.md | [doc] clean up outdated docs (#4765) | 2023-09-21 11:36:20 +08:00
cluster_utils.md | [doc] add booster docstring and fix autodoc (#3789) | 2023-05-22 10:56:47 +08:00
distributed_optimizers.md | [Feature] auto-cast optimizers to distributed version (#5746) | 2024-05-24 17:24:16 +08:00
gradient_accumulation_with_booster.md | [misc] refactor launch API and tensor constructor (#5666) | 2024-04-29 10:40:11 +08:00
gradient_clipping_with_booster.md | [misc] refactor launch API and tensor constructor (#5666) | 2024-04-29 10:40:11 +08:00
lazy_init.md | [misc] refactor launch API and tensor constructor (#5666) | 2024-04-29 10:40:11 +08:00
mixed_precision_training_with_booster.md | [misc] refactor launch API and tensor constructor (#5666) | 2024-04-29 10:40:11 +08:00
nvme_offload.md | [misc] refactor launch API and tensor constructor (#5666) | 2024-04-29 10:40:11 +08:00
pipeline_parallel.md | [hotfix] set return_outputs=False in examples and polish code (#5404) | 2024-03-25 12:31:09 +08:00
shardformer.md | [shardformer] fix chatglm implementation (#5644) | 2024-04-25 14:41:17 +08:00
zero_with_chunk.md | [misc] refactor launch API and tensor constructor (#5666) | 2024-04-29 10:40:11 +08:00