.. |
components_to_test
|
[zero] adapt zero for unsharded paramters (Optimizer part) (#601)
|
2022-04-01 20:10:47 +08:00 |
test_amp
|
[test] refactored with the new rerun decorator (#763)
|
2022-04-15 00:33:04 +08:00 |
test_comm
|
[test] refactored with the new rerun decorator (#763)
|
2022-04-15 00:33:04 +08:00 |
test_config
|
[test] removed trivial outdated test
|
2022-04-12 11:08:15 +08:00 |
test_context
|
[test] refactored with the new rerun decorator (#763)
|
2022-04-15 00:33:04 +08:00 |
test_data
|
[refactor] moving memtracer to gemini (#801)
|
2022-04-19 10:13:08 +08:00 |
test_data_pipeline_tensor_parallel
|
[refactor] moving grad acc logic to engine (#804)
|
2022-04-19 14:03:21 +08:00 |
test_engine
|
[refactor] moving grad acc logic to engine (#804)
|
2022-04-19 14:03:21 +08:00 |
test_gemini
|
[refactor] moving memtracer to gemini (#801)
|
2022-04-19 10:13:08 +08:00 |
test_layers
|
[test] refactored with the new rerun decorator (#763)
|
2022-04-15 00:33:04 +08:00 |
test_moe
|
[test] refactored with the new rerun decorator (#763)
|
2022-04-15 00:33:04 +08:00 |
test_optimizer
|
[zero]added hybrid adam, removed loss scale in adam (#527)
|
2022-03-25 18:03:54 +08:00 |
test_trainer
|
[test] refactored with the new rerun decorator (#763)
|
2022-04-15 00:33:04 +08:00 |
test_utils
|
[refactor] moving grad acc logic to engine (#804)
|
2022-04-19 14:03:21 +08:00 |
test_zero
|
[zero] add ZeroTensorShardStrategy (#793)
|
2022-04-19 14:32:45 +08:00 |
__init__.py
|
[zero] Update sharded model v2 using sharded param v2 (#323)
|
2022-03-11 15:50:28 +08:00 |