.. |
components_to_test
|
[zero] new interface for ShardedOptimv2 (#406)
|
2022-03-14 20:48:41 +08:00 |
test_amp
|
[polish] use GLOBAL_MODEL_DATA_TRACER (#417)
|
2022-03-15 11:29:46 +08:00 |
test_comm
|
Hotfix/Colossalai layers (#92)
|
2021-12-29 23:32:10 +08:00 |
test_config
|
[profiler] primary memory tracer
|
2022-03-11 15:50:28 +08:00 |
test_context
|
Optimize pipeline schedule (#94)
|
2021-12-30 15:56:46 +08:00 |
test_data
|
added CI for unit testing (#69)
|
2021-12-16 10:32:08 +08:00 |
test_data_pipeline_tensor_parallel
|
Optimize pipeline schedule (#94)
|
2021-12-30 15:56:46 +08:00 |
test_engine
|
[zero] new interface for ShardedOptimv2 (#406)
|
2022-03-14 20:48:41 +08:00 |
test_layers
|
fixed padding index issue for vocab parallel embedding layers; updated 3D linear to be compatible with examples in the tutorial
|
2022-03-11 15:50:28 +08:00 |
test_moe
|
Added TPExpert for special situation
|
2022-03-11 15:50:28 +08:00 |
test_optimizer
|
[zero] cpu adam kernel (#288)
|
2022-03-11 15:50:28 +08:00 |
test_trainer
|
[zero] new interface for ShardedOptimv2 (#406)
|
2022-03-14 20:48:41 +08:00 |
test_utils
|
[zero] memtracer to record cuda memory usage of model data and overall system (#395)
|
2022-03-14 22:05:30 +08:00 |
test_zero_data_parallel
|
[zero] cuda margin space for OS (#418)
|
2022-03-15 12:02:19 +08:00 |
test_zero_tensor_parallel
|
Feature/zero (#279)
|
2022-03-11 15:50:28 +08:00 |
__init__.py
|
[zero] Update sharded model v2 using sharded param v2 (#323)
|
2022-03-11 15:50:28 +08:00 |