ColossalAI/tests/test_pipeline
Kirigaya Kazuto 9145aef2b4
[pipeline/rpc] implement distributed optimizer | test with assert_close (#1486)
* support p2p communication with any type of object | pass test

* reconstruct pipeline schedule with p2p_v2.py (support communication with List[Any]) | pass test

* [engine/schedule] use p2p_v2 to reconstruct pipeline_schedule

* [pipeline/rpc] implement a demo for PP with cuda rpc framework

* [pipeline/rpc] support interleaving | fix checkpoint bug | change logic when dispatching data in work_list to ensure steady 1F1B

* [pipeline/rpc] implement distributed optimizer | test with assert_close
2022-08-25 10:49:01 +08:00
rpc_test_utils.py [pipeline/rpc] implement distributed optimizer | test with assert_close (#1486) 2022-08-25 10:49:01 +08:00
test_cuda_rpc_optimizer.py [pipeline/rpc] implement distributed optimizer | test with assert_close (#1486) 2022-08-25 10:49:01 +08:00
test_cuda_rpc_pipeline.py [pipeline/rpc] implement distributed optimizer | test with assert_close (#1486) 2022-08-25 10:49:01 +08:00
test_cuda_rpc_value_correctness.py [pipeline/rpc] implement distributed optimizer | test with assert_close (#1486) 2022-08-25 10:49:01 +08:00
test_pipelinable.py [pipeline] refactor the pipeline module (#1087) 2022-06-10 11:27:38 +08:00