ColossalAI/tests
Baizhou Zhang 44eab2b27f
[shardformer] support sharded checkpoint IO for models of HybridParallelPlugin (#4506)
* add APIs

* implement save_sharded_model

* add test for hybrid checkpointio

* implement naive loading for sharded model

* implement efficient sharded model loading

* open a new file for hybrid checkpoint_io

* small fix

* fix circular importing

* fix docstring

* arrange arguments and apis

* small fix
2023-08-25 22:04:57 +08:00
..
components_to_test [CI] fix typo with tests/ etc. (#3727) 2023-05-11 16:30:58 +08:00
kit [shardformer] chatglm support sequence parallel (#4482) 2023-08-22 23:59:31 +08:00
test_amp [test] refactor tests with spawn (#3452) 2023-04-06 14:51:35 +08:00
test_analyzer [devops] update torch version of CI (#3725) 2023-05-15 17:20:56 +08:00
test_auto_parallel [gemini] fix argument naming during chunk configuration searching 2023-06-25 13:34:15 +08:00
test_autochunk [test] fixed tests failed due to dtensor change (#4082) 2023-07-04 16:05:01 +08:00
test_booster [Shardformer] Merge flash attention branch to pipeline branch (#4362) 2023-08-15 23:25:14 +08:00
test_checkpoint_io [shardformer] support sharded checkpoint IO for models of HybridParallelPlugin (#4506) 2023-08-25 22:04:57 +08:00
test_cluster [cluster] add process group mesh (#4039) 2023-08-15 23:25:14 +08:00
test_comm [test] refactor tests with spawn (#3452) 2023-04-06 14:51:35 +08:00
test_config [devops] add large-scale distributed test marker (#4452) 2023-08-16 18:56:52 +08:00
test_context [devops] add large-scale distributed test marker (#4452) 2023-08-16 18:56:52 +08:00
test_data [devops] add large-scale distributed test marker (#4452) 2023-08-16 18:56:52 +08:00
test_data_pipeline_tensor_parallel [CI] fix typo with tests/ etc. (#3727) 2023-05-11 16:30:58 +08:00
test_ddp [test] refactor tests with spawn (#3452) 2023-04-06 14:51:35 +08:00
test_device [format] applied code formatting on changed files in pull request 4152 (#4157) 2023-07-04 16:07:47 +08:00
test_engine [test] refactor tests with spawn (#3452) 2023-04-06 14:51:35 +08:00
test_fx [misc] resolve code factor issues (#4433) 2023-08-15 23:25:14 +08:00
test_kernels [Kernels] added triton-implemented of self attention for colossal-ai (#4241) 2023-07-18 23:53:38 +08:00
test_layers [CI] fix typo with tests/ etc. (#3727) 2023-05-11 16:30:58 +08:00
test_lazy [test] skip some not compatible models 2023-08-15 23:25:14 +08:00
test_moe [CI] fix typo with tests/ etc. (#3727) 2023-05-11 16:30:58 +08:00
test_ops [test] refactor tests with spawn (#3452) 2023-04-06 14:51:35 +08:00
test_optimizer [bf16] add bf16 support (#3882) 2023-06-05 15:58:31 +08:00
test_pipeline [shardformer] Pipeline/whisper (#4456) 2023-08-18 21:29:25 +08:00
test_shardformer [shardformer] opt fix. (#4514) 2023-08-25 19:41:24 +08:00
test_tensor [test] fixed tests failed due to dtensor change (#4082) 2023-07-04 16:05:01 +08:00
test_trainer [CI] fix typo with tests/ etc. (#3727) 2023-05-11 16:30:58 +08:00
test_utils [devops] add large-scale distributed test marker (#4452) 2023-08-16 18:56:52 +08:00
test_zero [hotfix] fix unsafe async comm in zero (#4404) 2023-08-11 15:09:24 +08:00
__init__.py