ColossalAI/tests
Xuanlei Zhao f71e63b0f3
[moe] support optimizer checkpoint (#5015)
1 year ago
kit [Inference] Dynamic Batching Inference, online and offline (#4953) 1 year ago
test_analyzer [misc] update pre-commit and run all files (#4752) 1 year ago
test_auto_parallel [misc] update pre-commit and run all files (#4752) 1 year ago
test_autochunk [misc] update pre-commit and run all files (#4752) 1 year ago
test_booster [gemini] support gradient accumulation (#4869) 1 year ago
test_checkpoint_io [hotfix] fix lr scheduler bug in torch 2.0 (#4864) 1 year ago
test_cluster [misc] update pre-commit and run all files (#4752) 1 year ago
test_config [misc] update pre-commit and run all files (#4752) 1 year ago
test_device [misc] update pre-commit and run all files (#4752) 1 year ago
test_fx [misc] update pre-commit and run all files (#4752) 1 year ago
test_gptq [feature] add gptq for inference (#4754) 1 year ago
test_infer [Inference] Fix bug in ChatGLM2 Tensor Parallelism (#5014) 1 year ago
test_infer_ops/triton [moe] merge moe into main (#4978) 1 year ago
test_lazy [lazy] support from_pretrained (#4801) 1 year ago
test_legacy [test] merge old components to test to model zoo (#4945) 1 year ago
test_moe [moe] support optimizer checkpoint (#5015) 1 year ago
test_optimizer [test] merge old components to test to model zoo (#4945) 1 year ago
test_pipeline [misc] update pre-commit and run all files (#4752) 1 year ago
test_shardformer [hotfix] Add layer norm gradients all-reduce for sequence parallel (#4926) 1 year ago
test_smoothquant [inference] Add smmoothquant for llama (#4904) 1 year ago
test_tensor [misc] update pre-commit and run all files (#4752) 1 year ago
test_utils [misc] update pre-commit and run all files (#4752) 1 year ago
test_zero [hotfix] fix grad accumulation plus clipping for gemini (#5002) 1 year ago
__init__.py [zero] Update sharded model v2 using sharded param v2 (#323) 3 years ago