.. |
kit
|
[workflow] fixed oom tests (#5275)
|
2024-01-16 18:55:13 +08:00 |
test_analyzer
|
[misc] update pre-commit and run all files (#4752)
|
2023-09-19 14:20:26 +08:00 |
test_auto_parallel
|
[npu] change device to accelerator api (#5239)
|
2024-01-09 10:20:05 +08:00 |
test_autochunk
|
[misc] update pre-commit and run all files (#4752)
|
2023-09-19 14:20:26 +08:00 |
test_booster
|
[hotfix] fix 3d plugin test (#5292)
|
2024-01-22 15:19:04 +08:00 |
test_checkpoint_io
|
[feat] refactored extension module (#5298)
|
2024-01-25 17:01:48 +08:00 |
test_cluster
|
[misc] update pre-commit and run all files (#4752)
|
2023-09-19 14:20:26 +08:00 |
test_config
|
[misc] update pre-commit and run all files (#4752)
|
2023-09-19 14:20:26 +08:00 |
test_device
|
[misc] update pre-commit and run all files (#4752)
|
2023-09-19 14:20:26 +08:00 |
test_fx
|
[misc] update pre-commit and run all files (#4752)
|
2023-09-19 14:20:26 +08:00 |
test_gptq
|
[feature] add gptq for inference (#4754)
|
2023-09-22 11:02:50 +08:00 |
test_infer
|
[inference] removed redundancy init_batch (#5353)
|
2024-02-02 11:44:15 +08:00 |
test_infer_ops/triton
|
[Inference]Repalce Attention layer and MLP layer by shardformer to optimize the weight transpose operation,add fused_qkv and fused linear_add (#5340)
|
2024-02-01 15:49:39 +08:00 |
test_lazy
|
[workflow] fixed oom tests (#5275)
|
2024-01-16 18:55:13 +08:00 |
test_legacy
|
[npu] change device to accelerator api (#5239)
|
2024-01-09 10:20:05 +08:00 |
test_moe
|
[npu] change device to accelerator api (#5239)
|
2024-01-09 10:20:05 +08:00 |
test_optimizer
|
[feat] refactored extension module (#5298)
|
2024-01-25 17:01:48 +08:00 |
test_pipeline
|
Merge branch 'main' into sync/npu
|
2024-01-18 12:05:21 +08:00 |
test_shardformer
|
[hotfix] Fix ShardFormer test execution path when using sequence parallelism (#5230)
|
2024-01-17 17:42:29 +08:00 |
test_smoothquant
|
[inference] Add smmoothquant for llama (#4904)
|
2023-10-16 11:28:44 +08:00 |
test_tensor
|
[misc] update pre-commit and run all files (#4752)
|
2023-09-19 14:20:26 +08:00 |
test_utils
|
[feat] refactored extension module (#5298)
|
2024-01-25 17:01:48 +08:00 |
test_zero
|
[npu] change device to accelerator api (#5239)
|
2024-01-09 10:20:05 +08:00 |
__init__.py
|
[zero] Update sharded model v2 using sharded param v2 (#323)
|
2022-03-11 15:50:28 +08:00 |