common_utils | [colotensor] add Tensor.view op and its unit test (#1343) | 2022-07-21 10:53:15 +08:00
core | [colotensor] add Tensor.view op and its unit test (#1343) | 2022-07-21 10:53:15 +08:00
model | [hotfix] fix megatron_init in test_gpt2.py (#1357) | 2022-07-25 10:28:19 +08:00
test_chunk.py | [hotfix] ZeroDDP use new process group (#1333) | 2022-07-18 14:14:52 +08:00
test_colo_checkpoint_tools.py | [colotensor] use cpu memory to store state_dict (#1367) | 2022-07-26 14:13:38 +08:00
test_comm_spec_apply.py | [tensor] use communication autograd func (#1617) | 2022-09-23 13:31:15 +08:00
test_context.py | [refactory] add nn.parallel module (#1068) | 2022-06-06 15:34:41 +08:00
test_parameter.py | [refactor] refactor ColoTensor's unit tests (#1340) | 2022-07-19 15:46:24 +08:00
test_shape_consistency.py | [tensor] use communication autograd func (#1617) | 2022-09-23 13:31:15 +08:00
test_shape_consistency_apply.py | [tensor]add 1D device mesh (#1492) | 2022-08-25 16:48:12 +08:00
test_sharded_linear.py | [tensor] added linear implementation for the new sharding spec (#1416) | 2022-08-12 11:33:09 +08:00
test_sharding_spec.py | [tensor]build sharding spec to replace distspec in future. (#1405) | 2022-08-08 11:15:57 +08:00
test_zero_optim.py | [unit test] add megatron init test in zero_optim (#1358) | 2022-07-25 11:18:08 +08:00