YuliangLiu0306
|
49216d7ab1
|
[autoparallel] fix bugs caused by negative dim key (#1808)
* [autoparallel] fix bugs caused by negative dim key
* fix import error
* fix matmul test issue
* fix unit test issue
|
2 years ago |
Jiarui Fang
|
556b9b7e1a
|
[hotfix] Dist Mgr gather torch version (#1284)
* make it faster
* [hotfix] torchvison fx tests
* [hotfix] rename duplicated named test_gpt.py
* [hotfix] dist mgr torch version
|
2 years ago |
Jiarui Fang
|
ae7d3f4927
|
[refactor] move process group from _DistSpec to ColoTensor. (#1203)
|
2 years ago |
Jiarui Fang
|
b5f25eb32a
|
[Tensor] add cpu group to ddp (#1200)
|
2 years ago |
Jiarui Fang
|
060b917daf
|
[refactor] remove gpc dependency in colotensor's _ops (#1189)
|
2 years ago |
Jiarui Fang
|
aa7bef73d4
|
[Tensor] distributed view supports inter-process hybrid parallel (#1169)
|
2 years ago |
ver217
|
634eecb98e
|
mark sanity_check of dist_spec_mgr as staticmethod (#1161)
|
2 years ago |
ver217
|
ffa025e120
|
[tensor] dist spec s2s uses all-to-all (#1136)
* dist spec s2s uses all-to-all
* update unit test
* add sanity check
* polish unitest test with titans
* add sanity check for DistMgr
* add sanity check
Co-authored-by: jiaruifang <fangjiarui123@gmail.com>
|
2 years ago |
Jiarui Fang
|
8cdce0399c
|
[ColoTensor] improves init functions. (#1150)
|
2 years ago |
Jiarui Fang
|
a00644079e
|
reorgnize colotensor directory (#1062)
* reorgnize colotensor directory
* polish code
|
3 years ago |
ver217
|
7faef93326
|
fix dist spec mgr (#1045)
|
3 years ago |
ver217
|
ad536e308e
|
[tensor] refactor colo-tensor (#992)
* refactor colo-tensor and update linear op
* polish code
* polish code
* update ops and unit tests
* update unit tests
* polish code
* rename dist_spec module
* polish code
* polish code
* remove unneeded import
* fix pipelinable
|
3 years ago |
Jiarui Fang
|
802ac297cc
|
[Tensor] remove useless import in tensor dir (#997)
|
3 years ago |
Ziyue Jiang
|
797a9dc5a9
|
add DistSpec for loss and test_model (#947)
|
3 years ago |
ver217
|
67c33f57eb
|
[tensor] design DistSpec and DistSpecManager for ColoTensor (#934)
* add dist spec
* update linear op
* polish code
* polish code
* update embedding op
* polish unit tests
* polish unit tests
* polish comments
* polish code
* add test_dist_spec_mgr
* polish code
* refactor folder structure
* polish unit tests
* add get_process_group() for TensorSpec
* polish code
|
3 years ago |