Frank Lee
|
ae1b58cd16
|
[tensor] added linear implementation for the new sharding spec (#1416)
* [tensor] added linear implementation for the new sharding spec
* polish code
|
2 years ago |
Jiarui Fang
|
30b4dd17c0
|
[FAW] export FAW in _ops (#1438)
|
2 years ago |
Jiarui Fang
|
c9427a323f
|
hotfix #1434 (#1437)
|
2 years ago |
Jiarui Fang
|
10b3df65c8
|
[FAW] move coloparam setting in test code. (#1429)
|
2 years ago |
Jiarui Fang
|
cb98cf5558
|
[FAW] parallel FreqAwareEmbedding (#1424)
|
2 years ago |
Jiarui Fang
|
d209aff684
|
Add FreqAwareEmbeddingBag (#1421)
|
2 years ago |
Jiarui Fang
|
504419d261
|
[FAW] add cache manager for the cached embedding (#1419)
|
2 years ago |
HELSON
|
7a8702c06d
|
[colotensor] add Tensor.view op and its unit test (#1343)
[colotensor] add megatron initialization for gpt2
|
2 years ago |
HELSON
|
260a55804a
|
[hotfix] fix shape error in backward when using ColoTensor (#1298)
|
2 years ago |
HELSON
|
abba4d84e1
|
[hotfix] fix bert model test in unitests (#1272)
|
2 years ago |
Jiarui Fang
|
1aad903c15
|
[tensor] redistribute among different process groups (#1247)
* make it faster
* [tensor] rename convert_to_dist -> redistribute
* [tensor] ShardSpec and ReplicaSpec
* [tensor] redistribute among diff pgs
* polish code
|
2 years ago |
Jiarui Fang
|
9bcd2fd4af
|
[tensor] a shorter shard and replicate spec (#1245)
|
2 years ago |
Jiarui Fang
|
2699dfbbfd
|
[rename] convert_to_dist -> redistribute (#1243)
|
2 years ago |
Jiarui Fang
|
4a76084dc9
|
[tensor] add zero_like colo op, important for Optimizer (#1236)
|
2 years ago |
Jiarui Fang
|
3b500984b1
|
[tensor] fix some unittests (#1234)
|
2 years ago |
HELSON
|
0453776def
|
[tensor] fix a assertion in colo_tensor cross_entropy (#1232)
|
2 years ago |
HELSON
|
42ab36b762
|
[tensor] add unitest for colo_tensor 1DTP cross_entropy (#1230)
|
2 years ago |
Jiarui Fang
|
a98319f023
|
[tensor] torch function return colotensor (#1229)
|
2 years ago |
Jiarui Fang
|
ae7d3f4927
|
[refactor] move process group from _DistSpec to ColoTensor. (#1203)
|
2 years ago |
Jiarui Fang
|
060b917daf
|
[refactor] remove gpc dependency in colotensor's _ops (#1189)
|
2 years ago |
Jiarui Fang
|
1b657f9ce1
|
[tensor] revert local view back (#1178)
|
2 years ago |
Jiarui Fang
|
0dd4e2bbfb
|
[Tensor] rename some APIs in TensorSpec and Polish view unittest (#1176)
|
2 years ago |
Jiarui Fang
|
aa7bef73d4
|
[Tensor] distributed view supports inter-process hybrid parallel (#1169)
|
2 years ago |
Jiarui Fang
|
4b9bba8116
|
[ColoTensor] rename APIs and add output_replicate to ComputeSpec (#1168)
|
2 years ago |
Jiarui Fang
|
f4ef224358
|
[Tensor] remove ParallelAction, use ComputeSpec instread (#1166)
|
2 years ago |
Jiarui Fang
|
177c374401
|
remove gather out in parallel action (#1163)
|
2 years ago |
Jiarui Fang
|
07f9c781f9
|
[graph] improve the graph building. (#1157)
|
2 years ago |
ver217
|
22717a856f
|
[tensor] add embedding bag op (#1156)
|
2 years ago |
ver217
|
ae86151968
|
[tensor] add more element-wise ops (#1155)
* add more element-wise ops
* update test_op
* polish unit test
|
2 years ago |
ver217
|
ccf3c58c89
|
embedding op use gather_out (#1143)
|
2 years ago |
Jiarui Fang
|
a00644079e
|
reorgnize colotensor directory (#1062)
* reorgnize colotensor directory
* polish code
|
3 years ago |