binmakeswell
|
626dd187e4
|
add inference submodule (#1047)
|
3 years ago |
ver217
|
7faef93326
|
fix dist spec mgr (#1045)
|
3 years ago |
ver217
|
9492a561c3
|
[tensor] ColoTensor supports ZeRO (#1015)
* impl chunk manager
* impl param op hook
* add reduce_chunk
* add zero hook v2
* add zero dp
* fix TensorInfo
* impl load balancing when using zero without chunk
* fix zero hook
* polish chunk
* fix bugs
* ddp ok
* zero ok
* polish code
* fix bugs about load balancing
* polish code
* polish code
* add end-to-end test
* polish code
* polish code
* polish code
* fix typo
* add test_chunk
* fix bugs
* fix bugs
* polish code
|
3 years ago |
Frank Lee
|
cfa6c1b46b
|
[ci] fixed nightly build workflow (#1040)
|
3 years ago |
YuliangLiu0306
|
9feff0f760
|
[titans]remove model zoo (#1042)
* [CLI] add CLI launcher
* Revert "[CLI] add CLI launcher"
This reverts commit df7e6506d4.
* rm model zoo
|
3 years ago |
binmakeswell
|
0dac86866b
|
[NFC] add inference (#1044)
|
3 years ago |
Ziyue Jiang
|
7c530b9de2
|
[Tensor] add Parameter inheritance for ColoParameter (#1041)
* add Parameter inheritance for ColoParameter
* remove tricks
* remove tricks
* polish
* polish
|
3 years ago |
github-actions[bot]
|
4d8a574cd3
|
Automated submodule synchronization (#1034)
Co-authored-by: github-actions <github-actions@github.com>
|
3 years ago |
ver217
|
7cfd6c827e
|
[zero] add load_state_dict for sharded model (#894)
* add load_state_dict for sharded model
* fix bug
* fix bug
* fix ckpt dtype and device
* support load state dict in zero init ctx
* fix bugs
|
3 years ago |
Ziyue Jiang
|
6c5996a56e
|
[Tensor] add module check and bert test (#1031)
* add Embedding
* Add bert test
* polish
* add check module test
* polish
* polish
* polish
* polish
|
3 years ago |
YuliangLiu0306
|
7106bd671d
|
[p2p]add object list send/recv (#1024)
* [CLI] add CLI launcher
* Revert "[CLI] add CLI launcher"
This reverts commit df7e6506d4.
* [p2p]add object list send recv
* refactor for code reusability
* polish
|
3 years ago |
Frank Lee
|
e4685832f8
|
[engine] fixed bug in gradient accumulation dataloader to keep the last step (#1030)
|
3 years ago |
Ziyue Jiang
|
32291dd73f
|
[Tensor] add module handler for linear (#1021)
* add module spec for linear
* polish
* polish
* polish
|
3 years ago |
Frank Lee
|
ee50497db2
|
[ci] fixed nightly build workflow (#1029)
|
3 years ago |
Ryan Russell
|
9b0c037027
|
fix typo in constants (#1027)
|
3 years ago |
ver217
|
007ca0df92
|
fix colo init context (#1026)
|
3 years ago |
Frank Lee
|
58a7dd2ede
|
[ci] fixed nightly build workflow (#1022)
* [ci] fixed nightly build workflow
* [ci] fixed nightly build workflow
* [ci] fixed nightly build workflow
|
3 years ago |
Frank Lee
|
1a76c88aba
|
[ci] added nightly build (#1018) (#1019)
|
3 years ago |
Frank Lee
|
8d06186ff9
|
[doc] update docker instruction (#1020)
|
3 years ago |
Frank Lee
|
e17a43184b
|
[ci] update the docker image name (#1017)
|
3 years ago |
YuliangLiu0306
|
d182b0bd47
|
[hotfix] fix some bugs caused by size mismatch. (#1011)
* [CLI] add CLI launcher
* Revert "[CLI] add CLI launcher"
This reverts commit df7e6506d4.
* [hotfix]fix some bugs caused by size mismatch.
* add warning logs
* polish
|
3 years ago |
binmakeswell
|
9833d814d5
|
[NFC] fix paper link
|
3 years ago |
ver217
|
cefc29ff06
|
[tensor] impl ColoDDP for ColoTensor (#1009)
* impl ColoDDP for ColoTensor
* polish code
|
3 years ago |
zhengzangw
|
ae7c338105
|
[NFC] polish colossalai/kernel/cuda_native/csrc/colossal_C_frontend.cpp code style
|
3 years ago |
ver217
|
a3b66f6def
|
[tensor] refactor parallel action (#1007)
* refactor parallel action
* polish unit tests
|
3 years ago |
github-actions[bot]
|
9e3d602dba
|
Automated submodule synchronization (#1003)
Co-authored-by: github-actions <github-actions@github.com>
|
3 years ago |
ver217
|
8e3d0ad8f1
|
[unit test] refactor test tensor (#1005)
* polish test_gpt
* update op unit tests
* update test model
|
3 years ago |
ver217
|
ad536e308e
|
[tensor] refactor colo-tensor (#992)
* refactor colo-tensor and update linear op
* polish code
* polish code
* update ops and unit tests
* update unit tests
* polish code
* rename dist_spec module
* polish code
* polish code
* remove unneeded import
* fix pipelinable
|
3 years ago |
Frank Lee
|
1467d83edf
|
[cli] remove unused imports (#1001)
|
3 years ago |
Frank Lee
|
533d0c46d8
|
[kernel] fixed the include bug in dropout kernel (#999)
|
3 years ago |
binmakeswell
|
c27ea0d980
|
fix download link (#998)
|
3 years ago |
Jiarui Fang
|
802ac297cc
|
[Tensor] remove useless import in tensor dir (#997)
|
3 years ago |
Ziheng Qin
|
571f12eff3
|
[NFC] polish colossalai/nn/layer/utils/common.py code style (#983)
|
3 years ago |
puck_WCR
|
bda70b4b66
|
[NFC] polish colossalai/kernel/cuda_native/layer_norm.py code style (#980)
|
3 years ago |
Kai Wang (Victor Kai)
|
c50c08dcbb
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/dropout_kernels.cu code style (#979)
|
3 years ago |
binmakeswell
|
f28c021376
|
[NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_sgd_kernel.cu code style (#978)
|
3 years ago |
shenggan
|
18542b47fc
|
[NFC] polish colossalai/nn/layer/parallel_2d/layers.py code style (#976)
|
3 years ago |
Jie Zhu
|
b67eebd20f
|
[NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_scale_kernel.cu code style (#977)
|
3 years ago |
DouJS
|
52705ec5c5
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/normalize_kernels.cu code style (#974)
|
3 years ago |
Ofey Chan
|
136946422b
|
[NFC] polish colossalai/kernel/cuda_native/csrc/layer_norm_cuda.cpp code style (#973)
|
3 years ago |
Zirui Zhu
|
598cde4a0f
|
[NFC] polish colossalai/nn/layer/parallel_2p5d/layers.py code style (#972)
|
3 years ago |
Xu Kai
|
632e94abde
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/dropout.h code style (#970)
|
3 years ago |
ExtremeViscent
|
22d1df224d
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/feed_forward.h (#968)
code style
|
3 years ago |
LuGY
|
fb5bc6cb28
|
[NFC] polish colossalai/nn/layer/parallel_3d/layers.py code style (#966)
|
3 years ago |
lucasliunju
|
955463e542
|
[NFC] polish __init__.py code style (#965)
|
3 years ago |
Yuer867
|
7106a399fc
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/softmax.h code style (#964)
|
3 years ago |
ziyu huang
|
5bd80b7dd1
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/general_kernels.cu code style (#963)
Co-authored-by: Arsmart123 <202476410arsmart@gmail.com>
|
3 years ago |
superhao1995
|
48c4a180c7
|
[NFC] polish colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax.cpp code style (#959)
|
3 years ago |
MaxT
|
442a2975ab
|
[NFC] polish colossalai/kernel/cuda_native/csrc/multihead_attention_1d.h code style (#962)
|
3 years ago |
runluo
|
89e2767a92
|
[NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_l2norm_kernel.cu code style (#958)
|
3 years ago |