Frank Lee
|
e4685832f8
|
[engine] fixed bug in gradient accumulation dataloader to keep the last step (#1030)
|
2022-05-26 14:28:23 +08:00 |
Ziyue Jiang
|
32291dd73f
|
[Tensor] add module handler for linear (#1021)
* add module spec for linear
* polish
* polish
* polish
|
2022-05-26 11:50:44 +08:00 |
Frank Lee
|
ee50497db2
|
[ci] fixed nightly build workflow (#1029)
|
2022-05-26 11:42:50 +08:00 |
Ryan Russell
|
9b0c037027
|
fix typo in constants (#1027)
|
2022-05-26 08:45:08 +08:00 |
ver217
|
007ca0df92
|
fix colo init context (#1026)
|
2022-05-25 20:41:58 +08:00 |
Frank Lee
|
58a7dd2ede
|
[ci] fixed nightly build workflow (#1022)
* [ci] fixed nightly build workflow
* [ci] fixed nightly build workflow
* [ci] fixed nightly build workflow
|
2022-05-24 22:38:56 +08:00 |
Frank Lee
|
1a76c88aba
|
[ci] added nightly build (#1018) (#1019)
|
2022-05-24 17:56:01 +08:00 |
Frank Lee
|
8d06186ff9
|
[doc] update docker instruction (#1020)
|
2022-05-24 17:51:50 +08:00 |
Frank Lee
|
e17a43184b
|
[ci] update the docker image name (#1017)
|
2022-05-24 16:53:39 +08:00 |
YuliangLiu0306
|
d182b0bd47
|
[hotfix] fix some bugs caused by size mismatch. (#1011)
* [CLI] add CLI launcher
* Revert "[CLI] add CLI launcher"
This reverts commit df7e6506d4 .
* [hotfix]fix some bugs caused by size mismatch.
* add warning logs
* polish
|
2022-05-23 14:02:28 +08:00 |
binmakeswell
|
9833d814d5
|
[NFC] fix paper link
|
2022-05-21 18:34:36 +08:00 |
ver217
|
cefc29ff06
|
[tensor] impl ColoDDP for ColoTensor (#1009)
* impl ColoDDP for ColoTensor
* polish code
|
2022-05-21 13:52:04 +08:00 |
zhengzangw
|
ae7c338105
|
[NFC] polish colossalai/kernel/cuda_native/csrc/colossal_C_frontend.cpp code style
|
2022-05-20 23:57:38 +08:00 |
ver217
|
a3b66f6def
|
[tensor] refactor parallel action (#1007)
* refactor parallel action
* polish unit tests
|
2022-05-20 20:19:58 +08:00 |
github-actions[bot]
|
9e3d602dba
|
Automated submodule synchronization (#1003)
Co-authored-by: github-actions <github-actions@github.com>
|
2022-05-20 17:08:44 +08:00 |
ver217
|
8e3d0ad8f1
|
[unit test] refactor test tensor (#1005)
* polish test_gpt
* update op unit tests
* update test model
|
2022-05-19 18:57:56 +08:00 |
ver217
|
ad536e308e
|
[tensor] refactor colo-tensor (#992)
* refactor colo-tensor and update linear op
* polish code
* polish code
* update ops and unit tests
* update unit tests
* polish code
* rename dist_spec module
* polish code
* polish code
* remove unneeded import
* fix pipelinable
|
2022-05-19 12:44:59 +08:00 |
Frank Lee
|
1467d83edf
|
[cli] remove unused imports (#1001)
|
2022-05-18 23:27:18 +08:00 |
Frank Lee
|
533d0c46d8
|
[kernel] fixed the include bug in dropout kernel (#999)
|
2022-05-18 21:43:18 +08:00 |
binmakeswell
|
c27ea0d980
|
fix download link (#998)
|
2022-05-18 18:05:18 +08:00 |
Jiarui Fang
|
802ac297cc
|
[Tensor] remove useless import in tensor dir (#997)
|
2022-05-18 14:54:51 +08:00 |
Ziheng Qin
|
571f12eff3
|
[NFC] polish colossalai/nn/layer/utils/common.py code style (#983)
|
2022-05-17 10:25:06 +08:00 |
puck_WCR
|
bda70b4b66
|
[NFC] polish colossalai/kernel/cuda_native/layer_norm.py code style (#980)
|
2022-05-17 10:25:06 +08:00 |
Kai Wang (Victor Kai)
|
c50c08dcbb
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/dropout_kernels.cu code style (#979)
|
2022-05-17 10:25:06 +08:00 |
binmakeswell
|
f28c021376
|
[NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_sgd_kernel.cu code style (#978)
|
2022-05-17 10:25:06 +08:00 |
shenggan
|
18542b47fc
|
[NFC] polish colossalai/nn/layer/parallel_2d/layers.py code style (#976)
|
2022-05-17 10:25:06 +08:00 |
Jie Zhu
|
b67eebd20f
|
[NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_scale_kernel.cu code style (#977)
|
2022-05-17 10:25:06 +08:00 |
DouJS
|
52705ec5c5
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/normalize_kernels.cu code style (#974)
|
2022-05-17 10:25:06 +08:00 |
Ofey Chan
|
136946422b
|
[NFC] polish colossalai/kernel/cuda_native/csrc/layer_norm_cuda.cpp code style (#973)
|
2022-05-17 10:25:06 +08:00 |
Zirui Zhu
|
598cde4a0f
|
[NFC] polish colossalai/nn/layer/parallel_2p5d/layers.py code style (#972)
|
2022-05-17 10:25:06 +08:00 |
Xu Kai
|
632e94abde
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/dropout.h code style (#970)
|
2022-05-17 10:25:06 +08:00 |
ExtremeViscent
|
22d1df224d
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/feed_forward.h (#968)
code style
|
2022-05-17 10:25:06 +08:00 |
LuGY
|
fb5bc6cb28
|
[NFC] polish colossalai/nn/layer/parallel_3d/layers.py code style (#966)
|
2022-05-17 10:25:06 +08:00 |
lucasliunju
|
955463e542
|
[NFC] polish __init__.py code style (#965)
|
2022-05-17 10:25:06 +08:00 |
Yuer867
|
7106a399fc
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/softmax.h code style (#964)
|
2022-05-17 10:25:06 +08:00 |
ziyu huang
|
5bd80b7dd1
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/general_kernels.cu code style (#963)
Co-authored-by: “Arsmart123 <202476410arsmart@gmail.com>
|
2022-05-17 10:25:06 +08:00 |
superhao1995
|
48c4a180c7
|
[NFC] polish colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax.cpp code style (#959)
|
2022-05-17 10:25:06 +08:00 |
MaxT
|
442a2975ab
|
[NFC] polish colossalai/kernel/cuda_native/csrc/multihead_attention_1d.h code style (#962)
|
2022-05-17 10:25:06 +08:00 |
runluo
|
89e2767a92
|
[NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_l2norm_kernel.cu code style (#958)
|
2022-05-17 10:25:06 +08:00 |
doubleHU
|
1dc1b6fa00
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/cross_entropy_layer.h code style (#957)
|
2022-05-17 10:25:06 +08:00 |
RichardoLuo
|
0e922da874
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/context.h code style (#956)
Co-authored-by: RichardoLuo <14049555596@qq.com>
|
2022-05-17 10:25:06 +08:00 |
Wangbo Zhao(黑色枷锁)
|
8ca2a85682
|
[NFC] polish colossalai/kernel/cuda_native/scaled_softmax.py code style (#955)
|
2022-05-17 10:25:06 +08:00 |
Luxios22
|
f6970ef8b1
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/softmax_kernels.cu code style (#954)
|
2022-05-17 10:25:06 +08:00 |
Cautiousss
|
0b86a6345e
|
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/cross_entropy.cu code style (#953)
Co-authored-by: 何晓昕 <cautious@hexiaoxins-MacBook-Pro.local>
|
2022-05-17 10:25:06 +08:00 |
Sze-qq
|
d8d07b0e2b
|
[NFC] polish colossalai/kernel/cuda_native/csrc/multihead_attention_1d.cpp code style (#952)
|
2022-05-17 10:25:06 +08:00 |
xyupeng
|
fa43bb216d
|
[NFC] polish colossalai/builder/pipeline.py code style (#951)
|
2022-05-17 10:25:06 +08:00 |
JT.Han
|
c3e423c8be
|
[NFC] polish colossalai/kernel/cuda_native/csrc/scaled_masked_softmax_cuda.cu code style (#949)
Co-authored-by: Jiatong <jiatong.han@u.nus.edu>
|
2022-05-17 10:25:06 +08:00 |
luoling-LC
|
72c71b67ec
|
[NFC] polish colossalai/kernel/jit/bias_gelu.py code style (#946)
Co-authored-by: jnbai <897086360@qq.com>
|
2022-05-17 10:25:06 +08:00 |
bajiaoyu517
|
eb9a81d72a
|
[NFC] polish colossalai/kernel/cuda_native/csrc/cpu_adam.h code style (#945)
|
2022-05-17 10:25:06 +08:00 |
wky
|
8ffdc38376
|
[NFC] polish colossalai/kernel/cuda_native/csrc/moe_cuda.cpp code style (#942)
|
2022-05-17 10:25:06 +08:00 |