870 Commits (11d1436a6712ab113991f209219deb3d3529ce79)
 

Author SHA1 Message Date
binmakeswell f28c021376 [NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_sgd_kernel.cu code style (#978)
3 years ago
shenggan 18542b47fc [NFC] polish colossalai/nn/layer/parallel_2d/layers.py code style (#976)
3 years ago
Jie Zhu b67eebd20f [NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_scale_kernel.cu code style (#977)
3 years ago
DouJS 52705ec5c5 [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/normalize_kernels.cu code style (#974)
3 years ago
Ofey Chan 136946422b [NFC] polish colossalai/kernel/cuda_native/csrc/layer_norm_cuda.cpp code style (#973)
3 years ago
Zirui Zhu 598cde4a0f [NFC] polish colossalai/nn/layer/parallel_2p5d/layers.py code style (#972)
3 years ago
Xu Kai 632e94abde [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/dropout.h code style (#970)
3 years ago
ExtremeViscent 22d1df224d [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/feed_forward.h (#968)
3 years ago
LuGY fb5bc6cb28 [NFC] polish colossalai/nn/layer/parallel_3d/layers.py code style (#966)
3 years ago
lucasliunju 955463e542 [NFC] polish __init__.py code style (#965)
3 years ago
Yuer867 7106a399fc [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/softmax.h code style (#964)
3 years ago
ziyu huang 5bd80b7dd1 [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/general_kernels.cu code style (#963)
3 years ago
superhao1995 48c4a180c7 [NFC] polish colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax.cpp code style (#959)
3 years ago
MaxT 442a2975ab [NFC] polish colossalai/kernel/cuda_native/csrc/multihead_attention_1d.h code style (#962)
3 years ago
runluo 89e2767a92 [NFC] polish colossalai/kernel/cuda_native/csrc/multi_tensor_l2norm_kernel.cu code style (#958)
3 years ago
doubleHU 1dc1b6fa00 [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/cross_entropy_layer.h code style (#957)
3 years ago
RichardoLuo 0e922da874 [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/context.h code style (#956)
3 years ago
Wangbo Zhao(黑色枷锁) 8ca2a85682 [NFC] polish colossalai/kernel/cuda_native/scaled_softmax.py code style (#955)
3 years ago
Luxios22 f6970ef8b1 [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/softmax_kernels.cu code style (#954)
3 years ago
Cautiousss 0b86a6345e [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/cross_entropy.cu code style (#953)
3 years ago
Sze-qq d8d07b0e2b [NFC] polish colossalai/kernel/cuda_native/csrc/multihead_attention_1d.cpp code style (#952)
3 years ago
xyupeng fa43bb216d [NFC] polish colossalai/builder/pipeline.py code style (#951)
3 years ago
JT.Han c3e423c8be [NFC] polish colossalai/kernel/cuda_native/csrc/scaled_masked_softmax_cuda.cu code style (#949)
3 years ago
luoling-LC 72c71b67ec [NFC] polish colossalai/kernel/jit/bias_gelu.py code style (#946)
3 years ago
bajiaoyu517 eb9a81d72a [NFC] polish colossalai/kernel/cuda_native/csrc/cpu_adam.h code style (#945)
3 years ago
wky 8ffdc38376 [NFC] polish colossalai/kernel/cuda_native/csrc/moe_cuda.cpp code style (#942)
3 years ago
HaoyuQin c0f373db5d [NFC] polish pre-commit run --files colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax_cuda.cu code style (#943)
3 years ago
XYE 5bbefeb06a [NFC] polish moe_cuda_kernel.cu code style (#940)
3 years ago
Maruyama_Aya 7aa35eae6a [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/block_reduce.h code style (#938)
3 years ago
Geng Zhang b6cc9313ef [NFC] polish colossalai/kernel/cuda_native/csrc/cpu_adam.cpp code style (#936)
3 years ago
yuxuan-lou 44b6f8947b [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/cuda_util.h code style (#939)
3 years ago
BoxiangW 872aa413c2 [NFC] Polish colossalai/kernel/cuda_native/csrc/multi_tensor_lamb.cu code style. (#937)
3 years ago
ver217 58580b50fe Revert "[NFC] Hotfix/format (#984)" (#986)
3 years ago
binmakeswell 0772828fba [NFC] Hotfix/format (#984)
3 years ago
ver217 5898ccf38b udpate version (#982)
3 years ago
binmakeswell 7471f97fc3 update results on a single GPU, highlight quick view (#981)
3 years ago
ver217 c2fdc6a011 [tensor] derive compute pattern from dist spec (#971)
3 years ago
github-actions[bot] 46bc95708f Automated submodule synchronization (#960)
3 years ago
Ziyue Jiang 797a9dc5a9 add DistSpec for loss and test_model (#947)
3 years ago
ver217 67c33f57eb [tensor] design DistSpec and DistSpecManager for ColoTensor (#934)
3 years ago
Ziyue Jiang 830d3bca26 [Tensor] add optimizer to bert test (#933)
3 years ago
github-actions[bot] 7edb38193a Automated submodule synchronization (#932)
3 years ago
Ziyue Jiang d73c2b1d79 [Tensor] fix init context (#931)
3 years ago
Ziyue Jiang dfc88b85ea [Tensor] simplify named param (#928)
3 years ago
YuliangLiu0306 32a45cd7ef [pipelinable]use pipelinable to support GPT model. (#903)
3 years ago
github-actions[bot] b61d64685f Automated submodule synchronization (#929)
3 years ago
ver217 4ca732349e [tensor] colo tensor overrides mul (#927)
3 years ago
ver217 45b9124df4 [tensor] hijack addmm for colo tensor (#923)
3 years ago
Jiarui Fang 534afb018a test pretrain loading on multi-process (#922)
3 years ago
Ziyue Jiang c195d2814c [Tensor] add from_pretrained support and bert pretrained test (#921)
3 years ago