Commit Graph

402 Commits (8ffdc3837679eee56000ec96e0bc46de9d3fff35)

Author SHA1 Message Date
wky 8ffdc38376 [NFC] polish colossalai/kernel/cuda_native/csrc/moe_cuda.cpp code style (#942)
3 years ago
HaoyuQin c0f373db5d [NFC] polish pre-commit run --files colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax_cuda.cu code style (#943)
3 years ago
XYE 5bbefeb06a [NFC] polish moe_cuda_kernel.cu code style (#940)
3 years ago
Maruyama_Aya 7aa35eae6a [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/block_reduce.h code style (#938)
3 years ago
Geng Zhang b6cc9313ef [NFC] polish colossalai/kernel/cuda_native/csrc/cpu_adam.cpp code style (#936)
3 years ago
yuxuan-lou 44b6f8947b [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/cuda_util.h code style (#939)
3 years ago
BoxiangW 872aa413c2 [NFC] Polish colossalai/kernel/cuda_native/csrc/multi_tensor_lamb.cu code style. (#937)
3 years ago
ver217 58580b50fe
Revert "[NFC] Hotfix/format (#984)" (#986)
3 years ago
binmakeswell 0772828fba
[NFC] Hotfix/format (#984)
3 years ago
ver217 c2fdc6a011
[tensor] derive compute pattern from dist spec (#971)
3 years ago
Ziyue Jiang 797a9dc5a9
add DistSpec for loss and test_model (#947)
3 years ago
ver217 67c33f57eb
[tensor] design DistSpec and DistSpecManager for ColoTensor (#934)
3 years ago
Ziyue Jiang d73c2b1d79
[Tensor] fix init context (#931)
3 years ago
Ziyue Jiang dfc88b85ea
[Tensor] simplify named param (#928)
3 years ago
YuliangLiu0306 32a45cd7ef
[pipelinable]use pipelinable to support GPT model. (#903)
3 years ago
ver217 4ca732349e
[tensor] colo tensor overrides mul (#927)
3 years ago
ver217 45b9124df4
[tensor] hijack addmm for colo tensor (#923)
3 years ago
Ziyue Jiang c195d2814c
[Tensor] add from_pretrained support and bert pretrained test (#921)
3 years ago
Jiarui Fang 845856ea29
[Graph] building computing graph with ColoTensor, Linear only (#917)
3 years ago
Ziyue Jiang 75d221918a
[Tensor] add 1d vocab loss (#918)
3 years ago
Jiarui Fang ab95ec9aea
[Tensor] init ColoParameter (#914)
3 years ago
Ziyue Jiang f593a5637e
[Tensor] add embedding tp1d row (#904)
3 years ago
Ziyue Jiang 2c0d19d755
[Tensor] add ColoTensor TP1Dcol Embedding (#899)
3 years ago
Jiarui Fang d16671da75
[Tensor] initialize the ColoOptimizer (#898)
3 years ago
Jiarui Fang 676f191532
[Tensor] activation is an attr of ColoTensor (#897)
3 years ago
Ziyue Jiang cb182da7c5
[tensor] refine linear and add gather for laynorm (#893)
3 years ago
Jiarui Fang 26c49639d8
[Tensor] overriding paramters() for Module using ColoTensor (#889)
3 years ago
Ziyue Jiang 1d0aba4153
[tensor] add ColoTensor 1Dcol (#888)
3 years ago
Jiarui Fang 72cdc06875
[Tensor] make ColoTensor more robust for getattr (#886)
3 years ago
Ziyue Jiang 9bc5a77c31
[tensor] wrap function in the torch_tensor to ColoTensor (#881)
3 years ago
ver217 4df6471f5d
fix import error (#880)
3 years ago
Jiarui Fang 7f76517a85
[Tensor] make a simple net works with 1D row TP (#879)
3 years ago
ver217 c4d903e64a
[gemini] accelerate adjust_layout() (#878)
3 years ago
Jiarui Fang 909211453b
[Tensor] Add some attributes to ColoTensor (#877)
3 years ago
HELSON 425b4a96b8
[gemini] polish stateful_tensor_mgr (#876)
3 years ago
Jiarui Fang e43f83aa5c
[Tensor] get named parameters for model using ColoTensors (#874)
3 years ago
Jiarui Fang 96211c2cc8
[tensor] customized op returns ColoTensor (#875)
3 years ago
Ziyue Jiang 26d4ab8b03
[Tensor] Add function to spec and update linear 1Drow and unit tests (#869)
3 years ago
Frank Lee 11f54c7b6b
[doc] improved docstring and assertion messages for the engine module (#871)
3 years ago
Frank Lee 1c34382678
[doc] improved assertion messages in trainer (#873)
3 years ago
Frank Lee 7a64fae33a
[doc] improved error messages in initialize (#872)
3 years ago
Jiarui Fang 1190b2c4a4
[tensor] add cross_entrophy_loss (#868)
3 years ago
HELSON 3107817172
[gemini] add stateful tensor container (#867)
3 years ago
Jiarui Fang d01d3b8cb0
colo init context add device attr. (#866)
3 years ago
Frank Lee 2238758c2e
[usability] improved error messages in the context module (#856)
3 years ago
Frank Lee 9fdebadd69
[doc] improved docstring in the amp module (#857)
3 years ago
Frank Lee b862d89d00
[doc] improved docstring in the logging module (#861)
3 years ago
Frank Lee 8004c8e938
[doc] improved docstring in the communication module (#863)
3 years ago
Jiarui Fang 8af5f7423d
[tensor] an initial dea of tensor spec (#865)
3 years ago
Jiarui Fang 126ba573a8
[Tensor] add layer norm Op (#852)
3 years ago