Commit Graph

640 Commits (007ca0df925350c70ee8ff324599f8e156cec59a)

Author SHA1 Message Date
yuxuan-lou 44b6f8947b
[NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/cuda_util.h code style (#939)
3 years ago
BoxiangW 872aa413c2
[NFC] Polish colossalai/kernel/cuda_native/csrc/multi_tensor_lamb.cu code style. (#937)
3 years ago
ver217 58580b50fe
Revert "[NFC] Hotfix/format (#984)" (#986)
3 years ago
binmakeswell 0772828fba
[NFC] Hotfix/format (#984)
3 years ago
ver217 5898ccf38b
udpate version (#982)
3 years ago
binmakeswell 7471f97fc3
update results on a single GPU, highlight quick view (#981)
3 years ago
ver217 c2fdc6a011
[tensor] derive compute pattern from dist spec (#971)
3 years ago
github-actions[bot] 46bc95708f
Automated submodule synchronization (#960)
3 years ago
Ziyue Jiang 797a9dc5a9
add DistSpec for loss and test_model (#947)
3 years ago
ver217 67c33f57eb
[tensor] design DistSpec and DistSpecManager for ColoTensor (#934)
3 years ago
Ziyue Jiang 830d3bca26
[Tensor] add optimizer to bert test (#933)
3 years ago
github-actions[bot] 7edb38193a
Automated submodule synchronization (#932)
3 years ago
Ziyue Jiang d73c2b1d79
[Tensor] fix init context (#931)
3 years ago
Ziyue Jiang dfc88b85ea
[Tensor] simplify named param (#928)
3 years ago
YuliangLiu0306 32a45cd7ef
[pipelinable]use pipelinable to support GPT model. (#903)
3 years ago
github-actions[bot] b61d64685f
Automated submodule synchronization (#929)
3 years ago
ver217 4ca732349e
[tensor] colo tensor overrides mul (#927)
3 years ago
ver217 45b9124df4
[tensor] hijack addmm for colo tensor (#923)
3 years ago
Jiarui Fang 534afb018a
test pretrain loading on multi-process (#922)
3 years ago
Ziyue Jiang c195d2814c
[Tensor] add from_pretrained support and bert pretrained test (#921)
3 years ago
ver217 1d625fcd36
[setup] support more cuda architectures (#920)
3 years ago
ver217 5d8f1262fb
update cuda ext cc flags (#919)
3 years ago
Jiarui Fang 845856ea29
[Graph] building computing graph with ColoTensor, Linear only (#917)
3 years ago
Ziyue Jiang 75d221918a
[Tensor] add 1d vocab loss (#918)
3 years ago
Ziyue Jiang dfaff4e243
[Tensor] fix test_model (#916)
3 years ago
Jiarui Fang ed6426c300
[Tensor] polish model test (#915)
3 years ago
Ziyue Jiang 0fab86b12a
[Tensor] add a basic bert. (#911)
3 years ago
Jiarui Fang ab95ec9aea
[Tensor] init ColoParameter (#914)
3 years ago
Ziyue Jiang 193d629311
update pytest.mark.parametrize in tensor tests (#913)
3 years ago
github-actions[bot] 1cf7fb3cd9
Automated submodule synchronization (#912)
3 years ago
Frank Lee f0f35216f1
[ci] added wheel build scripts (#910)
3 years ago
ver217 150b1a7453
update local version format (#909)
3 years ago
github-actions[bot] 3b1f5f07ce
Automated submodule synchronization (#907)
3 years ago
Ziyue Jiang f593a5637e
[Tensor] add embedding tp1d row (#904)
3 years ago
ver217 16122d5fac
update release bdist CI (#902)
3 years ago
Ziyue Jiang 2c0d19d755
[Tensor] add ColoTensor TP1Dcol Embedding (#899)
3 years ago
ver217 e46e423c00
add CI for releasing bdist wheel (#901)
3 years ago
Jiarui Fang e1108caf7d
change version to 0.1.4 (#900)
3 years ago
Jiarui Fang d16671da75
[Tensor] initialize the ColoOptimizer (#898)
3 years ago
Jiarui Fang 676f191532
[Tensor] activation is an attr of ColoTensor (#897)
3 years ago
Jiarui Fang e76f76c08b
[Tensor] test parameters() as member function (#896)
3 years ago
Ziyue Jiang cb182da7c5
[tensor] refine linear and add gather for laynorm (#893)
3 years ago
Jiarui Fang 26c49639d8
[Tensor] overriding paramters() for Module using ColoTensor (#889)
3 years ago
ver217 daf59ff72e
[setup] add local version label (#890)
3 years ago
Ziyue Jiang 1d0aba4153
[tensor] add ColoTensor 1Dcol (#888)
3 years ago
Jiarui Fang a0e5971692
[Tensor] test model check results for a simple net (#887)
3 years ago
Jiarui Fang 72cdc06875
[Tensor] make ColoTensor more robust for getattr (#886)
3 years ago
Ziyue Jiang 9bc5a77c31
[tensor] wrap function in the torch_tensor to ColoTensor (#881)
3 years ago
ver217 4df6471f5d
fix import error (#880)
3 years ago
Jiarui Fang 7f76517a85
[Tensor] make a simple net works with 1D row TP (#879)
3 years ago