Commit Graph

455 Commits (3d10be33bdd89d4b93d95486dcf8d386ad9ae5e6)

Author SHA1 Message Date
JT.Han c3e423c8be [NFC] polish colossalai/kernel/cuda_native/csrc/scaled_masked_softmax_cuda.cu code style (#949)
3 years ago
luoling-LC 72c71b67ec [NFC] polish colossalai/kernel/jit/bias_gelu.py code style (#946)
3 years ago
bajiaoyu517 eb9a81d72a [NFC] polish colossalai/kernel/cuda_native/csrc/cpu_adam.h code style (#945)
3 years ago
wky 8ffdc38376 [NFC] polish colossalai/kernel/cuda_native/csrc/moe_cuda.cpp code style (#942)
3 years ago
HaoyuQin c0f373db5d [NFC] polish pre-commit run --files colossalai/kernel/cuda_native/csrc/scaled_upper_triang_masked_softmax_cuda.cu code style (#943)
3 years ago
XYE 5bbefeb06a [NFC] polish moe_cuda_kernel.cu code style (#940)
3 years ago
Maruyama_Aya 7aa35eae6a [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/block_reduce.h code style (#938)
3 years ago
Geng Zhang b6cc9313ef [NFC] polish colossalai/kernel/cuda_native/csrc/cpu_adam.cpp code style (#936)
3 years ago
yuxuan-lou 44b6f8947b [NFC] polish colossalai/kernel/cuda_native/csrc/kernels/include/cuda_util.h code style (#939)
3 years ago
BoxiangW 872aa413c2 [NFC] Polish colossalai/kernel/cuda_native/csrc/multi_tensor_lamb.cu code style. (#937)
3 years ago
ver217 58580b50fe
Revert "[NFC] Hotfix/format (#984)" (#986)
3 years ago
binmakeswell 0772828fba
[NFC] Hotfix/format (#984)
3 years ago
ver217 c2fdc6a011
[tensor] derive compute pattern from dist spec (#971)
3 years ago
Ziyue Jiang 797a9dc5a9
add DistSpec for loss and test_model (#947)
3 years ago
ver217 67c33f57eb
[tensor] design DistSpec and DistSpecManager for ColoTensor (#934)
3 years ago
Ziyue Jiang d73c2b1d79
[Tensor] fix init context (#931)
3 years ago
Ziyue Jiang dfc88b85ea
[Tensor] simplify named param (#928)
3 years ago
YuliangLiu0306 32a45cd7ef
[pipelinable]use pipelinable to support GPT model. (#903)
3 years ago
ver217 4ca732349e
[tensor] colo tensor overrides mul (#927)
3 years ago
ver217 45b9124df4
[tensor] hijack addmm for colo tensor (#923)
3 years ago
Ziyue Jiang c195d2814c
[Tensor] add from_pretrained support and bert pretrained test (#921)
3 years ago
Jiarui Fang 845856ea29
[Graph] building computing graph with ColoTensor, Linear only (#917)
3 years ago
Ziyue Jiang 75d221918a
[Tensor] add 1d vocab loss (#918)
3 years ago
Jiarui Fang ab95ec9aea
[Tensor] init ColoParameter (#914)
3 years ago
Ziyue Jiang f593a5637e
[Tensor] add embedding tp1d row (#904)
3 years ago
Ziyue Jiang 2c0d19d755
[Tensor] add ColoTensor TP1Dcol Embedding (#899)
3 years ago
Jiarui Fang d16671da75
[Tensor] initialize the ColoOptimizer (#898)
3 years ago
Jiarui Fang 676f191532
[Tensor] activation is an attr of ColoTensor (#897)
3 years ago
Ziyue Jiang cb182da7c5
[tensor] refine linear and add gather for laynorm (#893)
3 years ago
Jiarui Fang 26c49639d8
[Tensor] overriding paramters() for Module using ColoTensor (#889)
3 years ago
Ziyue Jiang 1d0aba4153
[tensor] add ColoTensor 1Dcol (#888)
3 years ago
Jiarui Fang 72cdc06875
[Tensor] make ColoTensor more robust for getattr (#886)
3 years ago
Ziyue Jiang 9bc5a77c31
[tensor] wrap function in the torch_tensor to ColoTensor (#881)
3 years ago
ver217 4df6471f5d
fix import error (#880)
3 years ago
Jiarui Fang 7f76517a85
[Tensor] make a simple net works with 1D row TP (#879)
3 years ago
ver217 c4d903e64a
[gemini] accelerate adjust_layout() (#878)
3 years ago
Jiarui Fang 909211453b
[Tensor] Add some attributes to ColoTensor (#877)
3 years ago
HELSON 425b4a96b8
[gemini] polish stateful_tensor_mgr (#876)
3 years ago
Jiarui Fang e43f83aa5c
[Tensor] get named parameters for model using ColoTensors (#874)
3 years ago
Jiarui Fang 96211c2cc8
[tensor] customized op returns ColoTensor (#875)
3 years ago
Ziyue Jiang 26d4ab8b03
[Tensor] Add function to spec and update linear 1Drow and unit tests (#869)
3 years ago
Frank Lee 11f54c7b6b
[doc] improved docstring and assertion messages for the engine module (#871)
3 years ago
Frank Lee 1c34382678
[doc] improved assertion messages in trainer (#873)
3 years ago
Frank Lee 7a64fae33a
[doc] improved error messages in initialize (#872)
3 years ago
Jiarui Fang 1190b2c4a4
[tensor] add cross_entrophy_loss (#868)
3 years ago
HELSON 3107817172
[gemini] add stateful tensor container (#867)
3 years ago
Jiarui Fang d01d3b8cb0
colo init context add device attr. (#866)
3 years ago
Frank Lee 2238758c2e
[usability] improved error messages in the context module (#856)
3 years ago
Frank Lee 9fdebadd69
[doc] improved docstring in the amp module (#857)
3 years ago
Frank Lee b862d89d00
[doc] improved docstring in the logging module (#861)
3 years ago
Frank Lee 8004c8e938
[doc] improved docstring in the communication module (#863)
3 years ago
Jiarui Fang 8af5f7423d
[tensor] an initial dea of tensor spec (#865)
3 years ago
Jiarui Fang 126ba573a8
[Tensor] add layer norm Op (#852)
3 years ago
Frank Lee a82da26f7e
[cli] refactored micro-benchmarking cli and added more metrics (#858)
3 years ago
Frank Lee ee222dfbf3
[usability] added assertion message in registry (#864)
3 years ago
HELSON f0e654558f
[gemini] polish code (#855)
3 years ago
Jiarui Fang 29159d9b5b
hotfix tensor unittest bugs (#862)
3 years ago
YuliangLiu0306 c6930d8ddf
[pipelinable]use ColoTensor to replace dummy tensor. (#853)
3 years ago
Ziyue Jiang bcc8655021
[Tensor ] Add 1Drow weight reshard by spec (#854)
3 years ago
ver217 d7e0303d1e
[zero] use GeminiMemoryManager when sampling model data (#850)
3 years ago
ver217 232142f402
[utils] refactor profiler (#837)
3 years ago
Jiarui Fang 62f059251b
[Tensor] init a tp network training unittest (#849)
3 years ago
ver217 0dea140760
[hotfix] add deconstructor for stateful tensor (#848)
3 years ago
ver217 0f7ed8c192
fix _post_init_method of zero init ctx (#847)
3 years ago
Ziyue Jiang 2a0a427e04
[tensor]add assert for colo_tensor 1Drow (#846)
3 years ago
Ziyue Jiang 05023ecfee
[Tensor] TP Linear 1D row (#843)
3 years ago
Frank Lee cf6d1c9284
[CLI] refactored the launch CLI and fixed bugs in multi-node launching (#844)
3 years ago
HELSON e5ea3fdeef
[gemini] add GeminiMemoryManger (#832)
3 years ago
YuliangLiu0306 35ea6e1023
[pipelinable]use pipelinable context to initialize non-pipeline model (#816)
3 years ago
Jiarui Fang ea0a2ed25f
[hotfix] the bug of numel() in ColoTensor (#845)
3 years ago
LuGY c1e8d2001e
modefied the pp build for ckpt adaptation (#803)
3 years ago
Jiarui Fang 8789850eea
Init Conext supports lazy allocate model memory (#842)
3 years ago
Jiarui Fang 4575a3298b
[hotfix] ColoTensor pin_memory (#840)
3 years ago
Frank Lee 01e9f834f5
[dependency] removed torchvision (#833)
3 years ago
Jiarui Fang cb5a4778e1
Revert "[WIP] Applying ColoTensor on TP-1D-row Linear. (#831)" (#835)
3 years ago
Jiarui Fang ac88de6dfc
[WIP] Applying ColoTensor on TP-1D-row Linear. (#831)
3 years ago
Jiarui Fang 595bedf767
revert zero tensors back (#829)
3 years ago
Jiarui Fang 294a6060d0
[tensor] ZeRO use ColoTensor as the base class. (#828)
3 years ago
Ziyue Jiang 8e6fdb4f29
[tensor]fix test_linear (#826)
3 years ago
Ziyue Jiang 1a9e2c2dff
[tensor] fix kwargs in colo_tensor torch_funtion (#825)
3 years ago
Jiarui Fang eb1b89908c
[refactor] moving InsertPostInitMethodToModuleSubClasses to utils. (#824)
3 years ago
Jiarui Fang 2ecc3d7a55
[tensor] lazy init (#823)
3 years ago
Jiarui Fang 68dcd51d41
[Tensor] update ColoTensor torch_function (#822)
3 years ago
Jiarui Fang 0ce8924ceb
[tensor] reorganize files (#820)
3 years ago
Jiarui Fang ab962b9735
[gemini] a new tensor structure (#818)
3 years ago
FrankLeeeee 70ed11d07e [cli] added check installation cli
3 years ago
YuliangLiu0306 c7eca40f51
Merge pull request #812 from FrankLeeeee/feature/cli
3 years ago
Jiarui Fang 3ddbd1bce1
[gemini] collect cpu-gpu moving volume in each iteration (#813)
3 years ago
FrankLeeeee d522cb704e [cli] fixed single-node process launching
3 years ago
Jiarui Fang 61c20b44bc
[log] local throughput metrics (#811)
3 years ago
ver217 dd92b90a68
[DO NOT MERGE] [zero] init fp16 params directly in ZeroInitContext (#808)
3 years ago
Jiarui Fang 227d1cd4b3
[gemini] APIs to set cpu memory capacity (#809)
3 years ago
FrankLeeeee f63e91d280 [cli] fixed a bug in user args and refactored the module structure
3 years ago
Jiarui Fang e761ad2cd7
Revert "[zero] add ZeroTensorShardStrategy (#793)" (#806)
3 years ago
HELSON 88759e289e
[zero] add ZeroTensorShardStrategy (#793)
3 years ago
Jiarui Fang 681addb512
[refactor] moving grad acc logic to engine (#804)
3 years ago
Frank Lee 05d9ae5999
[cli] add missing requirement (#805)
3 years ago
YuliangLiu0306 de2f581d43
[cli] added micro benchmarking for tp (#789)
3 years ago
YuliangLiu0306 cfadc9df8e
[cli] added distributed launcher command (#791)
3 years ago
Jiarui Fang 4d9332b4c5
[refactor] moving memtracer to gemini (#801)
3 years ago