1235 Commits (e8a9bebc8770b9430f4150a400e6fef43cf02d4f)

Author SHA1 Message Date
ver217 c9e8ce67b8
fix move fp32 shards (#1604) 2 years ago
YuliangLiu0306 eac1b79371
[autoparallel] add bcast op handler (#1600) 2 years ago
Frank Lee 3abf98a633
[autoparallel] added all non-bcast matmul strategies (#1603) 2 years ago
Frank Lee db98b695b2
[autoparallel] added strategy generator and bmm strategies (#1602) 2 years ago
Jiarui Fang a19eb80998
[embedding] updates some default parameters 2 years ago
Super Daniel cd5cf2bcc9
[fx/tuning] tune performance on rotor with meta info. (#1599) 2 years ago
Boyuan Yao a7cda6f57d
[fx] Add offload codegen (#1598) 2 years ago
Super Daniel c8e9b2ad78
[hotfix/rotor] fix variable names (#1597) 2 years ago
YuliangLiu0306 faa23b9d9a
[autoparallel] add reshape handler (#1594) 2 years ago
github-actions[bot] c938dda028
Automated submodule synchronization (#1596) 2 years ago
Super Daniel 5c494d4540
[fx] provide an accurate estimation of memory. (#1587) 2 years ago
Frank Lee 27fe8af60c
[autoparallel] refactored shape consistency to remove redundancy (#1591) 2 years ago
YuliangLiu0306 d164449d00
[autoparallel] add resnet autoparallel unit test and add backward weight communication cost (#1589) 2 years ago
Frank Lee 7c18a588c8
[autoparallel] added generate_sharding_spec to utils (#1590) 2 years ago
Boyuan Yao 49ccf8b5f8
[fx] Improve linearize and rotor solver (#1586) 2 years ago
Frank Lee 219f66c571
[autoparallel] added solver option dataclass (#1588) 2 years ago
YuliangLiu0306 82d4376c23
[autoparallel] adapt solver with resnet (#1583) 2 years ago
CsRic f3403ff98e
[embeddings] add already_split_along_rank flag for tablewise mode (#1584) 2 years ago
github-actions[bot] 77399dc91b
Automated submodule synchronization (#1550) 2 years ago
Boyuan Yao f3687e4ee2
[fx] Add nested checkpoint in activation checkpoint codegen (#1585) 2 years ago
binmakeswell 1c9ec32734
[NFC] add OPT serving (#1581) 2 years ago
Boyuan Yao 20e466527b
[NFC] polish ./colossalai/trainer/hooks/_lr_scheduler_hook.py code style (#1576) 2 years ago
Fazzie-Maqianli 06dccdde44
[NFC] polish colossalai/zero/sharded_model/reduce_scatter.py code style (#1554) 2 years ago
CsRic 2ac46f7be4
[NFC] polish utils/tensor_detector/__init__.py code style (#1573) 2 years ago
Sze-qq 2144cbae8c
[NFC] polish colossalai/nn/lr_scheduler/multistep.py code style (#1572) 2 years ago
superhao1995 e4bf7ae667
[NFC] polish colossalai/nn/lr_scheduler/torch.py code style (#1571) 2 years ago
Jiatong Han 3263cdf57f
[NFC] polish colossalai/nn/parallel/data_parallel.py code style (#1570) 2 years ago
Zirui Zhu f566c9b98d
[NFC] polish colossalai/pipeline/utils.py code style (#1562) 2 years ago
Xue Fuzhao e070ca45c6
[NFC] polish colossalai/fx/tracer/meta_patch/patched_module/convolution.py code style (#1563) 2 years ago
Zangwei Zheng 9823cbf24b
[NFC] polish colossalai/gemini/update/chunkv2.py code style (#1565) 2 years ago
DouJS f586887a90
[NFC] polish colossalai/nn/layer/colossalai_layer/dropout.py code style (#1568) 2 years ago
LuGY c7d4932956
[NFC] polish colossalai/utils/tensor_detector/tensor_detector.py code style (#1566) 2 years ago
BigOneLiXiaoMing 0c4c9aa6e0
[NFC] polish colossalai/nn/_ops/embedding.py code style (#1561) 2 years ago
Ziheng Qin 08815f0e72
[NFC] polish colossalai/builder/__init__.py code style (#1560) 2 years ago
Super Daniel 8328917348
[NFC] polish colossalai/testing/comparison.py code style. (#1558) 2 years ago
Ofey Chan 7cc052f6c0
[NFC] polish colossalai/nn/layer/colossalai_layer/linear.py (#1556) 2 years ago
Kai Wang (Victor Kai) 46931e3c32
[NFC] polish code colossalai/gemini/update/search_utils.py (#1557) 2 years ago
yuxuan-lou 413f9c19f4
[NFC] polish colossalai/nn/_ops/layernorm.py code style (#1555) 2 years ago
shenggan 8edb777cc2
[NFC] polish colossalai/nn/loss/loss_2p5d.py code style (#1553) 2 years ago
Maruyama_Aya bd2d789832
[NFC] polish colossalai/nn/_ops/embedding_bag.py code style (#1552) 2 years ago
binmakeswell 73e9eb13b7
[NFC] polish colossalai/nn/lr_scheduler/cosine.py code style 2 years ago
Kirigaya Kazuto 318fbf1145
[NFC] polish colossalai/utils/multi_tensor_apply/multi_tensor_apply.py code style (#1559) 2 years ago
ver217 b0f4c0bddf
update version (#1574) 2 years ago
CsRic a389ac4ec9
[embedding] cache_embedding small improvement (#1564) 2 years ago
ver217 10dd8226b1
add gather_output for VocabParallelClassifier1D (#1569) 2 years ago
アマデウス e615cfc3a8
[NFC] polish test component gpt code style (#1567) 2 years ago
Kirigaya Kazuto 6159d45417
[pipeline/tuning] improve dispatch performance both time and space cost (#1544) 2 years ago
Super Daniel 4f59693207
[fx] provide a stable but not accurate enough version of profiler. (#1547) 2 years ago
github-actions[bot] 7d49e7b2db
Automated submodule synchronization (#1534) 2 years ago
YuliangLiu0306 0908d0fc61
[autoparallel]add backward cost info into strategies (#1524) 2 years ago