Commit Graph

2180 Commits (31c78f2be3272a9a4062fe78eca34b3847a0c900)
 

Author SHA1 Message Date
Jiatong (Julius) Han 8c8a39be95
[hotfix]: Remove math.prod dependency (#2837)
2 years ago
YuliangLiu0306 819e25d8b1
[hotfix] fix autoparallel compatibility test issues (#2754)
2 years ago
YuliangLiu0306 0f392d7403
[autoparallel] find repeat blocks (#2854)
2 years ago
BlueRum 2e16f842a9
[chatgpt]support opt & gpt for rm training (#2876)
2 years ago
junxu c52edcf0eb
Rename class method of ZeroDDP (#2692)
2 years ago
HELSON 6e4ac08172
[hotfix] fix chunk size can not be divided (#2867)
2 years ago
Alex_996 a4fc125c34
Fix typos (#2863)
2 years ago
dawei-wang 55424a16a5
[doc] fix GPT tutorial (#2860)
2 years ago
Boyuan Yao eae77c831d
[autoparallel] Patch meta information for nodes that will not be handled by SPMD solver (#2823)
2 years ago
Boyuan Yao c7764d3f22
[autoparallel] Patch meta information of `torch.where` (#2822)
2 years ago
Boyuan Yao fcc4097efa
[autoparallel] Patch meta information of `torch.tanh()` and `torch.nn.Dropout` (#2773)
2 years ago
BlueRum 34ca324b0d
[chatgpt] Support saving ckpt in examples (#2846)
2 years ago
Zheng Zeng 597914317b
[doc] fix typo in opt inference tutorial (#2849)
2 years ago
Frank Lee 935346430f
[cli] handled version check exceptions (#2848)
2 years ago
BlueRum 3eebc4dff7
[chatgpt] fix rm eval (#2829)
2 years ago
Frank Lee 918bc94b6b
[triton] added copyright information for flash attention (#2835)
2 years ago
Boyuan Yao 7ea6bc7f69
[autoparallel] Patch tensor related operations meta information (#2789)
2 years ago
github-actions[bot] a5721229d9
Automated submodule synchronization (#2740)
2 years ago
Haofan Wang 47ecb22387
[example] add LoRA support (#2821)
2 years ago
ver217 b6a108cb91
[chatgpt] add test checkpoint (#2797)
2 years ago
Michelle c008d4ad0c
[NFC] polish colossalai/engine/schedule/_pipeline_schedule.py code style (#2744)
2 years ago
mickogoin 58abde2857
Update README.md (#2791)
2 years ago
Marco Rodrigues 89f0017a9c
Typo (#2826)
2 years ago
Jiarui Fang bf0204604f
[exmaple] add bert and albert (#2824)
2 years ago
YuliangLiu0306 cf6409dd40
Hotfix/auto parallel zh doc (#2820)
2 years ago
YuliangLiu0306 2059fdd6b0
[hotfix] add copyright for solver and device mesh (#2803)
2 years ago
LuGY dbd0fd1522
[CI/CD] fix nightly release CD running on forked repo (#2812)
2 years ago
Boyuan Yao 8593ae1a3f
[autoparallel] rotor solver refactor (#2813)
2 years ago
binmakeswell 09f457479d
[doc] update OPT serving (#2804)
2 years ago
HELSON 56ddc9ca7a
[hotfix] add correct device for fake_param (#2796)
2 years ago
ver217 a619a190df
[chatgpt] update readme about checkpoint (#2792)
2 years ago
ver217 4ee311c026
[chatgpt] startegy add prepare method (#2766)
2 years ago
Boyuan Yao a2b43e393d
[autoparallel] Patch meta information of `torch.nn.Embedding` (#2760)
2 years ago
Boyuan Yao 8e3f66a0d1
[zero] fix wrong import (#2777)
2 years ago
Fazzie-Maqianli ba84cd80b2
fix pip install colossal (#2764)
2 years ago
Nikita Shulga 01066152f1
Don't use `torch._six` (#2775)
2 years ago
ver217 a88bc828d5
[chatgpt] disable shard init for colossalai (#2767)
2 years ago
binmakeswell d6d6dec190
[doc] update example and OPT serving link (#2769)
2 years ago
Frank Lee e376954305
[doc] add opt service doc (#2747)
2 years ago
BlueRum 613efebc5c
[chatgpt] support colossalai strategy to train rm (#2742)
2 years ago
BlueRum 648183a960
[chatgpt]fix train_rm bug with lora (#2741)
2 years ago
fastalgo b6e3b955c3
Update README.md
2 years ago
binmakeswell 30aee9c45d
[NFC] polish code format
2 years ago
YuliangLiu0306 1dc003c169
[autoparallel] distinguish different parallel strategies (#2699)
2 years ago
YH ae86a29e23
Refact method of grad store (#2687)
2 years ago
cloudhuang 43dffdaba5
[doc] fixed a typo in GPT readme (#2736)
2 years ago
binmakeswell 93b788b95a Merge branch 'main' into fix/format
2 years ago
xyupeng 2fd528b9f4
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/graph_analysis.py code style (#2737)
2 years ago
Zirui Zhu c9e3ee389e
[NFC] polish colossalai/context/process_group_initializer/initializer_2d.py code style (#2726)
2 years ago
Zangwei Zheng 1819373e5c
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/batch_norm_handler.py code style (#2728)
2 years ago