Commit Graph

2324 Commits (b37797ed3d3d6af294a095397b4bc135264b8c6a)
 

Author SHA1 Message Date
Alex_996 a4fc125c34
Fix typos (#2863)
2 years ago
dawei-wang 55424a16a5
[doc] fix GPT tutorial (#2860)
2 years ago
Boyuan Yao eae77c831d
[autoparallel] Patch meta information for nodes that will not be handled by SPMD solver (#2823)
2 years ago
Boyuan Yao c7764d3f22
[autoparallel] Patch meta information of `torch.where` (#2822)
2 years ago
Boyuan Yao fcc4097efa
[autoparallel] Patch meta information of `torch.tanh()` and `torch.nn.Dropout` (#2773)
2 years ago
BlueRum 34ca324b0d
[chatgpt] Support saving ckpt in examples (#2846)
2 years ago
Zheng Zeng 597914317b
[doc] fix typo in opt inference tutorial (#2849)
2 years ago
Frank Lee 935346430f
[cli] handled version check exceptions (#2848)
2 years ago
BlueRum 3eebc4dff7
[chatgpt] fix rm eval (#2829)
2 years ago
Frank Lee 918bc94b6b
[triton] added copyright information for flash attention (#2835)
2 years ago
Boyuan Yao 7ea6bc7f69
[autoparallel] Patch tensor related operations meta information (#2789)
2 years ago
github-actions[bot] a5721229d9
Automated submodule synchronization (#2740)
2 years ago
Haofan Wang 47ecb22387
[example] add LoRA support (#2821)
2 years ago
ver217 b6a108cb91
[chatgpt] add test checkpoint (#2797)
2 years ago
Michelle c008d4ad0c
[NFC] polish colossalai/engine/schedule/_pipeline_schedule.py code style (#2744)
2 years ago
mickogoin 58abde2857
Update README.md (#2791)
2 years ago
Marco Rodrigues 89f0017a9c
Typo (#2826)
2 years ago
Jiarui Fang bf0204604f
[exmaple] add bert and albert (#2824)
2 years ago
YuliangLiu0306 cf6409dd40
Hotfix/auto parallel zh doc (#2820)
2 years ago
YuliangLiu0306 2059fdd6b0
[hotfix] add copyright for solver and device mesh (#2803)
2 years ago
LuGY dbd0fd1522
[CI/CD] fix nightly release CD running on forked repo (#2812)
2 years ago
Boyuan Yao 8593ae1a3f
[autoparallel] rotor solver refactor (#2813)
2 years ago
binmakeswell 09f457479d
[doc] update OPT serving (#2804)
2 years ago
HELSON 56ddc9ca7a
[hotfix] add correct device for fake_param (#2796)
2 years ago
ver217 a619a190df
[chatgpt] update readme about checkpoint (#2792)
2 years ago
ver217 4ee311c026
[chatgpt] startegy add prepare method (#2766)
2 years ago
Boyuan Yao a2b43e393d
[autoparallel] Patch meta information of `torch.nn.Embedding` (#2760)
2 years ago
Boyuan Yao 8e3f66a0d1
[zero] fix wrong import (#2777)
2 years ago
Fazzie-Maqianli ba84cd80b2
fix pip install colossal (#2764)
2 years ago
Nikita Shulga 01066152f1
Don't use `torch._six` (#2775)
2 years ago
ver217 a88bc828d5
[chatgpt] disable shard init for colossalai (#2767)
2 years ago
binmakeswell d6d6dec190
[doc] update example and OPT serving link (#2769)
2 years ago
Frank Lee e376954305
[doc] add opt service doc (#2747)
2 years ago
BlueRum 613efebc5c
[chatgpt] support colossalai strategy to train rm (#2742)
2 years ago
BlueRum 648183a960
[chatgpt]fix train_rm bug with lora (#2741)
2 years ago
fastalgo b6e3b955c3
Update README.md
2 years ago
binmakeswell 30aee9c45d
[NFC] polish code format
2 years ago
YuliangLiu0306 1dc003c169
[autoparallel] distinguish different parallel strategies (#2699)
2 years ago
YH ae86a29e23
Refact method of grad store (#2687)
2 years ago
cloudhuang 43dffdaba5
[doc] fixed a typo in GPT readme (#2736)
2 years ago
binmakeswell 93b788b95a Merge branch 'main' into fix/format
2 years ago
xyupeng 2fd528b9f4
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/graph_analysis.py code style (#2737)
2 years ago
Zirui Zhu c9e3ee389e
[NFC] polish colossalai/context/process_group_initializer/initializer_2d.py code style (#2726)
2 years ago
Zangwei Zheng 1819373e5c
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/batch_norm_handler.py code style (#2728)
2 years ago
Wangbo Zhao(黑色枷锁) 8331420520
[NFC] polish colossalai/cli/cli.py code style (#2734)
2 years ago
Frank Lee 5479fdd5b8
[doc] updated documentation version list (#2730)
2 years ago
binmakeswell c5be83afbf
Update version.txt (#2727)
2 years ago
ziyuhuang123 d344313533
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/embedding_handler.py code style (#2725)
2 years ago
Xue Fuzhao e81caeb4bc
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/cost_graph.py code style (#2720)
2 years ago
yuxuan-lou 51c45c2460
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/where_handler.py code style (#2723)
2 years ago