Frank Lee
85e045b063
[doc] updated readme regarding pypi installation ( #2406 )
2 years ago
Frank Lee
d3f5ce9efb
[workflow] added nightly release to pypi ( #2403 )
2 years ago
HELSON
498b5ca993
[hotfix] fix gpt gemini example ( #2404 )
...
* [hotfix] fix gpt gemini example
* [example] add new assertions
2 years ago
eric8607242
9880fd2cd8
Fix state_dict key missing issue of the ZeroDDP ( #2363 )
...
* Fix state_dict output for ZeroDDP duplicated parameters
* Rewrite state_dict based on get_static_torch_model
* Modify get_static_torch_model to be compatible with the lower version (ZeroDDP)
2 years ago
Frank Lee
ce08661eb1
[cli] updated installation check cli for aot/jit build ( #2395 )
2 years ago
Frank Lee
2add870138
[workflow] added missing file change detection output ( #2387 )
2 years ago
Jiarui Fang
4898ff8af4
[doc] hotfix #2377
2 years ago
jiaruifang
b2e0d502b8
[doc] hotfix #2377
2 years ago
Jiarui Fang
58b013060f
[hotfix] issue #2388
2 years ago
jiaruifang
69d9180c4b
[hotfix] issue #2388
2 years ago
Jiarui Fang
4e96039649
[device] find best logical mesh
2 years ago
Jiarui Fang
8f72b6f8fb
[hotfix] fix implement error in diffusers
2 years ago
Frank Lee
40d376c566
[setup] support pre-build and jit-build of cuda kernels ( #2374 )
...
* [setup] support pre-build and jit-build of cuda kernels
* polish code
* polish code
* polish code
* polish code
* polish code
* polish code
2 years ago
1SAA
33f3023e19
[hotfix] fix implement error in diffusers
2 years ago
Jiarui Fang
12c8bf38d7
[Pipeline] Refine GPT PP Example
2 years ago
Jiarui Fang
c3d9e23277
[builder] correct readme ( #2375 )
...
* [example] add google doc for benchmark results of GPT
* add tencet doc
* [example] gpt, shard init on all processes
* polish comments
* polish code
* [builder] update readme
2 years ago
Ziyue Jiang
ad00894f7f
polish
2 years ago
binmakeswell
a881d6d000
Revert "[NFC] polish code format" ( #2372 )
2 years ago
Ziyue Jiang
9ae9e74017
fix diff device in some partition
2 years ago
Jiarui Fang
0dcc410f57
[NFC] polish code format
2 years ago
binmakeswell
d634eae05b
Revert "[NFC] polish code format ( #2367 )" ( #2371 )
...
This reverts commit 1f8ab6f1f5
.
2 years ago
Shawn-Kong
d42aecdda1
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/embedding_handler.py code style ( #2368 )
2 years ago
Jiarui Fang
1aaeb596c6
[example] gpt, shard init on all processes ( #2366 )
2 years ago
binmakeswell
1f8ab6f1f5
[NFC] polish code format ( #2367 )
2 years ago
Ziyue Jiang
3a15b20421
Move GPT PP Example
2 years ago
ExtremeViscent
ac0d30fe2e
[NFC] polish batch_norm_handler.py code style ( #2359 )
2 years ago
HELSON
48d33b1b17
[gemini] add get static torch model ( #2356 )
2 years ago
Fazzie-Maqianli
7a332b1734
Merge pull request #2338 from haofanwang/patch-1
...
Fix a typo in train_dreambooth_colossalai.py
2 years ago
YuliangLiu0306
8b1e0dfd80
[example] upload auto parallel gpt2 demo ( #2354 )
2 years ago
Jiarui Fang
00a9c781fd
[example] add google doc for benchmark results of GPT ( #2355 )
2 years ago
Jiarui Fang
509a87f3ff
[example] make gpt example directory more clear ( #2353 )
2 years ago
Arsmart1
7027540d3d
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/_utils.py code style ( #2352 )
2 years ago
Ikko Eltociear Ashimine
5e4bced0a3
[NFC] Update roberta/README.md ( #2350 )
2 years ago
Jiarui Fang
35e22be2f6
[example] simplify opt example ( #2344 )
2 years ago
ziyuhuang123
7080a8edb0
[workflow]New version: Create workflow files for examples' auto check ( #2298 )
...
* [workflows]bug_repair
* [workflow]new_pr_fixing_bugs
Co-authored-by: binmakeswell <binmakeswell@gmail.com>
2 years ago
binmakeswell
d7352bef2c
[example] add example requirement ( #2345 )
2 years ago
LuGY
e11a005c02
[NFC] polish colossalai/auto_parallel/tensor_shard/utils/factory.py code style ( #2349 )
2 years ago
Haofan Wang
7ce965c7cc
Update requirement_colossalai.txt ( #2348 )
2 years ago
ZijianYY
f7fd592bf4
[examples]adding tp to PaLM ( #2319 )
2 years ago
YuliangLiu0306
b5a3a4a65f
[device] find best logical mesh
2 years ago
yuxuan-lou
28e2d16794
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/graph_analysis.py code style ( #2340 )
2 years ago
YuliangLiu0306
9c9246c0d9
[device] alpha beta profiler ( #2311 )
...
* [device] alpha beta profiler
* add usage
* fix variable name
2 years ago
Maruyama_Aya
bd12a49e2a
[NFC] polish <colossalai/auto_parallel/tensor_shard/deprecated/constants.py> code style ( #2339 )
2 years ago
Haofan Wang
9edd0aa75e
Update train_dreambooth_colossalai.py
...
accelerator.num_processes -> gpc.get_world_size(ParallelMode.DATA)
2 years ago
Frank Lee
f1bc2418c4
[setup] make cuda extension build optional ( #2336 )
...
* [setup] make cuda extension build optional
* polish code
* polish code
* polish code
2 years ago
Frank Lee
8711310cda
[setup] remove torch dependency ( #2333 )
2 years ago
Zihao
35427bcab4
[NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/unary_elementwise_handler.py code style ( #2326 )
2 years ago
Fazzie-Maqianli
89f26331e9
[example] diffusion update diffusion,Dreamblooth ( #2329 )
2 years ago
Frank Lee
6e34cc0830
[workflow] fixed pypi release workflow error ( #2328 )
2 years ago
Frank Lee
2916eed34a
[workflow] fixed pypi release workflow error ( #2327 )
2 years ago