Commit Graph

1587 Commits (b965585d057d7038d4a72491bb79b1efff30e6cc)
 

Author SHA1 Message Date
xyupeng b965585d05 [NFC] polish colossalai/amp/torch_amp/torch_amp.py code style (#2290)
2 years ago
Zangwei Zheng d1e5bafcd4 [NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/__init__.py code style (#2291)
2 years ago
shenggan 950685873f [NFC] polish colossalai/auto_parallel/tensor_shard/deprecated/op_handler/reshape_handler.py code style (#2292)
2 years ago
Ziheng Qin 3041014089 [NFC] polish colossalai/amp/naive_amp/grad_scaler/dynamic_grad_scaler.py code style (#2299)
2 years ago
アマデウス 49715a78f0 [NFC] polish colossalai/cli/benchmark/benchmark.py code style (#2287)
2 years ago
Zirui Zhu 1c29b173c9 [NFC] polish colossalai/auto_parallel/tensor_shard/node_handler/getitem_handler.py code style (#2289)
2 years ago
Zihao 3a02b46447
[auto-parallel] refactoring ColoTracer (#2118)
2 years ago
Jiarui Fang 32253315b4
[example] update diffusion readme with official lightning (#2304)
2 years ago
HELSON 5d3a2be3af
[amp] add gradient clipping for unit tests (#2283)
2 years ago
HELSON e00cedd181
[example] update gemini benchmark bash (#2306)
2 years ago
Frank Lee 9b765e7a69
[setup] removed the build dependency on colossalai (#2307)
2 years ago
Boyuan Yao d45695d94e
Merge pull request #2258 from hpcaitech/debug/ckpt-autoparallel
2 years ago
binmakeswell c8144223b8
[doc] update diffusion doc (#2296)
2 years ago
binmakeswell 2fac699923
[doc] update news (#2295)
2 years ago
binmakeswell 4b72b2d4d3
[doc] update news
2 years ago
Jiarui Fang 16cc8e6aa7
[builder] MOE builder (#2277)
2 years ago
Boyuan Yao b904748210
[autoparallel] bypass MetaInfo when unavailable and modify BCAST_FUNC_OP metainfo (#2293)
2 years ago
Jiarui Fang 26e171af6c
[version] 0.1.14 -> 0.2.0 (#2286)
2 years ago
Super Daniel 8ea50d999e
[hotfix] pass a parameter. (#2288)
2 years ago
ZijianYY df1d6dc553
[examples] using args and combining two versions for PaLM (#2284)
2 years ago
zbian e94c79f15b improved allgather & reducescatter for 3d
2 years ago
binmakeswell c719798abe
[doc] add feature diffusion v2, bloom, auto-parallel (#2282)
2 years ago
HELSON 62c38e3330
[zero] polish low level zero optimizer (#2275)
2 years ago
Ziyue Jiang ac863a01d6
[example] add benchmark (#2276)
2 years ago
Boyuan Yao 22e947f982
[autoparallel] fix runtime apply memory estimation (#2281)
2 years ago
BlueRum 1405b4381e
[example] fix save_load bug for dreambooth (#2280)
2 years ago
Super Daniel 8e8900ff3f
[autockpt] considering parameter and optimizer weights. (#2279)
2 years ago
YuliangLiu0306 f027ef7913
[hotfix] fix fp16 optimzier bug (#2273)
2 years ago
YuliangLiu0306 fb87322773
[autoparallel] fix spelling error (#2270)
2 years ago
Jiarui Fang af32022f74
[Gemini] fix the convert_to_torch_module bug (#2269)
2 years ago
Jiarui Fang 879df8b943
[example] GPT polish readme (#2274)
2 years ago
Ziyue Jiang 9654df0e9a
Add GPT PP Example (#2272)
2 years ago
Super Daniel b0d21d0c4f
[autockpt] linearize / merge shape-consistency nodes. (#2271)
2 years ago
YuliangLiu0306 4b29112ab2
[autoparallel] gpt2 autoparallel examples (#2267)
2 years ago
Ziyue Jiang 8b045b3c1f
[Pipeline Middleware] Reduce comm redundancy by getting accurate output (#2232)
2 years ago
HELSON 09c0102fe6
[example] fix gpt example with 0.1.10 (#2265)
2 years ago
Boyuan Yao 5c2ef9fc76
[autoparallel] modify comm nodes' memory cost in construct chain (#2263)
2 years ago
Fazzie-Maqianli 89f048a88a
[example] clear diffuser image (#2262)
2 years ago
Boyuan Yao 1ea99b869e
[autoparallel] align the data_ptr with the old version of auto activation checkpoint pipeline (#2261)
2 years ago
Super Daniel 3ccf58aa76
[autockpt] make it work. (#2257)
2 years ago
Boyuan Yao ac3739930d
[autoparallel] modify construct chain in rotor solver (#2254)
2 years ago
Boyuan Yao ab38aebace
[autoparallel] Hook all meta information on ResNet nodes for auto activation checkpoint (#2248)
2 years ago
Boyuan Yao c8c79102f0
[autoparallel] patch torch.flatten metainfo for autoparallel (#2247)
2 years ago
YuliangLiu0306 8897b8f753
[autoparallel] autoparallel initialize (#2238)
2 years ago
xcnick 85178a397a
[hotfix] fix error for torch 2.0 (#2243)
2 years ago
Super Daniel b7d0990c61
[autoparallel] fix construct meta info. (#2245)
2 years ago
Frank Lee 89542ceb44
[doc] updated the stable diffussion on docker usage (#2244)
2 years ago
Jiarui Fang 50cdf5430e
[example] diffusion install from docker (#2239)
2 years ago
Ziyue Jiang 57929a6210
fix type of num_worker_threads (#2237)
2 years ago
Jiarui Fang db4cbdc7fb
[builder] builder for scaled_upper_triang_masked_softmax (#2234)
2 years ago