Commit Graph

  • fcf426de3d fixed bug in activation checkpointing test FrankLeeeee 2022-03-11 06:33:08 +0000
  • 73f03c444d Merge branch 'develop' into tensor_detect LuGY 2022-03-11 14:31:57 +0800
  • 1b319c6818 add offload unittest for ShardedOptimV2. jiaruifang 2022-03-11 14:27:39 +0800
  • b84f388177 added tensor detector lclgy 2022-03-11 14:23:06 +0800
  • ffe80359d3 Added activation offload (#331) LuGY 2022-03-11 10:08:10 +0800
  • da80d1dcff find a bug in sharded optim unittest jiaruifang 2022-03-11 14:15:08 +0800
  • 4b3072953b Merge branch 'develop' of https://github.com/hpcaitech/ColossalAI into develop jiaruifang 2022-03-11 14:14:18 +0800
  • 86a3c96856 update README and images path (#384) binmakeswell 2022-03-11 13:53:38 +0800
  • 8633409abf fix format (#379) ScalableEKNN 2022-03-10 18:35:41 +0800
  • f3a05d0134 fix format (#376) Jiang Zhuo 2022-03-10 17:15:59 +0800
  • 392ea27841 fix format (#374) lucasliunju 2022-03-10 16:12:51 +0800
  • 5814c6aecd [unit test] Refactored test cases with component func (#339) Frank Lee 2022-03-11 14:09:09 +0800
  • ac3eb8c086 update README and images path (#384) binmakeswell 2022-03-11 13:53:38 +0800
  • 692fcc056d update README and images path binmakeswell 2022-03-11 13:30:23 +0800
  • 101c437df0 Merge branch 'develop' of https://github.com/hpcaitech/ColossalAI into develop jiaruifang 2022-03-11 13:09:10 +0800
  • 743b7cb7f5 fixed bug FrankLeeeee 2022-03-11 04:10:33 +0000
  • f5f374effb polish code jiaruifang 2022-03-11 12:07:02 +0800
  • a0be2a446f memstats collector jiaruifang 2022-03-11 12:01:24 +0800
  • 3f6bd5c22b refactored test with component func FrankLeeeee 2022-03-09 02:49:53 +0000
  • 2eaf654d9e Added activation offload (#331) LuGY 2022-03-11 10:08:10 +0800
  • 08919002b7 Merge branch 'develop' of https://github.com/hpcaitech/ColossalAI into develop jiaruifang 2022-03-10 23:00:37 +0800
  • 71d5d15db8 [bug] shard param during initializing the ShardedModelV2 (#381) Jiarui Fang 2022-03-10 19:28:03 +0800
  • 24391c709b hotfix a bug. shard param using ShardedModelV2 jiaruifang 2022-03-10 19:23:36 +0800
  • de594139b6 Merge branch 'develop' of https://github.com/hpcaitech/ColossalAI into develop jiaruifang 2022-03-10 19:23:06 +0800
  • 4364c685d4 fix format (#379) ScalableEKNN 2022-03-10 18:35:41 +0800
  • aef23bbc4c fix format ScalableEKNN2021 2022-03-10 18:32:52 +0800
  • 3bb5c05607 [profiler] Fixed bugs in CommProfiler and PcieProfiler (#377) HELSON 2022-03-10 17:54:55 +0800
  • dee30d36c1 [zero] find miss code (#378) Jiarui Fang 2022-03-10 17:51:50 +0800
  • 2142e701ac Merge branch 'jiaruifang/bucket_tensor_copy' into develop jiaruifang 2022-03-10 17:50:30 +0800
  • edad17c970 Fixed bugs in CommProfiler and PcieProfiler 1SAA 2022-03-10 17:42:26 +0800
  • 1bff4b685a fix format (#376) Jiang Zhuo 2022-03-10 17:15:59 +0800
  • a6230bb722 fix format 姜卓 2022-03-10 17:04:02 +0800
  • 0d4892d1a1 [zero] zero init context collect numel of model (#375) Jiarui Fang 2022-03-10 16:31:02 +0800
  • 2349560a79 zero init context collect numel of model jiaruifang 2022-03-10 16:25:25 +0800
  • 00ab6a107a Added PCIE profiler to dectect data transmission (#373) HELSON 2022-03-10 16:24:57 +0800
  • 0d5cdb1d1a Added PCIE profiler to dectect data transmission 1SAA 2022-03-10 14:10:33 +0800
  • b34060b5dc fix format (#374) lucasliunju 2022-03-10 16:12:51 +0800
  • 1dbdbe03ad fix format “lucasliunju” 2022-03-10 16:09:27 +0800
  • 6aa203e634 Merge pull request #372 from hpcaitech/fix/format Jiarui Fang 2022-03-10 15:41:38 +0800
  • 97d942b772 Revert "[zero] bucketized tensor cpu gpu copy (#368)" jiaruifang 2022-03-10 15:39:09 +0800
  • 5f667a687f [polish] fix format (#370) binmakeswell 2022-03-10 15:35:06 +0800
  • eb2beab341 add TODO jiaruifang 2022-03-10 15:30:04 +0800
  • 9ac90bda00 add bucket tensor copy to OptimV2 jiaruifang 2022-03-10 15:21:07 +0800
  • 3e587217c9 Update README-zh-Hans.md (#367) Xue Fuzhao 2022-03-10 13:49:50 +0800
  • b7a7d4c994 Fix/format (#366) Shen Chenhui 2022-03-10 13:32:56 +0800
  • 99ae7a36ee fix format (#364) Ziheng Qin 2022-03-10 12:12:42 +0800
  • de01b0777c flake8 style change (#363) RichardoLuo 2022-03-10 11:47:51 +0800
  • 14b89485c4 fix format (#362) Kai Wang (Victor Kai) 2022-03-10 11:33:21 +0800
  • dd6ef62c67 fix format parallel_context.py (#359) ziyu huang 2022-03-10 09:29:32 +0800
  • bdfe8e55bc fix format constants.py (#358) Zangwei 2022-03-09 23:35:41 +0800
  • d25ca86723 fix format parallel_2p5d (#357) Yuer867 2022-03-09 21:42:30 +0800
  • e3ee7b621e flake8 style (#352) Liang Bowen 2022-03-09 17:34:43 +0800
  • 01fd91e859 Fix/format colossalai/engine/paramhooks/(#350) Xu Kai 2022-03-09 17:28:17 +0800
  • 42f4cd5411 fix format ColossalAI\colossalai\context\process_group_initializer Maruyama_Aya 2022-03-09 16:23:33 +0800
  • 7f1a485660 Flake8 code restyle yuxuan-lou 2022-03-09 15:17:01 +0800
  • 2463bc10d6 fix format setup.py (#343) xyupeng 2022-03-09 15:11:35 +0800
  • 05582588b0 Qifan formated file ColossalAI\colossalai\nn\layer\parallel_1d\layers.py (#342) xuqifan897 2022-03-08 22:45:27 -0800
  • 423d232e49 fix format (#332) Cautiousss 2022-03-09 10:35:05 +0800
  • 9187edd4f1 fix format for dir-[parallel_3d] (#333) DouJS 2022-03-09 10:31:43 +0800
  • afd442cb2e [formart] format fixed for kernel\cuda_native codes (#335) ExtremeViscent 2022-03-09 01:44:20 +0000
  • bef05489b6 [zero] bucketized tensor cpu gpu copy (#368) Jiarui Fang 2022-03-10 14:41:08 +0800
  • 2706055ddb bucketzed cpu gpu tensor transter jiaruifang 2022-03-10 14:26:13 +0800
  • 2f95df5934 [zero] able to place params on cpu after zero init context (#365) Jiarui Fang 2022-03-10 14:08:58 +0800
  • c75b06b9dc Fixed the import bug, used the pytest lclgy 2022-03-10 14:04:44 +0800
  • 18ca0936f0 Update README-zh-Hans.md Xue Fuzhao 2022-03-10 13:47:38 +0800
  • d70d2fad1f Fix/format (#366) Shen Chenhui 2022-03-10 13:32:56 +0800
  • 1a8d5eb173 Merge branch 'hpcaitech:fix/format' into fix/format Shen Chenhui 2022-03-10 13:30:06 +0800
  • 0a56998b2b fix format Shen Chenhui 2022-03-10 13:24:13 +0800
  • 55a358fdf1 fix format Shen Chenhui 2022-03-10 12:39:26 +0800
  • 0815f42ea5 fix format (#364) Ziheng Qin 2022-03-10 12:12:42 +0800
  • 20911f03e1 polish code jiaruifang 2022-03-10 12:09:40 +0800
  • 72a717f4de increase the timeout limit in CI temporarily ver217 2022-03-10 11:48:20 +0800
  • b4ad1e988a increase the timeout limit in CI temporarily ver217 2022-03-10 11:00:31 +0800
  • 49196b8276 fix grad shape ver217 2022-03-09 18:03:39 +0800
  • 60004043de place params on cpu after zero init context jiaruifang 2022-03-10 12:05:16 +0800
  • 01dc334c53 fix format Ziheng Qin 2022-03-10 12:04:29 +0800
  • b7c2fa3c95 increase the timeout limit in CI temporarily ver217 2022-03-10 11:48:20 +0800
  • c6acfd1e63 flake8 style change (#363) RichardoLuo 2022-03-10 11:47:51 +0800
  • 3c2ac2ec90 flake8 style change RichardoLuo 2022-03-10 11:43:40 +0800
  • 038f17eb37 fix format (#362) Kai Wang (Victor Kai) 2022-03-10 11:33:21 +0800
  • 55ad8a41c0 fix format kai.wang960112@gmail.com 2022-03-10 11:31:53 +0800
  • 7650177713 [zero] global model data memory tracer (#360) Jiarui Fang 2022-03-10 11:20:04 +0800
  • 1d96afc115 polish code jiaruifang 2022-03-10 11:01:59 +0800
  • 80d47e8ce7 increase the timeout limit in CI temporarily ver217 2022-03-10 11:00:31 +0800
  • c60627ff46 Merge branch 'develop' of https://github.com/hpcaitech/ColossalAI into jiaruifang/global_modeldata_tracer jiaruifang 2022-03-10 10:55:45 +0800
  • 7c12e472fb polish code jiaruifang 2022-03-10 10:54:36 +0800
  • 4cbdd7330f add global model data memory usage tracer. jiaruifang 2022-03-10 10:51:18 +0800
  • 632ef18347 fix grad shape ver217 2022-03-09 18:03:39 +0800
  • 56f3d80961 [test] polish zero related unitest (#351) Jiarui Fang 2022-03-10 09:57:26 +0800
  • fb36f81826 fix format parallel_context.py (#359) ziyu huang 2022-03-10 09:29:32 +0800
  • e2a880e465 shrink unitest elapse jiaruifang 2022-03-10 09:27:39 +0800
  • 68ab02f47d fix format C:\Users\20247\source\repos\CUDA 11.3 Runtime1\CUDA 11.3 Runtime1\ColossalAI\colossalai\context\parallel_context.py huangziyu 2022-03-10 09:24:27 +0800
  • b329788c67 fix format constants.py (#358) Zangwei 2022-03-09 23:35:41 +0800
  • ffca091b1e fix format constants.py zhengzangw 2022-03-09 23:31:12 +0800
  • 34a2b5333d fix format parallel_2p5d (#357) Yuer867 2022-03-09 21:42:30 +0800
  • 9f13907e17 fix format parallel_2p5d Yuer867 2022-03-09 21:31:57 +0800
  • 4ac58ac898 Fixed import bug for no-tensorboard environment (#354) HELSON 2022-03-09 19:48:04 +0800
  • c03b2c37ba add interface of MemProfiler Jie Zhu 2022-03-09 14:02:38 +0800
  • 42d1035bfe modify `Engine` to support dynamically add/remove ophooks Jie Zhu 2022-03-09 13:57:54 +0800
  • 4e1b51585a fix merge conflict Jie Zhu 2022-03-08 16:07:22 +0800