ColossalAI/colossalai
Boyuan Yao a7cda6f57d
[fx] Add offload codegen (#1598)
* [fx] add input activation offload to codegen

* [fx] modify unit test

* [fx] remove two skips in torch11

* [fx] use all_input_nodes instead of _input_nodes
2022-09-14 15:49:06 +08:00
amp [doc] update rst and docstring (#1351) 2022-07-21 15:54:53 +08:00
auto_parallel [autoparallel] add reshape handler (#1594) 2022-09-14 10:25:45 +08:00
builder [NFC] polish colossalai/builder/__init__.py code style (#1560) 2022-09-08 22:11:04 +08:00
cli [hotfix] fix some bugs caused by size mismatch. (#1011) 2022-05-23 14:02:28 +08:00
communication [communication] add p2p_v2.py to support communication with List[Any] (#1407) 2022-08-09 11:40:04 +08:00
context [autoparallel] refactored shape consistency to remove redundancy (#1591) 2022-09-13 18:30:18 +08:00
device [tensor]add 1D device mesh (#1492) 2022-08-25 16:48:12 +08:00
engine [engin/schedule] use p2p_v2 to reconstruct pipeline_schedule (#1408) 2022-08-12 11:33:26 +08:00
fx [fx] Add offload codegen (#1598) 2022-09-14 15:49:06 +08:00
gemini [NFC] polish colossalai/gemini/update/chunkv2.py code style (#1565) 2022-09-08 22:11:04 +08:00
kernel [hotfix] fix CPUAdam kernel nullptr (#1410) 2022-08-05 19:45:45 +08:00
logging [doc] improved docstring in the logging module (#861) 2022-04-25 13:42:00 +08:00
nn [embeddings] add already_split_along_rank flag for tablewise mode (#1584) 2022-09-13 10:50:34 +08:00
pipeline [NFC] polish colossalai/pipeline/utils.py code style (#1562) 2022-09-08 22:11:04 +08:00
registry Remove duplication registry (#1078) 2022-06-08 07:47:24 +08:00
tensor [autoparallel] refactored shape consistency to remove redundancy (#1591) 2022-09-13 18:30:18 +08:00
testing [NFC] polish colossalai/testing/comparison.py code style. (#1558) 2022-09-08 22:11:04 +08:00
trainer [NFC] polish ./colossalai/trainer/hooks/_lr_scheduler_hook.py code style (#1576) 2022-09-08 22:11:04 +08:00
utils [NFC] polish utils/tensor_detector/__init__.py code style (#1573) 2022-09-08 22:11:04 +08:00
zero [NFC] polish colossalai/zero/sharded_model/reduce_scatter.py code style (#1554) 2022-09-08 22:11:04 +08:00
__init__.py [fx] provide a stable but not accurate enough version of profiler. (#1547) 2022-09-07 11:21:04 +08:00
_meta_registrations.py [fx] provide a stable but not accurate enough version of profiler. (#1547) 2022-09-07 11:21:04 +08:00
constants.py fix typo in constants (#1027) 2022-05-26 08:45:08 +08:00
core.py [Tensor] distributed view supports inter-process hybrid parallel (#1169) 2022-06-27 09:45:26 +08:00
global_variables.py [MOE] add unit test for MOE experts layout, gradient handler and kernel (#469) 2022-03-21 13:35:04 +08:00
initialize.py [hotfix] remove potential circle import (#1307) 2022-07-14 13:44:26 +08:00