ColossalAI/colossalai/shardformer/policies
Hongxin Liu 890774b2fb [shardformer] support lazy init (#4202)
* [shardformer] support lazy init

* [shardformer] linear support lazy init

* [shardformer] embedding support lazy init

* [shardformer] norm support lazy init

* [shardformer] fused linear support lazy init

* [test] update shardformer test layer

* [test] shardformer with lazy init fit ddp

* [lazy] hotfix deepcopy of param

* [shardformer] fix bert policy and update test

* [shardformer] fix bloom policy and update test

* [shardformer] fix opt policy and update test

* [shardformer] fix t5 policy and update test

* [shardformer] fix gpt2 policy and update test

* [shardformer] fix llama policy and update test
2023-08-15 23:25:14 +08:00
..
__init__.py [shardformer] init shardformer code structure (#3731) 2023-07-04 16:05:01 +08:00
auto_policy.py [pipeline] move bert related pipeline components to shardformer (#4187) 2023-08-15 23:25:14 +08:00
base_policy.py [pipeline] move bert related pipeline components to shardformer (#4187) 2023-08-15 23:25:14 +08:00
bert.py [shardformer] support lazy init (#4202) 2023-08-15 23:25:14 +08:00
bloom.py [shardformer] support lazy init (#4202) 2023-08-15 23:25:14 +08:00
gpt2.py [shardformer] support lazy init (#4202) 2023-08-15 23:25:14 +08:00
llama.py [shardformer] support lazy init (#4202) 2023-08-15 23:25:14 +08:00
opt.py [shardformer] support lazy init (#4202) 2023-08-15 23:25:14 +08:00
t5.py [shardformer] support lazy init (#4202) 2023-08-15 23:25:14 +08:00
vit.py [shardformer] rename policy file name 2023-08-15 23:25:14 +08:00