FoolPlayer
|
4021b9a8a2
|
[shardformer] add gpt2 test and layer class refactor (#4041)
* add gpt2 test and layer class refactor
* add dropout in gpt2 policy
|
2023-07-04 16:05:01 +08:00 |
FoolPlayer
|
45927d5527
|
[shardformer] Add dropout layer in shard model and refactor policy api (#3949)
* add dist dropout in model
* update docstring and bert policy with dropout
* refactor basepolicy and sharded, update bert
* update format
* update gpt2 policy
* update bert policy
* remove unused code
* update readme for new policy usage
|
2023-07-04 16:05:01 +08:00 |
FoolPlayer
|
79f8d5d54b
|
[shardformer] add gpt2 policy and modify shard and slicer to support (#3883)
* add gpt2 policy and modify shard and slicer to support
* remove unused code
* polish code
|
2023-07-04 16:05:01 +08:00 |
Frank Lee
|
ddcf58cacf
|
Revert "[sync] sync feature/shardformer with develop"
|
2023-06-09 09:41:27 +08:00 |
FoolPlayer
|
ef1537759c
|
[shardformer] add gpt2 policy and modify shard and slicer to support (#3883)
* add gpt2 policy and modify shard and slicer to support
* remove unused code
* polish code
|
2023-06-08 15:01:34 +08:00 |