ColossalAI/colossalai/shardformer/shard
flybird11111 5e16bf7980
[shardformer] fix gathering output when using tensor parallelism (#5431)
* fix

* padding vocab_size when using pipeline parallellism

padding vocab_size when using pipeline parallellism

fix

fix

* fix

* fix

fix

fix

* fix gather output

* fix

* fix

* fix

fix resize embedding

fix resize embedding

* fix resize embedding

fix

* revert

* revert

* revert
2024-03-18 15:55:11 +08:00
..
__init__.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
shard_config.py [shardformer] fix gathering output when using tensor parallelism (#5431) 2024-03-18 15:55:11 +08:00
sharder.py [nfc] fix typo colossalai/shardformer/ (#5133) 2024-01-04 16:21:55 +08:00
shardformer.py [example]add gpt2 benchmark example script. (#5295) 2024-03-04 16:18:13 +08:00
utils.py