mirror of https://github.com/hpcaitech/ColossalAI
f7774ec0f3
* add dist dropout in model * update docstring and bert policy with dropout * refactor basepolicy and sharded, update bert * update format * update gpt2 policy * update bert policy * remove unused code * update readme for new policy usage * add downstream model of bert * remove unused code |
||
---|---|---|
.. | ||
test_shard_bert.py | ||
test_shard_llama.py | ||
test_shard_t5.py |