ColossalAI/docs/source/en
Wenhao Chen 7172459e74
[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088)
* [shardformer] implement policy for all GPT-J models and test

* [shardformer] support interleaved pipeline parallel for bert finetune

* [shardformer] shardformer support falcon (#4883)

* [shardformer]: fix interleaved pipeline for bert model (#5048)

* [hotfix]: disable seq parallel for gptj and falcon, and polish code (#5093)

* Add Mistral support for Shardformer (#5103)

* [shardformer] add tests to mistral (#5105)

---------

Co-authored-by: Pengtai Xu <henryxu880@gmail.com>
Co-authored-by: ppt0011 <143150326+ppt0011@users.noreply.github.com>
Co-authored-by: flybird11111 <1829166702@qq.com>
Co-authored-by: eric8607242 <e0928021388@gmail.com>
2023-11-28 16:54:42 +08:00
..
Colossal-Auto [doc] Fix typo under colossalai and doc(#3618) 2023-04-26 11:38:43 +08:00
advanced_tutorials [doc] update advanced tutorials, training gpt with hybrid parallelism (#4866) 2023-10-10 08:18:55 +00:00
basics [nfc] fix typo in docs/ (#4972) 2023-11-21 22:06:20 +08:00
concepts [doc]update moe chinese document. (#3890) 2023-06-05 15:57:54 +08:00
features [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 2023-11-28 16:54:42 +08:00
get_started [doc] update and revise some typos and errs in docs (#4107) 2023-06-28 19:30:37 +08:00
sidebar_category_translation.json [dooc] fixed the sidebar itemm key (#2672) 2023-02-13 10:45:16 +08:00