ColossalAI/examples/language
botbw 2fc85abf43
[gemini] async grad chunk reduce (all-reduce&reduce-scatter) (#5713)
* [gemini] async grad chunk reduce (all-reduce&reduce-scatter)

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [gemini] add test

* [gemini] rename func

* [gemini] update llama benchmark

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [gemini] use tensor counter

* [gemini] change default config in GeminiPlugin and GeminiDDP

* [chore] typo

* [gemini] fix sync issue & add test cases

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-05-24 10:31:16 +08:00
..
bert [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
commons [example] make gpt example directory more clear (#2353) 2023-01-06 11:11:26 +08:00
gpt fix missing pad token (#5690) 2024-05-06 18:17:26 +08:00
grok-1 [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
llama [gemini] async grad chunk reduce (all-reduce&reduce-scatter) (#5713) 2024-05-24 10:31:16 +08:00
openmoe [hotfix] fix OpenMOE example import path (#5697) 2024-05-08 15:48:47 +08:00
opt [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
palm [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
__init__.py [example]add gpt2 benchmark example script. (#5295) 2024-03-04 16:18:13 +08:00
data_utils.py [devops] remove post commit ci (#5566) 2024-04-08 15:09:40 +08:00
model_utils.py [example]add gpt2 benchmark example script. (#5295) 2024-03-04 16:18:13 +08:00
performance_evaluator.py [example]add gpt2 benchmark example script. (#5295) 2024-03-04 16:18:13 +08:00