ColossalAI

History

digger-yu b9a8dff7e5 [doc] Fix typo under colossalai and doc(#3618 ) * Fixed several spelling errors under colossalai * Fix the spelling error in colossalai and docs directory * Cautious Changed the spelling error under the example folder * Update runtime_preparation_pass.py revert autograft to autograd * Update search_chunk.py utile to until * Update check_installation.py change misteach to mismatch in line 91 * Update 1D_tensor_parallel.md revert to perceptron * Update 2D_tensor_parallel.md revert to perceptron in line 73 * Update 2p5D_tensor_parallel.md revert to perceptron in line 71 * Update 3D_tensor_parallel.md revert to perceptron in line 80 * Update README.md revert to resnet in line 42 * Update reorder_graph.py revert to indice in line 7 * Update p2p.py revert to megatron in line 94 * Update initialize.py revert to torchrun in line 198 * Update routers.py change to detailed in line 63 * Update routers.py change to detailed in line 146 * Update README.md revert random number in line 402		2023-04-26 11:38:43 +08:00
..
README.md	[example] simplify opt example (#2344 )	2023-01-06 10:08:41 +08:00
benchmark.sh	[example] simplify opt example (#2344 )	2023-01-06 10:08:41 +08:00
requirements.txt	[example] fix requirements (#2488 )	2023-01-17 13:07:25 +08:00
run_gemini.sh	support shardinit option to avoid OPT OOM initializing problem (#3037 )	2023-03-08 13:45:15 +08:00
test_ci.sh	[CI] add test_ci.sh for palm, opt and gpt (#2475 )	2023-01-16 14:44:29 +08:00
train_gemini_opt.py	[doc] Fix typo under colossalai and doc(#3618 )	2023-04-26 11:38:43 +08:00

README.md

OPT

Meta recently released Open Pretrained Transformer (OPT), a 175-Billion parameter AI language model, which stimulates AI programmers to perform various downstream tasks and application deployments.

The following example of Colossal-AI demonstrates fine-tuning Casual Language Modelling at low cost.

We are using the pre-training weights of the OPT model provided by Hugging Face Hub on the raw WikiText-2 (no tokens were replaced before the tokenization). This training script is adapted from the HuggingFace Language Modelling examples.

Our Modifications

We adapt the OPT training code to ColossalAI by leveraging Gemini and ZeRO DDP.

Quick Start

You can launch training by using the following bash script

bash ./run_gemini.sh