2022-11-12 10:24:52 +00:00
# Multi-dimensional Parallelism with Colossal-AI
2022-11-11 09:08:17 +00:00
2022-11-12 10:18:55 +00:00
## Install Titans Model Zoo
2022-11-11 09:08:17 +00:00
```bash
2022-11-12 10:18:55 +00:00
pip install titans
2022-11-11 09:08:17 +00:00
```
## Prepare Dataset
2022-11-12 10:18:55 +00:00
We use CIFAR10 dataset in this example. You should invoke the `donwload_cifar10.py` in the tutorial root directory or directly run the `auto_parallel_with_resnet.py` .
The dataset will be downloaded to `colossalai/examples/tutorials/data` by default.
2022-11-11 09:08:17 +00:00
If you wish to use customized directory for the dataset. You can set the environment variable `DATA` via the following command.
```bash
export DATA=/path/to/data
```
## Run on 2*2 device mesh
Current configuration setting on `config.py` is TP=2, PP=2.
```bash
2022-11-12 10:18:55 +00:00
# train with cifar10
2022-11-11 09:08:17 +00:00
colossalai run --nproc_per_node 4 train.py --config config.py
2022-11-12 10:18:55 +00:00
# train with synthetic data
colossalai run --nproc_per_node 4 train.py --config config.py
```