ColossalAI/docs/model.md

# Define your own parallel model

Let's say that you have a huge MLP model with billions of parameters and its extremely large hidden layer size makes it
impossible to fit into a single GPU directly. Don't worry, ColossalAI is here to help you sort things out. With the help of ColossalAI, 
you can write your model in the familiar way in which you used to write models for a single GPU, while ColossalAI automatically 
splits your model weights and fit them perfectly into a set of GPUs. We give a simple example showing how to write a simple 
2D parallel model in the Colossal-AI context.

## Write a simple 2D parallel model

```python
from colossalai.nn import Linear2D
import torch.nn as nn

class MLP_2D(nn.Module):

    def __init__(self):
        super().__init__()
        self.linear_1 = Linear2D(in_features=1024, out_features=16384)
        self.linear_2 = Linear2D(in_features=16384, out_features=1024)

    def forward(self, x):
        x = self.linear_1(x)
        x = self.linear_2(x)
        return x
```

## Use pre-defined model

For the sake of your convenience, we kindly provide you in our Model Zoo with some prevalent models such as *BERT*, *VIT*, 
and *MLP-Mixer*. Feel free to customize them into different sizes to fit into your special needs.
Migrated project 2021-10-28 16:21:23 +00:00			`# Define your own parallel model`

added Chinese documents and fixed some typos in English documents 2021-11-02 15:01:13 +00:00			`Let's say that you have a huge MLP model with billions of parameters and its extremely large hidden layer size makes it`
fixed some typos in the documents, added blog link and paper author information in README 2021-11-03 08:07:28 +00:00			`impossible to fit into a single GPU directly. Don't worry, ColossalAI is here to help you sort things out. With the help of ColossalAI,`
added Chinese documents and fixed some typos in English documents 2021-11-02 15:01:13 +00:00			`you can write your model in the familiar way in which you used to write models for a single GPU, while ColossalAI automatically`
			`splits your model weights and fit them perfectly into a set of GPUs. We give a simple example showing how to write a simple`
fixed some typos in the documents, added blog link and paper author information in README 2021-11-03 08:07:28 +00:00			`2D parallel model in the Colossal-AI context.`
Migrated project 2021-10-28 16:21:23 +00:00
added Chinese documents and fixed some typos in English documents 2021-11-02 15:01:13 +00:00			`## Write a simple 2D parallel model`
Migrated project 2021-10-28 16:21:23 +00:00
			```python
			`from colossalai.nn import Linear2D`
			`import torch.nn as nn`

			`class MLP_2D(nn.Module):`

			`def __init__(self):`
			`super().__init__()`
			`self.linear_1 = Linear2D(in_features=1024, out_features=16384)`
			`self.linear_2 = Linear2D(in_features=16384, out_features=1024)`

			`def forward(self, x):`
			`x = self.linear_1(x)`
			`x = self.linear_2(x)`
			`return x`
			```

			`## Use pre-defined model`
added Chinese documents and fixed some typos in English documents 2021-11-02 15:01:13 +00:00
			`For the sake of your convenience, we kindly provide you in our Model Zoo with some prevalent models such as BERT, VIT,`
			`and MLP-Mixer. Feel free to customize them into different sizes to fit into your special needs.`