ColossalAI/examples/tutorial/hybrid_parallel/config.py

from colossalai.amp import AMP_TYPE

# hyperparameters
# BATCH_SIZE is the batch size per GPU
# global batch size = BATCH_SIZE x data parallel size
BATCH_SIZE = 4
LEARNING_RATE = 3e-3
WEIGHT_DECAY = 0.3
NUM_EPOCHS = 2
WARMUP_EPOCHS = 1

# model config
IMG_SIZE = 224
PATCH_SIZE = 16
HIDDEN_SIZE = 128
DEPTH = 4
NUM_HEADS = 4
MLP_RATIO = 2
NUM_CLASSES = 10
CHECKPOINT = False
SEQ_LENGTH = (IMG_SIZE // PATCH_SIZE)**2 + 1    # add 1 for cls token

# parallel setting
TENSOR_PARALLEL_SIZE = 2
TENSOR_PARALLEL_MODE = '1d'

parallel = dict(
    pipeline=2,
    tensor=dict(mode=TENSOR_PARALLEL_MODE, size=TENSOR_PARALLEL_SIZE),
)

fp16 = dict(mode=AMP_TYPE.NAIVE)
clip_grad_norm = 1.0

# pipeline config
# use one micro-batch per pipeline stage
NUM_MICRO_BATCHES = parallel['pipeline']
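
The sizes implied by this config can be checked with plain arithmetic. The sketch below is not part of the tutorial: `derive_sizes` is a hypothetical helper, and it assumes that the data parallel size is the world size divided by (pipeline size x tensor size) and that each per-GPU batch is split evenly into `NUM_MICRO_BATCHES` micro-batches.

```python
# Hypothetical helper (not part of ColossalAI): derives the sizes implied by
# the config values above, under the assumptions stated in the lead-in.
def derive_sizes(world_size, batch_size=4, pipeline=2, tensor=2,
                 num_micro_batches=2, img_size=224, patch_size=16):
    model_parallel = pipeline * tensor
    if world_size % model_parallel != 0:
        raise ValueError(
            f"world_size {world_size} must be a multiple of "
            f"pipeline x tensor = {model_parallel}")
    if batch_size % num_micro_batches != 0:
        raise ValueError("batch_size must be divisible by num_micro_batches")

    # Assumption: remaining GPUs after model parallelism form data parallel replicas.
    data_parallel = world_size // model_parallel
    return {
        "data_parallel_size": data_parallel,
        # global batch size = BATCH_SIZE x data parallel size (from the config comment)
        "global_batch_size": batch_size * data_parallel,
        # Assumption: the per-GPU batch is split evenly across micro-batches.
        "micro_batch_size": batch_size // num_micro_batches,
        # same formula as SEQ_LENGTH: patches per image plus one cls token
        "seq_length": (img_size // patch_size) ** 2 + 1,
    }


if __name__ == "__main__":
    for world_size in (4, 8):
        print(world_size, derive_sizes(world_size))
```

Under these assumptions, the smallest launch that satisfies the config uses 4 GPUs (2 pipeline stages x 2 tensor parallel ranks), which leaves a data parallel size of 1.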