ColossalAI/examples/tutorial/large_batch_optimizer/config.py

from colossalai.amp import AMP_TYPE

# hyperparameters
# BATCH_SIZE is the per-GPU batch size
# global batch size = BATCH_SIZE x data parallel size
BATCH_SIZE = 512
LEARNING_RATE = 3e-3
WEIGHT_DECAY = 0.3
NUM_EPOCHS = 2
WARMUP_EPOCHS = 1

# model config
NUM_CLASSES = 10

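# mixed-precision training using Colossal-AI's naive AMP mode
fp16 = dict(mode=AMP_TYPE.NAIVE)
# clip gradients to a maximum norm of 1.0
clip_grad_norm = 1.0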
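
The batch-size comment in the config can be made concrete with a small sketch. It computes the global batch size and the number of warmup steps implied by `WARMUP_EPOCHS`. The data parallel size (4), the dataset size (50,000 samples), the linear warmup shape, and the helper names `global_batch_size`, `steps_per_epoch`, and `warmup_lr` are all assumptions for illustration; none of them come from the config or from Colossal-AI.

```python
import math

# Values copied from the config above.
BATCH_SIZE = 512
LEARNING_RATE = 3e-3
WARMUP_EPOCHS = 1


def global_batch_size(per_gpu_batch_size: int, data_parallel_size: int) -> int:
    """Global batch size = per-GPU batch size x data parallel size."""
    return per_gpu_batch_size * data_parallel_size


def steps_per_epoch(dataset_size: int, global_bs: int) -> int:
    """Optimizer steps per epoch, counting a final partial batch."""
    return math.ceil(dataset_size / global_bs)


def warmup_lr(step: int, warmup_steps: int, base_lr: float) -> float:
    """Linear warmup to base_lr (assumed shape; the config only names WARMUP_EPOCHS)."""
    if warmup_steps <= 0 or step >= warmup_steps:
        return base_lr
    return base_lr * (step + 1) / warmup_steps


if __name__ == "__main__":
    data_parallel_size = 4   # hypothetical: 4 GPUs in the data parallel group
    dataset_size = 50_000    # hypothetical training set size

    gbs = global_batch_size(BATCH_SIZE, data_parallel_size)
    warmup_steps = WARMUP_EPOCHS * steps_per_epoch(dataset_size, gbs)
    print(f"global batch size: {gbs}")
    print(f"warmup steps: {warmup_steps}")
    print(f"lr at step 0: {warmup_lr(0, warmup_steps, LEARNING_RATE):.2e}")
```

Because the global batch size grows with the number of data parallel workers, the same `BATCH_SIZE = 512` yields fewer optimizer steps per epoch on larger clusters, which is why the warmup length is worth recomputing per setup.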

# Commit history:
# 2022-11-11 09:08:17 +00:00  [tutorial] edited hands-on practices (#1899): add handson to ColossalAI; change names of handsons and edit sequence parallel example; edit wrong folder name; resolve conflict; delete readme
# 2023-01-11 08:27:31 +00:00  [example] updated large-batch optimizer tutorial (#2448): polish code (NUM_EPOCHS, WARMUP_EPOCHS, NUM_CLASSES)