NUM_GPU=8
MODEL="mistralai/Mixtral-8x7B-v0.1"
SEQ_LENGTH=2048
BATCH_SIZE=1
LR=0.00001

# hybrid plugin: pipeline + expert parallelism with ZeRO-1
# single-node alternative launcher:
# torchrun --standalone --nproc_per_node $NUM_GPU \
colossalai run --nproc_per_node $NUM_GPU --hostfile "hostfile" \
    train.py \
    --num_epoch 1 \
    --model_name $MODEL \
    --plugin "hybrid" \
    --batch_size $BATCH_SIZE \
    --lr $LR \
    --zero_stage 1 \
    --pp_size 2 \
    --dp_size 1 \
    --ep_size 8
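For reference, below is a minimal sketch of what the "hybrid" plugin path inside train.py might look like when wired through ColossalAI's booster API. The MoeHybridParallelPlugin name and its argument set are assumptions here (exact constructor signatures vary across ColossalAI versions), so treat this as an illustration of how the launcher flags map to plugin arguments, not as the example's actual train.py.

# sketch.py -- hypothetical wiring of the flags above (not the repo's train.py)
import colossalai
from colossalai.booster import Booster
from colossalai.booster.plugin import MoeHybridParallelPlugin

# colossalai run / torchrun set the rank and world-size environment
# variables that launch_from_torch reads.
colossalai.launch_from_torch()

# Assumed mapping of the CLI flags to plugin arguments:
plugin = MoeHybridParallelPlugin(
    tp_size=1,         # no tensor parallelism in this run
    pp_size=2,         # --pp_size: model split into two pipeline stages
    ep_size=8,         # --ep_size: Mixtral experts sharded across all 8 GPUs
    zero_stage=1,      # --zero_stage: ZeRO-1 optimizer-state sharding
    microbatch_size=1, # matches BATCH_SIZE=1 per device
)
booster = Booster(plugin=plugin)
# model, optimizer, criterion, and dataloader would then be wrapped via
# booster.boost(...) before the training loop runs.

With 8 GPUs, pp_size=2, and ep_size=8, the pipeline and expert groups overlap rather than multiply, which is why --dp_size stays at 1; the example's train.py is the authoritative source for how these groups are actually composed.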