ColossalAI/tests/kit/model_zoo/transformers/vit.py

import torch
import transformers

from ..registry import ModelAttribute, model_zoo

# ===============================
# Register single-sentence VIT
# ===============================

config = transformers.ViTConfig(num_hidden_layers=4, hidden_size=128, intermediate_size=256, num_attention_heads=4)


# define data gen function
def data_gen():
    pixel_values = torch.randn(1, 3, 224, 224)
    return dict(pixel_values=pixel_values)


def data_gen_for_image_classification():
    data = data_gen()
    data["labels"] = torch.tensor([0])
    return data


def data_gen_for_masked_image_modeling():
    data = data_gen()
    num_patches = (config.image_size // config.patch_size) ** 2
    bool_masked_pos = torch.randint(low=0, high=2, size=(1, num_patches)).bool()
    data["bool_masked_pos"] = bool_masked_pos
    return data


# define output transform function
output_transform_fn = lambda x: x

# function to get the loss
loss_fn_for_vit_model = lambda x: x.pooler_output.mean()
loss_fn_for_image_classification = lambda x: x.logits.mean()
loss_fn_for_masked_image_modeling = lambda x: x.loss

# register the following models
# transformers.ViTModel,
# transformers.ViTForMaskedImageModeling,
# transformers.ViTForImageClassification,
model_zoo.register(
    name="transformers_vit",
    model_fn=lambda: transformers.ViTModel(config),
    data_gen_fn=data_gen,
    output_transform_fn=output_transform_fn,
    loss_fn=loss_fn_for_vit_model,
    model_attribute=ModelAttribute(has_control_flow=True),
)

model_zoo.register(
    name="transformers_vit_for_masked_image_modeling",
    model_fn=lambda: transformers.ViTForMaskedImageModeling(config),
    data_gen_fn=data_gen_for_masked_image_modeling,
    output_transform_fn=output_transform_fn,
    loss_fn=loss_fn_for_masked_image_modeling,
    model_attribute=ModelAttribute(has_control_flow=True),
)

model_zoo.register(
    name="transformers_vit_for_image_classification",
    model_fn=lambda: transformers.ViTForImageClassification(config),
    data_gen_fn=data_gen_for_image_classification,
    output_transform_fn=output_transform_fn,
    loss_fn=loss_fn_for_image_classification,
    model_attribute=ModelAttribute(has_control_flow=True),
)
[shardformer] support pipeline base vit model (#4284) * Feature/vit support (#4182) * [shardformer] added tests * [shardformer] vit test finish and support * fix attention dropout * support base vit pipeline * support vit downstream model * fix vit shard test * modify hidden states return type --------- Co-authored-by: Kun Lin <81014421+klhhhhh@users.noreply.github.com> 2023-07-25 07:02:29 +00:00			`import torch`
			`import transformers`

			`from ..registry import ModelAttribute, model_zoo`

			`# ===============================`
			`# Register single-sentence VIT`
			`# ===============================`

[shardformer] rewrite tests for opt/bloom/llama/vit/chatglm (#4395) * rewrite opt tests * rewrite llama tests * rewrite bloom & vit tests * rewrite chatglm tests * fix LinearCol for classfiers * add judge for other tp layers, fix lazy init in util 2023-08-11 07:43:23 +00:00			`config = transformers.ViTConfig(num_hidden_layers=4, hidden_size=128, intermediate_size=256, num_attention_heads=4)`
[shardformer] support pipeline base vit model (#4284) * Feature/vit support (#4182) * [shardformer] added tests * [shardformer] vit test finish and support * fix attention dropout * support base vit pipeline * support vit downstream model * fix vit shard test * modify hidden states return type --------- Co-authored-by: Kun Lin <81014421+klhhhhh@users.noreply.github.com> 2023-07-25 07:02:29 +00:00

			`# define data gen function`
			`def data_gen():`
			`pixel_values = torch.randn(1, 3, 224, 224)`
			`return dict(pixel_values=pixel_values)`


			`def data_gen_for_image_classification():`
			`data = data_gen()`
[misc] update pre-commit and run all files (#4752) * [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format 2023-09-19 06:20:26 +00:00			`data["labels"] = torch.tensor([0])`
[shardformer] support pipeline base vit model (#4284) * Feature/vit support (#4182) * [shardformer] added tests * [shardformer] vit test finish and support * fix attention dropout * support base vit pipeline * support vit downstream model * fix vit shard test * modify hidden states return type --------- Co-authored-by: Kun Lin <81014421+klhhhhh@users.noreply.github.com> 2023-07-25 07:02:29 +00:00			`return data`


			`def data_gen_for_masked_image_modeling():`
			`data = data_gen()`
[misc] update pre-commit and run all files (#4752) * [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format 2023-09-19 06:20:26 +00:00			`num_patches = (config.image_size // config.patch_size) ** 2`
[shardformer] support pipeline base vit model (#4284) * Feature/vit support (#4182) * [shardformer] added tests * [shardformer] vit test finish and support * fix attention dropout * support base vit pipeline * support vit downstream model * fix vit shard test * modify hidden states return type --------- Co-authored-by: Kun Lin <81014421+klhhhhh@users.noreply.github.com> 2023-07-25 07:02:29 +00:00			`bool_masked_pos = torch.randint(low=0, high=2, size=(1, num_patches)).bool()`
[misc] update pre-commit and run all files (#4752) * [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format 2023-09-19 06:20:26 +00:00			`data["bool_masked_pos"] = bool_masked_pos`
[shardformer] support pipeline base vit model (#4284) * Feature/vit support (#4182) * [shardformer] added tests * [shardformer] vit test finish and support * fix attention dropout * support base vit pipeline * support vit downstream model * fix vit shard test * modify hidden states return type --------- Co-authored-by: Kun Lin <81014421+klhhhhh@users.noreply.github.com> 2023-07-25 07:02:29 +00:00			`return data`


			`# define output transform function`
			`output_transform_fn = lambda x: x`

			`# function to get the loss`
			`loss_fn_for_vit_model = lambda x: x.pooler_output.mean()`
			`loss_fn_for_image_classification = lambda x: x.logits.mean()`
			`loss_fn_for_masked_image_modeling = lambda x: x.loss`

			`# register the following models`
			`# transformers.ViTModel,`
			`# transformers.ViTForMaskedImageModeling,`
			`# transformers.ViTForImageClassification,`
[misc] update pre-commit and run all files (#4752) * [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format 2023-09-19 06:20:26 +00:00			`model_zoo.register(`
			`name="transformers_vit",`
			`model_fn=lambda: transformers.ViTModel(config),`
			`data_gen_fn=data_gen,`
			`output_transform_fn=output_transform_fn,`
			`loss_fn=loss_fn_for_vit_model,`
			`model_attribute=ModelAttribute(has_control_flow=True),`
			`)`
[shardformer] support pipeline base vit model (#4284) * Feature/vit support (#4182) * [shardformer] added tests * [shardformer] vit test finish and support * fix attention dropout * support base vit pipeline * support vit downstream model * fix vit shard test * modify hidden states return type --------- Co-authored-by: Kun Lin <81014421+klhhhhh@users.noreply.github.com> 2023-07-25 07:02:29 +00:00
[misc] update pre-commit and run all files (#4752) * [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format 2023-09-19 06:20:26 +00:00			`model_zoo.register(`
			`name="transformers_vit_for_masked_image_modeling",`
			`model_fn=lambda: transformers.ViTForMaskedImageModeling(config),`
			`data_gen_fn=data_gen_for_masked_image_modeling,`
			`output_transform_fn=output_transform_fn,`
			`loss_fn=loss_fn_for_masked_image_modeling,`
			`model_attribute=ModelAttribute(has_control_flow=True),`
			`)`
[shardformer] support pipeline base vit model (#4284) * Feature/vit support (#4182) * [shardformer] added tests * [shardformer] vit test finish and support * fix attention dropout * support base vit pipeline * support vit downstream model * fix vit shard test * modify hidden states return type --------- Co-authored-by: Kun Lin <81014421+klhhhhh@users.noreply.github.com> 2023-07-25 07:02:29 +00:00
[misc] update pre-commit and run all files (#4752) * [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format 2023-09-19 06:20:26 +00:00			`model_zoo.register(`
			`name="transformers_vit_for_image_classification",`
			`model_fn=lambda: transformers.ViTForImageClassification(config),`
			`data_gen_fn=data_gen_for_image_classification,`
			`output_transform_fn=output_transform_fn,`
			`loss_fn=loss_fn_for_image_classification,`
			`model_attribute=ModelAttribute(has_control_flow=True),`
			`)`