# ColoDiffusion: Stable Diffusion with Colossal-AI
Acceleration of AIGC (AI-Generated Content) models such as [Stable Diffusion v1](https://github.com/CompVis/stable-diffusion) and [Stable Diffusion v2](https://github.com/Stability-AI/stablediffusion).
<p id="diffusion_train" align="center">
<img src="https://raw.githubusercontent.com/hpcaitech/public_assets/main/colossalai/img/Stable%20Diffusion%20v2.png" width=800/>
</p>
- [Training](https://github.com/hpcaitech/ColossalAI/tree/main/examples/images/diffusion): Reduce Stable Diffusion memory consumption by up to 5.6x and hardware cost by up to 46x (from A100 to RTX3060).
<p id="diffusion_demo" align="center">
<img src="https://raw.githubusercontent.com/hpcaitech/public_assets/main/colossalai/img/DreamBooth.png" width=800/>
</p>
- [DreamBooth Fine-tuning](https://github.com/hpcaitech/ColossalAI/tree/main/examples/images/dreambooth): Personalize your model using just 3-5 images of the desired subject.
<p id="inference" align="center">
<img src="https://raw.githubusercontent.com/hpcaitech/public_assets/main/colossalai/img/Stable%20Diffusion%20Inference.jpg" width=800/>
</p>
- [Inference](https://github.com/hpcaitech/ColossalAI/tree/main/examples/images/diffusion): Reduce inference GPU memory consumption by 2.5x.
More details can be found in our blog posts on [Stable Diffusion v1](https://www.hpc-ai.tech/blog/diffusion-pretraining-and-hardware-fine-tuning-can-be-almost-7x-cheaper) and [Stable Diffusion v2](https://www.hpc-ai.tech/blog/colossal-ai-0-2-0).
## Roadmap
This project is under rapid development.
- [X] Train a Stable Diffusion v1/v2 model from scratch
- [X] Fine-tune a pretrained Stable Diffusion v1 model
- [X] Run inference on a pretrained model with PyTorch
- [ ] Fine-tune a pretrained Stable Diffusion v2 model
- [ ] Run inference on a pretrained model with TensorRT
## Installation
### Option #1: Install from source
#### Step 1: Requirements
A suitable [conda](https://conda.io/) environment named `ldm` can be created
and activated with:
```
conda env create -f environment.yaml
conda activate ldm
```
You can also update an existing [latent diffusion](https://github.com/CompVis/latent-diffusion) environment by running:
```
conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch
pip install transformers==4.19.2 diffusers invisible-watermark
pip install -e .
```
#### Step 2: Install Lightning
Install a Lightning version newer than 2022.01.04. We suggest installing Lightning from source:
https://github.com/Lightning-AI/lightning.git
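A minimal sketch of installing from source with pip (this pulls the latest `main` branch; pin a specific commit or tag if you need reproducibility):
```
pip install git+https://github.com/Lightning-AI/lightning.git
```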
#### Step 3: Install [Colossal-AI](https://colossalai.org/download/) from our official website
For example, you can install v0.1.12:
```
pip install colossalai==0.1.12+torch1.12cu11.3 -f https://release.colossalai.org
```
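As a quick sanity check (assuming the package exposes a `__version__` attribute), you can verify the installation with:
```
python -c "import colossalai; print(colossalai.__version__)"
```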
### Option #2: Use Docker
To use the Stable Diffusion Docker image, you can either build it with the provided [Dockerfile](./docker/Dockerfile) or pull an image from our Docker Hub.
```
# 1. build from dockerfile
cd docker
docker build -t hpcaitech/diffusion:0.2.0 .
# 2. pull from our docker hub
docker pull hpcaitech/diffusion:0.2.0
```
Once the image is ready, you can launch a container with the following command:
```bash
########################
# On Your Host Machine #
########################
# make sure you run the container from the repository root directory
cd Colossal-AI
# run the docker container
docker run --rm \
-it --gpus all \
-v $PWD:/workspace \
-v <your-data-dir>:/data/scratch \
-v <hf-cache-dir>:/root/.cache/huggingface \
hpcaitech/diffusion:0.2.0 \
/bin/bash
########################
# Inside the Container #
########################
# Once you have entered the docker container, go to the stable diffusion directory for training
cd examples/images/diffusion/
# start training with colossalai
bash train_colossalai.sh
```
It is important to configure your volume mapping correctly to get the best training experience.
1. **Mandatory**: mount your prepared data to `/data/scratch` via `-v <your-data-dir>:/data/scratch`, replacing `<your-data-dir>` with the actual data path on your machine.
2. **Recommended**: store the downloaded model weights on your host machine instead of in the container via `-v <hf-cache-dir>:/root/.cache/huggingface`, replacing `<hf-cache-dir>` with the actual path. This way, you don't have to re-download the pretrained weights for every `docker run`.
3. **Optional**: if you encounter a problem stating that shared memory is insufficient inside the container, add `-v /dev/shm:/dev/shm` to your `docker run` command. A combined example follows this list.
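Putting the above together, a full `docker run` invocation might look like the following sketch; `/path/to/your/data` and `/path/to/hf/cache` are placeholder host paths you must replace:
```bash
# run the container with the data, HF cache, and shared-memory mounts
docker run --rm -it --gpus all \
    -v $PWD:/workspace \
    -v /path/to/your/data:/data/scratch \
    -v /path/to/hf/cache:/root/.cache/huggingface \
    -v /dev/shm:/dev/shm \
    hpcaitech/diffusion:0.2.0 \
    /bin/bash
```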
## Download pretrained model checkpoints
### stable-diffusion-v1-4
Our default model config uses the weights from [CompVis/stable-diffusion-v1-4](https://huggingface.co/CompVis/stable-diffusion-v1-4):
```
git lfs install
git clone https://huggingface.co/CompVis/stable-diffusion-v1-4
```
### stable-diffusion-v1-5 from runwayml
If you want to use the latest [stable-diffusion-v1-5](https://huggingface.co/runwayml/stable-diffusion-v1-5) weights from runwayml:
```
git lfs install
git clone https://huggingface.co/runwayml/stable-diffusion-v1-5
```
## Dataset
The dataset comes from [LAION-5B](https://laion.ai/blog/laion-5b/), released by [LAION](https://laion.ai/).
You should change `data.file_path` in `configs/train_colossalai.yaml` to point to your local copy.
## Training
We provide the script `train_colossalai.sh` to run the training task with Colossal-AI.
You can also use `train_ddp.sh` to run the training task with DDP for comparison.
In `train_colossalai.sh` the main command is:
```
python main.py --logdir /tmp/ -t -b configs/train_colossalai.yaml
```
- You can change `--logdir` to decide where to save the log information and the final checkpoint.
### Training config
You can change the training config in the yaml file:
- devices: the number of devices used for training, default 8
- max_epochs: the maximum number of training epochs, default 2
- precision: the precision type used in training, default 16 (fp16); you must use fp16 if you want to apply Colossal-AI
- More information about the configuration of ColossalAIStrategy can be found [here](https://pytorch-lightning.readthedocs.io/en/latest/advanced/model_parallel.html#colossal-ai)
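If this repo's `main.py` keeps the upstream [latent diffusion](https://github.com/CompVis/latent-diffusion) convention of forwarding unknown command-line arguments to `OmegaConf.from_dotlist`, you could override these fields without editing the yaml; both the override mechanism and the `lightning.trainer.*` key paths are assumptions to verify against your checkout:
```
# hypothetical dot-list overrides inherited from CompVis/latent-diffusion's main.py;
# confirm that your main.py merges unknown args into the config before relying on this
python main.py --logdir /tmp/ -t -b configs/train_colossalai.yaml \
    lightning.trainer.devices=4 lightning.trainer.max_epochs=2
```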
## Fine-tuning Example (Work in Progress)
### Training on the Teyvat Dataset
We provide a fine-tuning example on the [Teyvat](https://huggingface.co/datasets/Fazzie/Teyvat) dataset, whose captions were generated by BLIP.
2022-11-11 09:22:54 +00:00
2022-12-12 09:35:23 +00:00
You can run it with the config `configs/Teyvat/train_colossalai_teyvat.yaml`:
```
python main.py --logdir /tmp/ -t -b configs/Teyvat/train_colossalai_teyvat.yaml
```
## Inference
You can find the trained `last.ckpt` and the training config `project.yaml` in your `--logdir`, and run inference with:
```
python scripts/txt2img.py --prompt "a photograph of an astronaut riding a horse" --plms \
    --outdir ./output \
    --config /path/to/logdir/configs/project.yaml \
    --ckpt /path/to/logdir/checkpoints/last.ckpt
```
```commandline
usage: txt2img.py [-h] [--prompt [PROMPT]] [--outdir [OUTDIR]] [--skip_grid] [--skip_save] [--ddim_steps DDIM_STEPS] [--plms] [--laion400m] [--fixed_code] [--ddim_eta DDIM_ETA]
[--n_iter N_ITER] [--H H] [--W W] [--C C] [--f F] [--n_samples N_SAMPLES] [--n_rows N_ROWS] [--scale SCALE] [--from-file FROM_FILE] [--config CONFIG] [--ckpt CKPT]
[--seed SEED] [--precision {full,autocast}]
optional arguments:
-h, --help show this help message and exit
--prompt [PROMPT] the prompt to render
--outdir [OUTDIR] dir to write results to
--skip_grid do not save a grid, only individual samples. Helpful when evaluating lots of samples
--skip_save do not save individual samples. For speed measurements.
--ddim_steps DDIM_STEPS
number of ddim sampling steps
--plms use plms sampling
--laion400m uses the LAION400M model
--fixed_code if enabled, uses the same starting code across samples
--ddim_eta DDIM_ETA  ddim eta (eta=0.0 corresponds to deterministic sampling)
--n_iter N_ITER sample this often
--H H image height, in pixel space
--W W image width, in pixel space
--C C latent channels
--f F downsampling factor
--n_samples N_SAMPLES
how many samples to produce for each given prompt. A.k.a. batch size
--n_rows N_ROWS rows in the grid (default: n_samples)
--scale SCALE unconditional guidance scale: eps = eps(x, empty) + scale * (eps(x, cond) - eps(x, empty))
--from-file FROM_FILE
if specified, load prompts from this file
--config CONFIG path to config which constructs model
--ckpt CKPT path to checkpoint of model
--seed SEED the seed (for reproducible sampling)
--use_int8 whether to use quantization method
--precision {full,autocast}
evaluate at this precision
```
## Comments
- Our codebase for the diffusion models builds heavily on [OpenAI's ADM codebase](https://github.com/openai/guided-diffusion), [lucidrains](https://github.com/lucidrains/denoising-diffusion-pytorch), [Stable Diffusion](https://github.com/CompVis/stable-diffusion), [Lightning](https://github.com/Lightning-AI/lightning) and [Hugging Face](https://huggingface.co/CompVis/stable-diffusion). Thanks for open-sourcing!
- The implementation of the transformer encoder is from [x-transformers](https://github.com/lucidrains/x-transformers) by [lucidrains](https://github.com/lucidrains?tab=repositories).
- The implementation of [flash attention](https://github.com/HazyResearch/flash-attention) is from [HazyResearch](https://github.com/HazyResearch).
## BibTeX
```
@article{bian2021colossal,
title={Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training},
author={Bian, Zhengda and Liu, Hongxin and Wang, Boxiang and Huang, Haichen and Li, Yongbin and Wang, Chuanrui and Cui, Fan and You, Yang},
journal={arXiv preprint arXiv:2110.14883},
year={2021}
}
@misc{rombach2021highresolution,
title={High-Resolution Image Synthesis with Latent Diffusion Models},
author={Robin Rombach and Andreas Blattmann and Dominik Lorenz and Patrick Esser and Björn Ommer},
year={2021},
eprint={2112.10752},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
@article{dao2022flashattention,
title={FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness},
author={Dao, Tri and Fu, Daniel Y. and Ermon, Stefano and Rudra, Atri and R{\'e}, Christopher},
journal={arXiv preprint arXiv:2205.14135},
year={2022}
}
```