|
# Image Synthesis |
|
|
|
Image synthesis is the base feature of DiffSynth Studio. We can generate images with very high resolution. |
|
|
|
### OmniGen |
|
|
|
OmniGen is a text-image-to-image model, you can synthesize an image according to several given reference images. |
|
|
|
|Reference image 1|Reference image 2|Synthesized image| |
|
|-|-|-| |
|
|||| |
|
|
|
### Example: FLUX |
|
|
|
Example script: [`flux_text_to_image.py`](./flux_text_to_image.py) and [`flux_text_to_image_low_vram.py`](./flux_text_to_image_low_vram.py)(low VRAM). |
|
|
|
The original version of FLUX doesn't support classifier-free guidance; however, we believe that this guidance mechanism is an important feature for synthesizing beautiful images. You can enable it using the parameter `cfg_scale`, and the extra guidance scale introduced by FLUX is `embedded_guidance`. |
|
|
|
|1024*1024 (original)|1024*1024 (classifier-free guidance)|2048*2048 (highres-fix)| |
|
|-|-|-| |
|
|||| |
|
|
|
### Example: Stable Diffusion |
|
|
|
Example script: [`sd_text_to_image.py`](./sd_text_to_image.py) |
|
|
|
LoRA Training: [`../train/stable_diffusion/`](../train/stable_diffusion/) |
|
|
|
|512*512|1024*1024|2048*2048|4096*4096| |
|
|-|-|-|-| |
|
||||| |
|
|
|
### Example: Stable Diffusion XL |
|
|
|
Example script: [`sdxl_text_to_image.py`](./sdxl_text_to_image.py) |
|
|
|
LoRA Training: [`../train/stable_diffusion_xl/`](../train/stable_diffusion_xl/) |
|
|
|
|1024*1024|2048*2048| |
|
|-|-| |
|
||| |
|
|
|
### Example: Stable Diffusion 3 |
|
|
|
Example script: [`sd3_text_to_image.py`](./sd3_text_to_image.py) |
|
|
|
LoRA Training: [`../train/stable_diffusion_3/`](../train/stable_diffusion_3/) |
|
|
|
|1024*1024|2048*2048| |
|
|-|-| |
|
||| |
|
|
|
### Example: Kolors |
|
|
|
Example script: [`kolors_text_to_image.py`](./kolors_text_to_image.py) |
|
|
|
LoRA Training: [`../train/kolors/`](../train/kolors/) |
|
|
|
|1024*1024|2048*2048| |
|
|-|-| |
|
||| |
|
|
|
Kolors also support the models trained for SD-XL. For example, ControlNets and LoRAs. See [`kolors_with_sdxl_models.py`](./kolors_with_sdxl_models.py) |
|
|
|
LoRA: https://civitai.com/models/73305/zyd232s-ink-style |
|
|
|
|Base model|with LoRA (alpha=0.5)|with LoRA (alpha=1.0)|with LoRA (alpha=1.5)| |
|
|-|-|-|-| |
|
||||| |
|
|
|
ControlNet: https://huggingface.co/xinsir/controlnet-union-sdxl-1.0 |
|
|
|
|Reference image|Depth image|with ControlNet|with ControlNet| |
|
|-|-|-|-| |
|
||||| |
|
|
|
### Example: Hunyuan-DiT |
|
|
|
Example script: [`hunyuan_dit_text_to_image.py`](./hunyuan_dit_text_to_image.py) |
|
|
|
LoRA Training: [`../train/hunyuan_dit/`](../train/hunyuan_dit/) |
|
|
|
|1024*1024|2048*2048| |
|
|-|-| |
|
||| |
|
|
|
### Example: Stable Diffusion XL Turbo |
|
|
|
Example script: [`sdxl_turbo.py`](./sdxl_turbo.py) |
|
|
|
We highly recommend you to use this model in the WebUI. |
|
|
|
|"black car"|"red car"| |
|
|-|-| |
|
||| |
|
|