license: apache-2.0
base_model: Qwen/Qwen-Image
language:
  - en
  - zh
pipeline_tag: text-to-image
library_name: diffusers
widget:
  - text: >-
      A close-up portrait of a dog with black, brown, and white fur, a white
      stripe on its forehead, and brown and black markings on its ears, is
      looking directly at the camera with a serious expression. The dog has
      brown eyes with black pupils and a black nose, and its ears are large and
      pointed. The background is blurred and appears to be an outdoor setting
      with green and brown grass and a light grey sky.
    output:
      url: examples/Qwen-Image_00133_.png
  - text: >-
      Close-up food photo of a hybrid snail composed entirely of glossy sticky
      cinnamon buns. The shell is made from a puffy perfectly swirled cinnamon
      bun covered in a thick glossy white glaze. Baked edges with a jagged
      cinnamon bun texture slightly caramelized, dark cinnamon filling inside,
      rich golden brown color. The glaze drips down in thick sweet drops, the
      snail tendrils are made of twisted cinnamon dough, glistening with icing
      sugar, the glaze reflects warm, natural light. The scene is shot in a
      soft, fuzzy kitchen setting, with a hint of freshly baked pastries in the
      background.
    output:
      url: examples/Qwen-Image_00134_.png
  - text: >-
      8-bit pixel art of a pidgeon wearing a lab coat, and a tie. The background
      is large computer server room. The lighting is dark, with most light
      hitting the servers and not the pidgeon.
    output:
      url: examples/Qwen-Image_00136_.png
  - text: >-
      A long tunnel with a high ceiling is seen dimly lit, illuminated by a
      single fluorescent light fixture at the end of the tunnel. The tunnel
      walls are made of corrugated metal and are lined with copper pipes. On the
      left wall, there is a yellow warning sign with a black exclamation mark
      and the text "WARNING - MILITARY TESTING" in black letters. To the right
      of the warning sign, on the right wall, is a green control panel with
      various knobs and switches, and a black and yellow warning tape is
      attached to the control panel. The floor is dark and wet, reflecting the
      light from the fluorescent light. A metal grate is visible on the floor.
    output:
      url: examples/Qwen-Image_00137_.png
tags:
  - Qwen-Image
  - distillation
  - LoRA
  - merge

A 50/50 merge of the 4-step and 8-step LoRAs.
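The card does not say how the merge was performed. As a minimal sketch, assuming "50/50" means an equal-weight sum of the two LoRAs' weight deltas, one way to get that result exactly is to concatenate the low-rank factors along the rank axis. The function names, the matrix layout, and the toy values below are hypothetical, and any per-layer alpha scaling is ignored.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora_pair(down_a, up_a, down_b, up_b, weight_a=0.5, weight_b=0.5):
    """Combine two LoRA factor pairs for one layer into a single pair.

    Assumed layout: down is rank x in_features, up is out_features x rank.
    The returned pair satisfies
        up @ down == weight_a * (up_a @ down_a) + weight_b * (up_b @ down_b)
    """
    # Stack the down matrices along the rank axis (more rows).
    down = [row[:] for row in down_a] + [row[:] for row in down_b]
    # Concatenate the up matrices along the rank axis (more columns),
    # folding each merge weight into its own block.
    up = [
        [weight_a * x for x in row_a] + [weight_b * x for x in row_b]
        for row_a, row_b in zip(up_a, up_b)
    ]
    return down, up


# Tiny made-up example: two rank-1 LoRAs on a 2x2 layer.
down_4step, up_4step = [[1.0, 2.0]], [[1.0], [3.0]]
down_8step, up_8step = [[0.0, 4.0]], [[2.0], [2.0]]

down, up = merge_lora_pair(down_4step, up_4step, down_8step, up_8step)
merged_delta = matmul(up, down)
```

Averaging the `up` and `down` matrices element-wise is a different operation: the product of the averaged factors is generally not the average of the two deltas, which is why this sketch concatenates ranks instead.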


My recommended settings

  • LoRA Strength: 0.9 (or possibly even lower)
  • Steps: 16
  • Sampler: DEIS
  • Scheduler: KL Optimal
  • Shift: None (I removed the node, since it made no difference after I switched to the KL Optimal scheduler.)
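For readers unfamiliar with the scheduler named above, here is a rough sketch of the arctangent interpolation commonly described as the "KL optimal" noise schedule. It has not been checked against any particular sampler implementation, and the function name and the `sigma_min` / `sigma_max` values are illustrative assumptions, not values taken from this model.

```python
import math


def kl_optimal_sigmas(n_steps, sigma_min, sigma_max):
    """Return n_steps + 1 noise levels, from sigma_max down to a final 0.0.

    Interpolates linearly in arctan(sigma) space, then maps back with tan.
    """
    if n_steps < 2:
        raise ValueError("n_steps must be at least 2")
    angle_min = math.atan(sigma_min)
    angle_max = math.atan(sigma_max)
    sigmas = []
    for i in range(n_steps):
        t = i / (n_steps - 1)  # 0.0 at the first step, 1.0 at the last
        sigmas.append(math.tan(t * angle_min + (1.0 - t) * angle_max))
    sigmas.append(0.0)  # trailing zero marks the fully denoised image
    return sigmas


# 16 steps, matching the recommended step count; the sigma bounds are made up.
schedule = kl_optimal_sigmas(16, sigma_min=0.02, sigma_max=1.0)
```

The schedule depends only on the step count and the two sigma bounds.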

Reason for making

The 4-step LoRA does fairly well at 4 steps, but it overcooks the image at anything above 4 steps, and even at 4 steps the result feels a little overcooked.

4-step LoRA | 4 steps vs. 8 steps vs. 16 steps | [1536x1536, no shift, LoRA strength 1, deis, kl_optimal, seed 187]

The 8-step LoRA, on the other hand, is very undercooked at 4 steps and still a little undercooked at 8 steps. It handles higher step counts like 16 really well, though the result still feels a little undercooked overall.

8-step LoRA | 4 steps vs. 8 steps vs. 16 steps | [1536x1536, no shift, LoRA strength 1, deis, kl_optimal, seed 187]

Merging the two makes it possible to run 16 steps without overcooking or undercooking. The result feels just about right, especially if you load the LoRA at 90% strength.
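To make "90% strength" concrete, here is a minimal sketch of how a LoRA strength multiplier is usually applied: the LoRA's weight delta is scaled before being added to the base weight. The function name and the toy numbers are hypothetical, and real loaders operate on tensors per layer rather than nested lists.

```python
def apply_lora(base_weight, lora_delta, strength=0.9):
    """Return base_weight + strength * lora_delta, element by element."""
    return [
        [w + strength * d for w, d in zip(row_w, row_d)]
        for row_w, row_d in zip(base_weight, lora_delta)
    ]


# Made-up 1x2 layer: strength scales how far the weights move from the base.
base = [[1.0, 2.0]]
delta = [[10.0, -10.0]]

full = apply_lora(base, delta, strength=1.0)
reduced = apply_lora(base, delta, strength=0.9)
```

Because the scaling is linear, loading an equal-weight merge at 0.9 would correspond to roughly a 0.45 contribution from each source LoRA, assuming the merge is a plain weighted sum of deltas.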

Merged LoRA | 4 steps vs. 8 steps vs. 16 steps | [1536x1536, no shift, LoRA strength 1, deis, kl_optimal, seed 187]