Update README.md
Browse files
README.md
CHANGED
|
@@ -12,21 +12,20 @@ license_link: LICENSE
|
|
| 12 |
<img src="ad3.jpg" alt="00205_" />
|
| 13 |
</div>
|
| 14 |
|
| 15 |
-
Bokeh 3.5 Medium is
|
| 16 |
|
| 17 |
This model is released under the Stability Community License.
|
| 18 |
For more details, visit [Tensor.Art](https://tensor.art) or [TusiArt](https://tusiart.com) to explore additional resources and useful information.
|
| 19 |
|
| 20 |
## Overview
|
| 21 |
|
| 22 |
-
-
|
| 23 |
-
-
|
| 24 |
-
- **Short Captions:** Focus on core image
|
| 25 |
-
- **Long Captions:** Provide broader scene
|
| 26 |
- **Recommended Resolutions:**
|
| 27 |
`1920x1024`, `1728x1152`, `1152x1728`, `1280x1664`, `1440x1440`
|
| 28 |
-
- **
|
| 29 |
-
- **Supports LoRA fine-tuning.**
|
| 30 |
|
| 31 |
## Advantages
|
| 32 |
|
|
@@ -61,20 +60,9 @@ For more details, visit [Tensor.Art](https://tensor.art) or [TusiArt](https://tu
|
|
| 61 |
- **Main subject** (e.g., `"Close-up of a macaw"`)
|
| 62 |
- **Detailed features** (e.g., `"vivid feathers, sharp beak"`)
|
| 63 |
- **Background environment** (e.g., `"dimly lit environment"`)
|
| 64 |
-
- **Atmospheric description** (e.g., `"soft warm lighting, cinematic mood"`)
|
| 65 |
-
|
| 66 |
-
### Best Practices:
|
| 67 |
-
- **Avoid overly complex prompts**, as the model already has strong text encoding. Overloading details can cause **T5 hallucination artifacts**, reducing image quality.
|
| 68 |
-
- **Do not use excessively short prompts** (e.g., single words or 2-3 tokens) unless combined with **LoRA or Image2Image (i2i)** techniques.
|
| 69 |
-
- **Avoid mixing too many unrelated concepts**, as this can lead to visual distortions and unwanted artifacts.
|
| 70 |
- **Optimal token length:** **30-70 tokens**.
|
| 71 |
|
| 72 |
-
### Negative Prompting
|
| 73 |
-
- **Negative prompts strongly influence image quality.**
|
| 74 |
-
- Ensure they **do not contradict the main subject** to avoid degrading the output.
|
| 75 |
-
|
| 76 |
-
|
| 77 |
-
|
| 78 |
## Example Output
|
| 79 |
Using diffusers:
|
| 80 |
```python
|
|
@@ -106,15 +94,6 @@ For **LoRA fine-tuning**, the following tools and settings are recommended:
|
|
| 106 |
- **Kohya_ss:** [GitHub Repository](https://github.com/bmaltais/kohya_ss.git)
|
| 107 |
- **Simple Tuner:** [GitHub Repository](https://github.com/bghira/SimpleTuner)
|
| 108 |
|
| 109 |
-
### ⚙️ Suggested Training Settings
|
| 110 |
-
```bash
|
| 111 |
-
--Resolution 1440x1440
|
| 112 |
-
--t5xxl_max_token_length 154
|
| 113 |
-
--optimizer_type AdamW8bit
|
| 114 |
-
--mmdit_lr 1e-4
|
| 115 |
-
--text_encoder_lr 5e-5
|
| 116 |
-
```
|
| 117 |
-
|
| 118 |
## Contact
|
| 119 |
* Website: https://tensor.art https://tusiart.com
|
| 120 |
* Developed by: TensorArt
|
|
|
|
| 12 |
<img src="ad3.jpg" alt="00205_" />
|
| 13 |
</div>
|
| 14 |
|
| 15 |
+
Bokeh 3.5 Medium is based on **Stable Diffusion 3.5 Medium** as its foundation model, using a 5M high-resolution open-source dataset that underwent rigorous quality and **aesthetic screening** for post-training, ensuring **excellent image quality**, **high fidelity of natural images**, preservation of fine **details**, and enhanced **controllability**.
|
| 16 |
|
| 17 |
This model is released under the Stability Community License.
|
| 18 |
For more details, visit [Tensor.Art](https://tensor.art) or [TusiArt](https://tusiart.com) to explore additional resources and useful information.
|
| 19 |
|
| 20 |
## Overview
|
| 21 |
|
| 22 |
+
- Continued training on **SD3.5M**, utilizing carefully curated high-resolution training data to achieve excellent image quality.
|
| 23 |
+
- Trained with mixed short/long natural language captions.
|
| 24 |
+
- **Short Captions:** Focus on the core subject content of the image.
|
| 25 |
+
- **Long Captions:** Provide broader descriptions of the scene environment and atmosphere.
|
| 26 |
- **Recommended Resolutions:**
|
| 27 |
`1920x1024`, `1728x1152`, `1152x1728`, `1280x1664`, `1440x1440`
|
| 28 |
+
- Powerful customized **fine-tuning performance** that can be widely used for **downstream production tasks**.
|
|
|
|
| 29 |
|
| 30 |
## Advantages
|
| 31 |
|
|
|
|
| 60 |
- **Main subject** (e.g., `"Close-up of a macaw"`)
|
| 61 |
- **Detailed features** (e.g., `"vivid feathers, sharp beak"`)
|
| 62 |
- **Background environment** (e.g., `"dimly lit environment"`)
|
| 63 |
+
- **Atmospheric description** (e.g., `"soft warm lighting, cinematic mood"`)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 64 |
- **Optimal token length:** **30-70 tokens**.
|
| 65 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 66 |
## Example Output
|
| 67 |
Using diffusers:
|
| 68 |
```python
|
|
|
|
| 94 |
- **Kohya_ss:** [GitHub Repository](https://github.com/bmaltais/kohya_ss.git)
|
| 95 |
- **Simple Tuner:** [GitHub Repository](https://github.com/bghira/SimpleTuner)
|
| 96 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 97 |
## Contact
|
| 98 |
* Website: https://tensor.art https://tusiart.com
|
| 99 |
* Developed by: TensorArt
|