Create README.md
README.md
ADDED
@@ -0,0 +1,53 @@
| 1 |
+
---
|
| 2 |
+
language:
|
| 3 |
+
- en
|
| 4 |
+
license_name: stabilityai-ai-community
|
| 5 |
+
license_link: LICENSE.md
|
| 6 |
+
library_name: diffusers
|
| 7 |
+
pipeline_tag: text-to-image
|
| 8 |
+
tags:
|
| 9 |
+
- text-to-image
|
| 10 |
+
base_model:
|
| 11 |
+
- suzushi/miso-diffusion-m-1.0
|
| 12 |
+
- stabilityai/stable-diffusion-3.5-medium
|
| 13 |
+
---
|
| 14 |
+
<div style="display: flex; justify-content: center; gap: 20px; margin-bottom: 20px;">
|
| 15 |
+
<img src="demo1.png" width="400" />
|
| 16 |
+
<img src="demo2.png" width="400" />
|
| 17 |
+
</div>
|
| 18 |
+
# Anime SD3.5 Medium Model
|
| 19 |
+
An attempt to fine-tune SD3.5 Medium for anime-style image generation.
|
| 20 |
+
## Version History
|
| 21 |
+
|
| 22 |
+
| Version | Base Training | Aesthetic Training | Total Epochs |
|
| 23 |
+
|---------|--------------|-------------------|--------------|
|
| 24 |
+
| alpha | 250K images | 0 images | 1 |
|
| 25 |
+
| beta | 160K images | 0 images | 3 |
|
| 26 |
+
| 1.0 | 600K images | 0 images | 2 + (3 from beta) |
|
| 27 |
+
| 1.1 | 710K images | 0 images | 5 |
|
| 28 |
+
| 2.0 | 1.08M images | 0 images | 5 |
|
| 29 |
+
|
| 30 |
+
## Training Methodology
|
| 31 |
+
|
| 32 |
+
Training is done on a GH200 with 96 GB of VRAM. Now that prior training shows
|
| 33 |
+
decent results, I am slightly increasing the learning rate.
|
| 34 |
+
|
| 35 |
+
Training settings: Adafactor with a batch size of 40, lr_scheduler: cosine
|
| 36 |
+
SD3.5-specific settings:
|
| 37 |
+
enable_scaled_pos_embed = true
|
| 38 |
+
|
| 39 |
+
pos_emb_random_crop_rate = 0.2
|
| 40 |
+
|
| 41 |
+
weighting_scheme = "flow"
|
| 42 |
+
learning_rate = 8e-6
|
| 43 |
+
|
| 44 |
+
learning_rate_te1 = 5e-6
|
| 45 |
+
|
| 46 |
+
learning_rate_te2 = 5e-6
|
| 47 |
+
|
| 48 |
+
Train CLIP: true, Train T5-XXL: false
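
To give a rough sense of what these settings imply, here is a minimal sketch of the step count and the shape of a cosine learning-rate schedule. It uses the batch size of 40, `learning_rate = 8e-6`, and the 2.0 row of the version table (1.08M images, 5 epochs). The helper names `steps_per_epoch` and `cosine_lr` are hypothetical, and the sketch assumes 5 full passes over the 1.08M images, no gradient accumulation, no warmup, and decay to zero. The actual trainer may behave differently on each of these points.

```python
import math

# Values quoted in this README.
BATCH_SIZE = 40
PEAK_LR = 8e-6          # learning_rate
NUM_IMAGES = 1_080_000  # version 2.0 row of the table
NUM_EPOCHS = 5          # version 2.0 row (assumed: 5 full passes)


def steps_per_epoch(num_images: int, batch_size: int) -> int:
    """Optimizer steps per epoch, assuming no gradient accumulation."""
    return math.ceil(num_images / batch_size)


def cosine_lr(step: int, total_steps: int, peak_lr: float) -> float:
    """Plain cosine decay from peak_lr to 0 (assumed: no warmup, no restarts)."""
    progress = min(step, total_steps) / total_steps
    return peak_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


total_steps = steps_per_epoch(NUM_IMAGES, BATCH_SIZE) * NUM_EPOCHS

# Print the learning rate at the start, midpoint, and end of training.
for step in (0, total_steps // 2, total_steps):
    lr = cosine_lr(step, total_steps, PEAK_LR)
    print(f"step {step:>6}: lr = {lr:.2e}")
```

Under these assumptions one epoch is 27,000 steps, and the learning rate falls to half of its peak at the midpoint of the run.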
|
| 49 |
+
|
| 50 |
+
## Support Me
|
| 51 |
+
At the moment, training one epoch costs around 130 dollars. If you like my project, please consider supporting me: https://ko-fi.com/suzushi2024
|
| 52 |
+
|
| 53 |
+
Lastly, huge thanks to meg, who has been supporting this project. Without him it would not have been possible!
|