Update README.md
README.md
CHANGED
|
@@ -17,36 +17,75 @@ Made for the **Forbidden Vision** ComfyUI custom nodes
|
|
| 17 |
|
| 18 |
---
|
| 19 |
|
| 20 |
-
##
|
| 21 |
|
| 22 |
-
|
| 23 |
|
| 24 |
-
|
| 25 |
-
|
| 26 |
-
-
|
| 27 |
-
|
| 28 |
|
| 29 |
-
**
|
| 30 |
-
|
| 31 |
-
This mixed approach ensures the models work reliably across:
|
| 32 |
-
- ✓ Both realistic and anime art styles
|
| 33 |
-
- ✓ Difficult angles, occlusions, and expressions
|
| 34 |
-
- ✓ Low-quality generations and artifacts
|
| 35 |
-
- ✓ SFW and NSFW content equally
|
| 36 |
-
- ✓ Edge cases that break traditional face detectors
|
| 37 |
|
| 38 |
---
|
| 39 |
|
| 40 |
## Model Details
|
| 41 |
|
| 42 |
### Face Detection (YOLOv11-Small)
|
| 43 |
|
| 44 |
-
**Purpose:** Primary face detection and
|
| 45 |
|
| 46 |
**Training Approach:**
|
| 47 |
-
-
|
| 48 |
- Trained at 640px resolution (inference should use the same resolution)
|
| 49 |
-
- [AUGMENTATION DETAILS PLACEHOLDER]
|
| 50 |
|
| 51 |
**Why YOLOv11-Small instead of nano?**
|
| 52 |
More reliable detection across mixed realistic/anime domains, with an acceptable speed tradeoff.
|
| 17 |
|
| 18 |
---
|
| 19 |
|
| 20 |
+
## 🎯 Why These Models Exist
|
| 21 |
|
| 22 |
+
Traditional face models fail where it matters most for AI art workflows:
|
| 23 |
|
| 24 |
+
| **Problem** | **Why It Matters** |
|
| 25 |
+
|-------------|-------------------|
|
| 26 |
+
| 🎨 **Domain-locked** | Existing models excel at *either* anime *or* realistic—never both |
|
| 27 |
+
| 🔞 **NSFW blindness** | Most models are trained only on SFW data and break on adult content |
|
| 28 |
+
| 🎲 **Generation artifacts** | Standard datasets don't include diffusion model quirks and failures |
|
| 29 |
+
| 🔄 **No rotation handling** | Can't automatically correct face angles across mixed styles |
|
| 30 |
|
| 31 |
+
**These models solve all four.**
|
| 32 |
|
| 33 |
---
|
| 34 |
|
| 35 |
+
## 📊 Training Foundation
|
| 36 |
+
|
| 37 |
+
### The Dataset Difference
|
| 38 |
+
|
| 39 |
+
Built from **11,000+ manually annotated images** across the domains that actually matter for AI generation:
|
| 40 |
+
|
| 41 |
+
<table>
|
| 42 |
+
<tr>
|
| 43 |
+
<td width="50%">
|
| 44 |
+
|
| 45 |
+
**🎨 Multi-Domain Coverage**
|
| 46 |
+
- SDXL, SD1.5, Pony, Illustrious outputs
|
| 47 |
+
- Curated Danbooru (anime styles)
|
| 48 |
+
- Real photography
|
| 49 |
+
- Full NSFW inclusion (no filtering)
|
| 50 |
+
|
| 51 |
+
</td>
|
| 52 |
+
<td width="50%">
|
| 53 |
+
|
| 54 |
+
**💎 Edge Case Priority**
|
| 55 |
+
- ✓ Extreme angles & occlusions
|
| 56 |
+
- ✓ Failed/broken generations
|
| 57 |
+
- ✓ Low-quality artifacts
|
| 58 |
+
- ✓ Unusual expressions & poses
|
| 59 |
+
- ✓ Everything other models ignore
|
| 60 |
+
|
| 61 |
+
</td>
|
| 62 |
+
</tr>
|
| 63 |
+
</table>
|
| 64 |
+
|
| 65 |
+
### What This Means For You
|
| 66 |
+
|
| 67 |
+
```
|
| 68 |
+
Traditional models: Trained on clean celebrity faces
|
| 69 |
+
↓
|
| 70 |
+
Fail on real workflows
|
| 71 |
+
|
| 72 |
+
These models: Trained on what you actually generate
|
| 73 |
+
↓
|
| 74 |
+
Work when you need them
|
| 75 |
+
```
|
| 76 |
+
|
| 77 |
+
**One model family. Every domain. Zero compromises.**
|
| 78 |
+
|
| 79 |
## Model Details
|
| 80 |
|
| 81 |
### Face Detection (YOLOv11-Small)
|
| 82 |
|
| 83 |
+
**Purpose:** Primary face detection with high recall and very tight face boxes
|
| 84 |
|
| 85 |
**Training Approach:**
|
| 86 |
+
- After every training run, I ran the model on a new mixed dataset, hard-mined the failures, and improved the dataset, repeating until performance was acceptable
|
| 87 |
+
- Applied custom offline augmentation to the initial set to complement the light augmentations in the YOLO training script
|
| 88 |
- Trained at 640px resolution (inference should use the same resolution)
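
The hard-mining step in the list above can be illustrated with a small sketch. This is not the actual pipeline used for these models. It only shows one plausible way to flag images where a detector missed a face or produced a spurious box, so they can be reviewed and added to the dataset. The function names, the `(x1, y1, x2, y2)` box format, and the 0.5 IoU threshold are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels (assumed format)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def count_errors(truth: List[Box], preds: List[Box], thr: float = 0.5) -> Tuple[int, int]:
    """Greedily match predictions to ground truth; return (missed, false_positives)."""
    unmatched = list(truth)
    false_positives = 0
    for p in preds:
        # Pick the best still-unmatched ground-truth box for this prediction.
        best = max(unmatched, key=lambda t: iou(p, t), default=None)
        if best is not None and iou(p, best) >= thr:
            unmatched.remove(best)
        else:
            false_positives += 1
    return len(unmatched), false_positives


def mine_hard_examples(
    truth: Dict[str, List[Box]],
    preds: Dict[str, List[Box]],
    thr: float = 0.5,
) -> List[str]:
    """Return image names with at least one missed face or spurious detection."""
    hard = []
    for name, gt_boxes in truth.items():
        missed, false_positives = count_errors(gt_boxes, preds.get(name, []), thr)
        if missed or false_positives:
            hard.append(name)
    return sorted(hard)
```

In a loop like the one described, the flagged images would be re-annotated or corrected by hand and merged into the training set before the next run.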
|
| 89 |
|
| 90 |
**Why YOLOv11-Small instead of nano?**
|
| 91 |
More reliable detection across mixed realistic/anime domains, with an acceptable speed tradeoff.
|
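
The 640px note under Training Approach means the image should be scaled to the training resolution before detection. YOLO-style pipelines usually handle this internally through an image-size setting, so the sketch below only illustrates the geometry. It shows one common way (aspect-preserving "letterbox" scaling with centered padding) to fit an image into a 640×640 canvas and to map a detected box back to original pixel coordinates. The helper names are hypothetical, and the exact padding scheme in any given library may differ.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def letterbox_params(width: int, height: int, target: int = 640) -> Tuple[float, int, int, int, int]:
    """Return (scale, new_w, new_h, pad_x, pad_y) for fitting an image into target x target."""
    if width <= 0 or height <= 0:
        raise ValueError("image dimensions must be positive")
    scale = target / max(width, height)   # keep aspect ratio; longest side becomes `target`
    new_w = round(width * scale)
    new_h = round(height * scale)
    pad_x = (target - new_w) // 2          # left padding when centering
    pad_y = (target - new_h) // 2          # top padding when centering
    return scale, new_w, new_h, pad_x, pad_y


def box_to_original(box: Box, scale: float, pad_x: int, pad_y: int) -> Box:
    """Map a box from letterboxed coordinates back to the original image."""
    x1, y1, x2, y2 = box
    return (
        (x1 - pad_x) / scale,
        (y1 - pad_y) / scale,
        (x2 - pad_x) / scale,
        (y2 - pad_y) / scale,
    )


# Example: a 1280x720 frame is scaled by 0.5 and padded 140px at the top.
scale, new_w, new_h, pad_x, pad_y = letterbox_params(1280, 720)
face = box_to_original((100, 240, 200, 340), scale, pad_x, pad_y)
```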