luxdelux7 commited on
Commit
ef0c7b6
·
verified ·
1 Parent(s): 726f767

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +56 -17
README.md CHANGED
@@ -17,36 +17,75 @@ Made for the **Forbidden Vision** ComfyUI custom nodes
17
 
18
  ---
19
 
20
- ## Dataset
21
 
22
- All three models share a core mixed-domain dataset specifically curated for diffusion-generated content:
23
 
24
- - images from CivitAI diffusion outputs (SDXL, SD1.5, Pony, Illustrious)
25
- - curated images from Danbooru (mixed anime styles)
26
- - real photographs from various sources
27
- - NSFW images from each domain without filtering
 
 
28
 
29
- **Total dataset size:** ~11k manually annotated images
30
-
31
- This mixed approach ensures the models work reliably across:
32
- - ✓ Both realistic and anime art styles
33
- - ✓ Difficult angles, occlusions, and expressions
34
- - ✓ Low-quality generations and artifacts
35
- - ✓ SFW and NSFW content equally
36
- - ✓ Edge cases that break traditional face detectors
37
 
38
  ---
39
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
40
  ## Model Details
41
 
42
  ### Face Detection (YOLOv11-Small)
43
 
44
- **Purpose:** Primary face detection and bounding box localization
45
 
46
  **Training Approach:**
47
- - [TRAINING DETAILS PLACEHOLDER]
 
48
  - Trained at 640px resolution (inference should use same resolution)
49
- - [AUGMENTATION DETAILS PLACEHOLDER]
50
 
51
  **Why YOLOv11-Small instead of nano?**
52
  More reliable detection across mixed realistic/anime domains with acceptable speed tradeoff.
 
17
 
18
  ---
19
 
20
+ ## 🎯 Why These Models Exist
21
 
22
+ Traditional face models fail where it matters most for AI art workflows:
23
 
24
+ | **Problem** | **Why It Matters** |
25
+ |-------------|-------------------|
26
+ | 🎨 **Domain-locked** | Existing models excel at *either* anime *or* realistic—never both |
27
+ | 🔞 **NSFW blindness** | Most models trained only on SFW data break on adult content |
28
+ | 🎲 **Generation artifacts** | Standard datasets don't include diffusion model quirks and failures |
29
+ | 🔄 **No rotation handling** | Can't automatically correct face angles across mixed styles |
30
 
31
+ **These models solve all four.**
 
 
 
 
 
 
 
32
 
33
  ---
34
 
35
+ ## 📊 Training Foundation
36
+
37
+ ### The Dataset Difference
38
+
39
+ Built from **11,000+ manually annotated images** across the domains that actually matter for AI generation:
40
+
41
+ <table>
42
+ <tr>
43
+ <td width="50%">
44
+
45
+ **🎨 Multi-Domain Coverage**
46
+ - SDXL, SD1.5, Pony, Illustrious outputs
47
+ - Curated Danbooru (anime styles)
48
+ - Real photography
49
+ - Full NSFW inclusion (no filtering)
50
+
51
+ </td>
52
+ <td width="50%">
53
+
54
+ **💎 Edge Case Priority**
55
+ - ✓ Extreme angles & occlusions
56
+ - ✓ Failed/broken generations
57
+ - ✓ Low-quality artifacts
58
+ - ✓ Unusual expressions & poses
59
+ - ✓ Everything other models ignore
60
+
61
+ </td>
62
+ </tr>
63
+ </table>
64
+
65
+ ### What This Means For You
66
+
67
+ ```
68
+ Traditional models: Trained on clean celebrity faces
69
+
70
+ Fail on real workflows
71
+
72
+ These models: Trained on what you actually generate
73
+
74
+ Work when you need them
75
+ ```
76
+
77
+ **One model family. Every domain. Zero compromises.**
78
+
79
  ## Model Details
80
 
81
  ### Face Detection (YOLOv11-Small)
82
 
83
+ **Purpose:** Primary face detection with high recall and very tight face boxes
84
 
85
  **Training Approach:**
86
+ - After every training run, I ran the model on a new mixed dataset, hardmining failures and improving the dataset until an acceptable performance was reached
87
+ - Used offline custom augmentation on the initial set to complement light yolo training script augmentations
88
  - Trained at 640px resolution (inference should use same resolution)
 
89
 
90
  **Why YOLOv11-Small instead of nano?**
91
  More reliable detection across mixed realistic/anime domains with acceptable speed tradeoff.