Spaces:
Running
Running
| # OCR Preprocessing Triage | |
| ## Quick Fixes Implemented | |
| 1. **Handwritten** - Disabled thresholding, uses grayscale only | |
| 2. **Newspapers** - Increased block size (51) and constant (10) for softer thresholding | |
| 3. **JPEG Artifacts** - Auto-detection and specialized denoising | |
| 4. **Border Issues** - Crops edges after deskew to avoid threshold problems | |
| 5. **Low Resolution** - Upscales small text for better recognition | |
| ## Testing | |
| ``` | |
| python testing/test_triage_fix.py | |
| ``` | |
| Check `output/comparison/` for results. | |