Domain Specific Preprocessing Constraints
Medical Imaging Constraints
Medical images have strict preprocessing requirements. DICOM format contains metadata critical for interpretation. Window/level settings control how pixel values map to display. Wrong preprocessing leads to missed diagnoses.
Key constraints: Preserve original pixel values for regulatory compliance. Document all transformations applied. Avoid lossy augmentations that could mask pathology. Maintain aspect ratios that clinicians expect.
Satellite and Aerial Imagery
Satellite images have 12-16 bits per channel versus 8 bits for consumer photos. Spectral bands beyond visible RGB provide crucial information. Standard preprocessing designed for 8-bit RGB destroys this data.
Key constraints: Preserve full bit depth during preprocessing. Handle multiple spectral bands (not just RGB). Account for varying sun angles and atmospheric conditions across images. Geospatial metadata must survive preprocessing.
Document and Text Images
Text recognition requires different preprocessing than object classification. Orientation matters - upside-down text must be detected and corrected before OCR. Horizontal flipping is forbidden because it creates invalid characters.
Key constraints: Deskewing aligns rotated documents. Binarization separates text from background. Noise removal helps but aggressive filtering destroys thin strokes. Preserve aspect ratio - stretching distorts character shapes.
Choosing Domain-Appropriate Preprocessing
Start with domain expert consultation. Ask what transformations are acceptable for the specific use case. Test preprocessing on edge cases before training. Monitor for cases where preprocessing destroys critical information.