Verifying AI-generated images at scale
A technical deep-dive into how we detect synthetic media without breaking on adversarial inputs.
By Dr. Ananya Roy
Verifying AI-generated images at scale
When people ask how Verify detects AI images, the honest answer is: a stack of overlapping signals. No single detector survives the next-gen generator. So we run an ensemble — and the engineering story is mostly about how we keep the ensemble fast.
The four signal classes
- Generative artefacts. Frequency-domain residuals, latent-space inconsistencies, and learned embedding anomalies.
- Capture-time signals. EXIF, hardware attestation (C2PA), and physical-camera fingerprints.
- Content plausibility. Light direction, shadow geometry, reflection consistency.
- Provenance. Reverse image search, perceptual hash matches against known-synthetic seed sets.
We weight these per request based on observed confidence — a clean smartphone capture with valid C2PA never goes through the expensive ensemble.
Why naive ensembles fail
Most public benchmarks evaluate a single detector on a fixed test set. Production is different: adversaries adapt. We retrain quarterly on red-team outputs and publish our confusion matrices internally — the moment a class slips below 90% TPR, it goes back into the queue.
