Research · 10 min read
Document verification beyond OCR
Why optical character recognition is the easy half of document fraud detection, and what the hard half looks like.
By Dr. Ananya Roy
OCR extracts text from a document. That tells you what the document says. Document verification asks: is the document real? That is a different problem.
Our document pipeline checks substrate (paper texture, security threads), composition (template alignment, font drift), and provenance (chip data on eMRTDs, that is, electronic machine-readable travel documents such as e-passports). OCR is one of many inputs, and not the most informative one.
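To make the layering concrete, here is a minimal sketch of how results from those layers might be collected and combined. It is not our production code: the names `Layer`, `CheckResult`, `failed_checks`, and `looks_genuine`, the individual check names, and the "every check must pass" rule are all assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class Layer(Enum):
    # Hypothetical labels mirroring the layers described in the post.
    SUBSTRATE = "substrate"      # paper texture, security threads
    COMPOSITION = "composition"  # template alignment, font drift
    PROVENANCE = "provenance"    # chip data on eMRTDs
    OCR = "ocr"                  # extracted text


@dataclass(frozen=True)
class CheckResult:
    layer: Layer
    name: str      # hypothetical check name, e.g. "font_drift"
    passed: bool


def failed_checks(results):
    """Return the checks that did not pass, in input order."""
    return [r for r in results if not r.passed]


def looks_genuine(results):
    """Assumed decision rule: at least one check ran and none failed."""
    return len(results) > 0 and not failed_checks(results)


# A document whose text reads cleanly but whose fonts drift from the template.
results = [
    CheckResult(Layer.OCR, "fields_readable", True),
    CheckResult(Layer.COMPOSITION, "font_drift", False),
]

print(looks_genuine(results))                    # False
print([r.name for r in failed_checks(results)])  # ['font_drift']
```

The example document passes the OCR check and still fails overall, which is the point: readable text says nothing about whether the document is real. A real system would more likely weight the layers than apply a strict all-or-nothing rule.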
#document-verification #kyc
