Optical Character Recognition (OCR) Technology

Every receipt, bill, and statement that enters Hubdoc goes through our optical character recognition (OCR) process. Computer software mines these documents for data, and then Hubdoc adds in some human quality assurance to certify that the correct information is extracted.

Extractor means no data entry or filing for you! It's paperless, so you're saving the earth!


The extraction process generally takes fewer than 24 hours, and then the date, amount, and vendor's name is stored in Hubdoc along with the document.

Was this article helpful?
2 out of 4 found this helpful
Have more questions? Submit a request


  • Avatar
    Christopher Harris

    Can you make it clear that I should wait for this process to occur before publishing to Xero? If I publish to Xero before the OCR has occured then the bill created in Xero has nothing in it except the grand total which *I* had to manually enter in Hubdoc.

  • Avatar
    Andrew Pendleton

    Does it help or hinder when receipts are previously scanned with ScanSnapp and formatted as a "searchable PDF" by Abbyy FIneReader?

  • Avatar
    Morgan Neff

    Will the OCR process extract account or customer id numbers on a statement or invoice (I.e. Utility bill)? If so, where in the details section would it appear? If not, where could that information be manually entered as to sync properly with quickbooks online?

Powered by Zendesk