Scanning documents with OCR to make the text searchable


OVERVIEW

Target audience: Entire McGill community

When you scan a text document on one of the uPrint machines on campus, the scan is emailed to you as a non-editable, image-only PDF file.

If you want the resulting PDF to contain searchable, editable text, you need to turn on the Optical Character Recognition (OCR) settings on the uPrint device before scanning.

The instructions below are based on the Xerox model 5735; other models may differ slightly.

  1. Place the original document face up in the document feeder, or face down on the glass.
  2. Press Services Home on the control panel; then touch E-mail on the touch screen.
  3. Touch the Email Options tab.
  4. Touch File format... (shown with its current setting: PDF, Multi-page, Image Only).
  5. On the next screen, under Document Options, switch from Image to Searchable. The on-screen description confirms that the document will be scanned with Optical Character Recognition to make it searchable.
  6. Press Document Language... and select the language the document is written in.
  7. Press Save, then start your scan.

