Covid-19 Update

Max are pleased to confirm that our teams are still fully operational in line with the latest government guidelines. This includes our off-site team at ONS. We do understand, however, that for many businesses and institutions this is unfortunately still not the case. We continue to wish you well during this difficult period and look forward to somewhat easier times, hopefully by Spring 2021.

If you would like to schedule a call for a future date, please complete our contact form indicating a preferred day. We have decided to freeze our prices until May 1st 2021 to ensure that our customers are not penalised by the ongoing restrictions. We may decide to extend this date further still.

In the meantime, please do take a look at our podcasts page. The passion, enthusiasm and insight from our guest speakers have been inspirational, especially given the circumstances.

Optical Character Recognition (OCR)

Max is able to convert all typewritten text using optical character recognition (“OCR”). We convert paper-based records, microfilm or existing digital images into a searchable PDF format. Our specialised methods give exceptional accuracy, turning your offline material into a searchable online resource.

We are able to handle jobs of all sizes and can work with all kinds of original materials, including bound volumes and broadsheet newspapers. We can output to a variety of formats, including PDF/A, text, MS-Word, XML and HTML.

Our sophisticated OCR system uses pattern recognition algorithms, which identify individual characters. A dictionary-based analysis then enables the system to deduce the content on a word-by-word basis, even where individual characters have not been picked up correctly. The OCR process recognises and retains content layout such as columns, tables and illustrations. This means that the document can be displayed in its original layout in the PDF whilst still being a fully searchable archive.
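As a rough illustration of the dictionary-based step described above, the following Python sketch shows how a word containing a misread character might be matched to its nearest dictionary entry. It is a simplified example only and not a description of our production system: the word list, the `correct_word` and `correct_text` helpers and the similarity cutoff of 0.75 are all assumptions chosen for the example, and the matching relies on the `difflib` module from the Python standard library.

```python
from difflib import get_close_matches

# Hypothetical mini-dictionary for the example; a real system would
# load a full word list for the language of the document.
DICTIONARY = [
    "archive", "character", "document",
    "optical", "recognition", "searchable",
]


def correct_word(word, dictionary=DICTIONARY, cutoff=0.75):
    """Return the closest dictionary word, or the input unchanged.

    The word is lower-cased before matching, so a corrected word is
    returned in lower case. The cutoff is an assumed similarity
    threshold between 0 and 1.
    """
    lowered = word.lower()
    if lowered in dictionary:
        return lowered
    matches = get_close_matches(lowered, dictionary, n=1, cutoff=cutoff)
    # Leave the word alone if nothing in the dictionary is close enough.
    return matches[0] if matches else word


def correct_text(text):
    """Apply correct_word to each whitespace-separated word."""
    return " ".join(correct_word(word) for word in text.split())


if __name__ == "__main__":
    # "0" (zero) has been misread in place of the letter "o".
    print(correct_text("0ptical character rec0gnition"))
```

Words with no sufficiently close dictionary entry are returned unchanged, which avoids forcing names or reference numbers into the nearest ordinary word.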

Clients for whom we have undertaken large-scale OCR projects include:

  • London School of Economics
  • British Universities Film & Video Council
  • Anti-Slavery International Library
  • Greenwich University

For further details of the services we offer, please contact us on 020 8309 5445 or via our contact page.
