
OCR - turning your photo into text

Posted: Fri May 31, 2013 12:08 am
by mikerr
I thought I'd throw together a quick HOWTO for converting a page of a book into text using Optical Character Recognition (OCR). First, install Tesseract:

Code:

sudo apt-get install tesseract-ocr
Take a photo of your page:

Code:

raspistill -o page.jpg
Convert it to text (OCR):

Code:

tesseract page.jpg pagetext
This can take the Pi anything from 1 to 10 minutes depending on the image...

You then have a text file called pagetext.txt, which you can view with:

Code:

cat pagetext.txt
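
The steps above could be combined into one small shell script. This is only a rough sketch: the function names ocr_output_path and ocr_page are made up for illustration and are not part of tesseract or raspistill, and the default file names simply reuse the ones from this post.

```shell
#!/bin/sh
# Sketch: capture a page with the Pi camera and OCR it in one go.
# Assumes raspistill and tesseract are installed, as in the steps above.

# tesseract appends ".txt" to the output base name it is given,
# so "pagetext" becomes "pagetext.txt".
ocr_output_path() {
    printf '%s.txt\n' "$1"
}

# Hypothetical helper: take the photo, run OCR, print the recognised text.
# $1 = image file (default page.jpg), $2 = output base name (default pagetext)
ocr_page() {
    image=${1:-page.jpg}
    base=${2:-pagetext}

    # Stop early if either tool is missing.
    for tool in raspistill tesseract; do
        command -v "$tool" >/dev/null 2>&1 || {
            echo "missing tool: $tool" >&2
            return 1
        }
    done

    raspistill -o "$image" || return 1
    tesseract "$image" "$base" || return 1
    cat "$(ocr_output_path "$base")"
}
```

After sourcing the file, running "ocr_page page.jpg pagetext" should take the photo, run the OCR, and print pagetext.txt.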

Re: OCR - turning your photo into text

Posted: Fri May 31, 2013 10:58 am
by gkreidl
OCR is one of the few areas where Linux is a decade behind professional solutions on Windows computers (one of the very few reasons why I'm still using them). Tesseract works after a fashion, but for my needs it is not really usable.

Re: OCR - turning your photo into text

Posted: Tue May 13, 2014 3:09 am
by parham bahramsari
Hi, just a question: can I use "tesseract" from C++ as well?

Re: OCR - turning your photo into text

Posted: Thu Dec 10, 2015 3:33 am
by ayushboss
When I use these commands, I get some random characters instead. How can I fix this?