trying to OCR a simple tif file with tesseract-ocr
Robert P. J. Day
rpjday at crashcourse.ca
Sat Mar 26 12:30:41 UTC 2011
as part of a larger workflow, i need to OCR process some simple text
files and, as i read it, tesseract is the OCR tool of choice for
ubuntu. so i'm following the instructions here:
https://help.ubuntu.com/community/OCR
but i can't seem to get any output. i've taken a B/W screenshot of
some text, it's saved in a grayscale/1-layer .tif file, at which point
i run:
$ tesseract tess.tif output
i get no diagnostics but the output file is always empty. does
someone have a suggestion for an absurdly simple example of using
tesseract to get text out of a tif file? i'm sure i'm missing
something simple but critical.
rday
p.s. i haven't changed any of tesseract's config file info since the
documentation *seems* to suggest i don't need to, but i'm willing to
be corrected on that.
--
========================================================================
Robert P. J. Day Waterloo, Ontario, CANADA
http://crashcourse.ca
Twitter: http://twitter.com/rpjday
LinkedIn: http://ca.linkedin.com/in/rpjday
========================================================================
More information about the ubuntu-users
mailing list