<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body text="#000000" bgcolor="#FFFFFF">
On 10/09/2011 01:34 PM, Kevin O'Gorman wrote:
<blockquote
cite="mid:CAGVXcSZ_z_8zr25++dwsHePfpZde-=AgwT2FLU7qtVZLwxLrPw@mail.gmail.com"
type="cite">I'm new to OCR (optical character reading), have never
done it before. Suddenly I have a need.<br>
<br>
I've been diving through old papers and have found hard-copy
(appears to be real Courier font, laser printed on white
background) of a program I wrote decades ago on a Macintosh 512K
in Lightspeed C. I thought I had lost it completely. I would
like to recover it from the hard-copy without typing ~100 pages of
code. I have a scanner, and full Acrobat CS5 on a Windows
machine, plus all the FOSS of Ubuntu (tesseract, gocr, plus
anything useful in multiverse). Does anybody know the fastest way
to usable code from this situation?<br clear="all">
<br>
-- <br>
Kevin O'Gorman, PhD<br>
<br>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
</blockquote>
<tt>On Windows I have an old Nuance program that works pretty well,
but it doesn't come cheap.<br>
Also, Nuance will call you up on the phone and bother you every so
often. There is<br>
also Foxit Reader, and ABBYY Fine Reader. In any case, even tho it
works well, you will <br>
have to proof-read everything very carefully. OCR programs are not
perfect. Probably <br>
someone will know of a Linux program, but in this case, you may
find you have to go <br>
commercial to get the best performance. <br>
<br>
NB: When you scan, *make sure* that the page is absolutely
straight. Just a couple degrees<br>
off of the vertical and the OCR goes to hell in a hurry.<br>
<br>
--doug<br>
<br>
</tt><br>
<pre class="moz-signature" cols="72">--
Blessed are the peacemakers...for they shall be shot at from both sides. --A. M. Greeley</pre>
</body>
</html>