<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
<small>Thank you, Michael McIntyre and Donn. <br>
pdftotext works beautifully!<br>
jdh<br>
</small><br>
D. Michael McIntyre wrote:
<blockquote
cite="mid200701281723.49873.michael.mcintyre@rosegardenmusic.com"
type="cite">
<pre wrap="">On Sunday 28 January 2007 4:29 pm, Donn wrote:
</pre>
<blockquote type="cite">
<blockquote type="cite">
<pre wrap="">I have a few articles in .pdf format I'd need to convert (or OCR) to
plain text, .odf or .doc format.
Any advice for this Linux newbie?
</pre>
</blockquote>
<pre wrap="">Hi, check what these commands give you:
pdftotext
or
pdftohtml
</pre>
</blockquote>
<pre wrap=""><!---->
Same thing I was going to suggest, more or less. I use pdftohtml, then load
the HTML into OpenOffice and export it or save as or whatever to convert it
to OO.o-native format. (I think you have to export it or send it, so you
don't wind up with an OO.o document that still behaves like a one-page HTML
file, but I can't remember the details, and I just closed OO.o, and am too
lazy to sit here while it warms back up.)
</pre>
</blockquote>
<br>
</body>
</html>