<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
Thank you, Michael McIntyre and Donn. pdftotext works beautifully!<br>
<br>
jdh<br>
<br>
D. Michael McIntyre wrote:
<blockquote cite="mid200701281723.49873.michael.mcintyre@rosegardenmusic.com" type="cite">
<pre wrap="">On Sunday 28 January 2007 4:29 pm, Donn wrote:
</pre>
<blockquote type="cite">
<blockquote type="cite">
<pre wrap="">I have a few articles in .pdf format I'd need to convert (or OCR) to plain
text, .odf or .doc format. Any advice for this Linux newbie?
</pre>
</blockquote>
<pre wrap="">Hi,
check what these commands give you:
pdftotext
or
pdftohtml
</pre>
</blockquote>
<pre wrap="">
Same thing I was going to suggest, more or less. I use pdftohtml, then load
the HTML into OpenOffice and export it or save as or whatever to convert it
to OO.o-native format. (I think you have to export it or send it, so you
don't wind up with an OO.o document that still behaves like a one-page HTML
file, but I can't remember the details, and I just closed OO.o, and am too
lazy to sit here while it warms back up.)
</pre>
</blockquote>
</body>
</html>