linux program to convert PDF to text

sktsee sktsee at tulsaconnect.com
Thu Feb 11 02:26:11 UTC 2010


On Wed, 10 Feb 2010 17:02:19 -0800, Robert Swanson wrote:

> I need to know if there is a Linux program to convert PDF to text that
> preserves the line breaks?
> 	I am presently using a terminal program called PDFtotext, but it
> 	doesn't
> recognize line breaks which makes it necessary to manually go through
> and format the text properly.  Because of the number of files that need
> to be converted at Grolaw.net to preserve the documentation from the SCO
> vs Linux trial amongst other things, it would really be helpful to know
> if there was a better program.
> Thanks
> Bob

I just downloaded a pdf file from groklaw here:
http://www.groklaw.net/pdf2/Novell-629.pdf
and issued 

$ pdftotext -layout -eol unix -nopgbrk Novell-629.pdf novell-629.txt

Is this what you are needing, or something that adds additional 
formatting? 

-- 
sktsee





More information about the ubuntu-users mailing list