[ubuntu-za] How to copy text from PDF file (URGENT)
William Walter Kinghorn
williamk at dut.ac.za
Thu May 12 05:54:30 UTC 2011
Hi Ian,
I use gImageReader, a graphical front end for the tesseract OCR engine: http://sourceforge.net/projects/gimagereader/
Features
Open images and PDFs
Acquire from scanner
Select the part of the image to recognize
Support for different recognition languages
Side by side comparison of source image and output text
Remove linebreaks in output text
Supports tesseract 3.0
I also searched Google for "ubuntu convert pdf" and found these for you
to try if gImageReader does not work for you:
http://embraceubuntu.com/2007/04/10/convertimport-from-pdf-and-keep-the-formatting/
http://www.ubuntugeek.com/howto-convert-pdf-files-to-html-files.html
http://www.linuxquestions.org/questions/linux-software-2/pdf-to-doc-converter-344569/
http://ubuntuforums.org/showthread.php?t=199201
http://shibuvarkala.blogspot.com/2008/11/howto-convert-pdf-to-txt-in-ubuntu.html
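
If a scripted route is preferred over a GUI, the same idea (try the PDF's embedded text first, then fall back to OCR) could be sketched in Python roughly as follows. This is only a sketch under assumptions and has not been tested against the files in question: it assumes the pdftotext and pdftoppm tools from poppler-utils and the tesseract command are installed and on the PATH. The function names, the 300 dpi resolution and the "eng" language code are illustrative choices, not anything taken from the links above.

```python
import subprocess
import tempfile
from pathlib import Path


def pdftotext_cmd(pdf):
    """Command that prints the PDF's embedded text layer to stdout."""
    return ["pdftotext", "-layout", str(pdf), "-"]


def pdftoppm_cmd(pdf, prefix, dpi=300):
    """Command that renders every page to PNG files named <prefix>-<n>.png."""
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf), str(prefix)]


def tesseract_cmd(image, outbase, lang="eng"):
    """Command that OCRs one image and writes the result to <outbase>.txt."""
    return ["tesseract", str(image), str(outbase), "-l", lang]


def has_text(s):
    """True if the string holds anything besides whitespace and form feeds."""
    return bool(s.strip())


def extract_text(pdf, lang="eng"):
    """Return the text of a PDF, using OCR when direct extraction yields nothing."""
    # First attempt: read the text layer directly. A non-zero exit status or
    # empty output is treated as "no usable text" rather than as a fatal error.
    result = subprocess.run(pdftotext_cmd(pdf), capture_output=True, text=True)
    if result.returncode == 0 and has_text(result.stdout):
        return result.stdout

    # Fallback: render the pages as images and run OCR on each one.
    with tempfile.TemporaryDirectory() as tmp:
        prefix = Path(tmp) / "page"
        subprocess.run(pdftoppm_cmd(pdf, prefix), check=True)
        pages = []
        # Lexical sort; this relies on pdftoppm zero-padding the page numbers.
        for image in sorted(Path(tmp).glob("page-*.png")):
            outbase = image.with_suffix("")  # e.g. page-1.png -> page-1
            subprocess.run(tesseract_cmd(image, outbase, lang),
                           check=True, capture_output=True)
            pages.append(outbase.with_suffix(".txt").read_text(encoding="utf-8"))
    return "\n".join(pages)
```

Calling extract_text("document.pdf") would then return the text as a single string (document.pdf is a placeholder name). OCR output usually needs proofreading, so the fallback is best treated as a starting point rather than a finished copy.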
Hope this helps
William
________________________________________
From: ubuntu-za-bounces at lists.ubuntu.com [ubuntu-za-bounces at lists.ubuntu.com] On Behalf Of Ian Whitfield [whitfield at federalsaints.net]
Sent: 11 May 2011 20:43
To: Ubuntu List
Subject: [ubuntu-za] How to copy text from PDF file (URGENT)
Hi all
I'm running Kubuntu 10.04 and have to copy text from PDF files.
Some work and others don't, so those must have copy protection set. I
Googled for ideas and have tried different readers, running pdf2ps and
then ps2pdf, and opening the file in GIMP, saving it as JPG, then opening
that in OO and converting it to PDF again.
Nothing works for me!!
Has anyone got a foolproof way of doing this? It is rather urgent, as I
have to finish a project.
Thanks a lot
Ian Whitfield.
--
ubuntu-za mailing list
ubuntu-za at lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-za