
RE: What tool is recommended to convert pdf to html

From: Geert Josten <geert.josten@daidalos.nl>
Date: Mon, 25 Jul 2011 15:55:50 +0200
To: Alex Muir <alex.g.muir@gmail.com>, XProc Dev <xproc-dev@w3.org>
Message-ID: <B26C615F8546A84C81165A7BC8BE61A020F39D87C4@EXMBXC01.ms-hosting.nl>

Hi Alex,

A bit off-topic, but what the heck. How detailed does the conversion need to be? There are literally hundreds of tools, but they suit various purposes. You could look at OCR and closely related tools to extract a high level of detail, but there are also plenty of tools that only do text extraction, purely for searching purposes.

Kind regards,
Geert

From: xproc-dev-request@w3.org [mailto:xproc-dev-request@w3.org] On Behalf Of Alex Muir
Sent: Monday, 25 July 2011 15:45
To: XProc Dev
Subject: What tool is recommended to convert pdf to html

Hi,

I'm wondering what tool would be recommended to convert PDF to HTML or XML effectively, as part of a process to convert a large batch of PDF files.

Regards


--
Alex Muir
Instructor | Program Organizer - University Technology Student Work Experience Building
University of the Gambia
http://sites.utg.edu.gm/alex/<https://sites.google.com/a/utg.edu.gm/utsweb/>

Low budget software development benefiting development in the Gambia, West Africa
Experience of a lifetime, come to Gambia and Join UTSWEB - http://sites.utg.edu.gm/utsweb/<https://sites.google.com/a/utg.edu.gm/utsweb/>
Received on Monday, 25 July 2011 13:56:33 GMT
