Re: Unix cmd line utility for Multibyte PDF -> Text

fyi - PDFBox did not work [for me], but I was then referred to 'xpdf' at 
http://www.foolabs.com/xpdf/ - and it works very nicely.

Thanks,

~mm

cstrobbe wrote:
> Hi Michael,
> 
> 
> Quoting Michael Monaghan <Michael.Monaghan@Sun.COM>:
> 
> 
>>Hi,
>>
>>I need a pdf -> text command line utility for Unix/Solaris that
>>won't corrupt non-ASCII characters.
> 
> 
> 
> A few years ago I used PDFBox, a Java PDF library, to extract text from 
> PDF (http://www.pdfbox.org/). I seem to remember that it also worked 
> for non-ASCII characters.
> 
> Best regards,
> 
> Christophe
> 

Received on Monday, 23 October 2006 16:01:43 UTC