Velocity Reviews - Computer Hardware Reviews

Velocity Reviews > Newsgroups > Programming > C++ > Library for parsing pdf docs?

Reply
Thread Tools

Library for parsing pdf docs?

 
 
Bernd Muent
Guest
Posts: n/a
 
      02-16-2006
Hi,
I'm looking for a c/C++ library (suitable for Linux and Windows) to
parse pdf documents. I only need the plain text and defined page breaks
as output.
Parsing trough a system ("pdftotext ..."); does this for me, but I'm
looking for a more elegant way using a library.
xpdf does not come with any library nor API documentation.

Thanks for any hints, Bernd

--
BM Computer-Services, Bergmannstr. 66, 10961 Berlin
Webdesign, Internet, Layout und Grafik
Tel.: 030/20649400, mobil 0175/7419517, Fax: 030/20649401
Web: http://www.bmservices.de, eMail: http://www.velocityreviews.com/forums/(E-Mail Removed)
 
Reply With Quote
 
 
 
 
Victor Bazarov
Guest
Posts: n/a
 
      02-16-2006
Bernd Muent wrote:
> I'm looking for a c/C++ library (suitable for Linux and Windows) to
> parse pdf documents. [..]
>
> Thanks for any hints, Bernd


Hint: www.google.com

V
--
Please remove capital As from my address when replying by mail
 
Reply With Quote
 
 
 
 
Bernd Muent
Guest
Posts: n/a
 
      02-17-2006
Victor Bazarov schrieb:

> Hint: www.google.com


Ha Ha.
I spended 1.5 hours to do that. I found only libraries for creating pdf
files, but none for parsing them to extract words and the pages they
were on.

B.

--
BM Computer-Services, Bergmannstr. 66, 10961 Berlin
Webdesign, Internet, Layout und Grafik
Tel.: 030/20649400, mobil 0175/7419517, Fax: 030/20649401
Web: http://www.bmservices.de, eMail: (E-Mail Removed)
 
Reply With Quote
 
roberts.noah@gmail.com
Guest
Posts: n/a
 
      02-17-2006

Bernd Muent wrote:
> Victor Bazarov schrieb:
>
> > Hint: www.google.com

>
> Ha Ha.
> I spended 1.5 hours to do that. I found only libraries for creating pdf
> files, but none for parsing them to extract words and the pages they
> were on.
>
> B.


Also remember to search code repositories like sourceforge and
freshmeat.

http://freshmeat.net/search/?q=pdf&t...&Go.x=0&Go.y=0

 
Reply With Quote
 
 
 
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Postscript to PDF with pdf-tools, pdf-writer, or other Sean Nakasone Ruby 1 04-14-2008 09:13 PM
PDF::Writer, create pdf and insert in other pdf file. Ricardo Pog Ruby 1 03-26-2008 08:24 PM
PDF Library - Reading the PDF Document Chintakrindi Meghanath Ruby 2 01-09-2006 08:28 AM
Perl expression for parsing CSV (ignoring parsing commas when in double quotes) GIMME Perl 2 02-11-2004 05:40 PM
Fw: PDF library for reading PDF files Peter Galfi Python 14 01-20-2004 05:41 PM



Advertisments