I am looking for a library (if possible available in Java or PHP) in

Question

0

Asked: May 17, 20262026-05-17T00:36:20+00:00 2026-05-17T00:36:20+00:00

I am looking for a library (if possible available in Java or PHP) in

0

I am looking for a library (if possible available in Java or PHP) in order to extract text from a PDF. There is a lot of software available, including:

3-Heights™ PDF Extract http://www.pdf-tools.com/pdf/pdf-extract-content-metadata-text.aspx
PDFlib TET – Text Extraction Toolkit http://www.pdflib.com/products/tet/
PDF2XML http://sourceforge.net/projects/pdf2xml/

Which tools would you choose? What do you think of them?

Thank you very much for your kind help!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-17T00:36:21+00:00

Editorial Team

2026-05-17T00:36:21+00:00Added an answer on May 17, 2026 at 12:36 am

My favourite is iText (java) but extracting text from a PDF can be fraught with difficulties as the text in the PDF is not alway stored in the order in which it appears.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am looking for a library (if possible available in Java or PHP) in

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply