I have experimented with both pypdf and pdfMiner to extract text from PDF files.

Question

0

Asked: June 11, 20262026-06-11T22:43:07+00:00 2026-06-11T22:43:07+00:00

I have experimented with both pypdf and pdfMiner to extract text from PDF files.

0

I have experimented with both pypdf and pdfMiner to extract text from PDF files. I have some unfriendly PDFs that only pdfMiner is able to extract successfully. I am using the code here to extract text for the entire file. However, I would really like to extract text on a per page basis like the pages[i].extract_text() functionality in pypdf. Does anyone know how to extract text per page using pdfMiner?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-11T22:43:08+00:00

Editorial Team

2026-06-11T22:43:08+00:00Added an answer on June 11, 2026 at 10:43 pm

for pageNumber, page in enumerate(PDFDocument.get_pages()):
    if pageNumber == 42:
        #do something with the page

There is a pretty good article here.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have experimented with both pypdf and pdfMiner to extract text from PDF files.

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply