I want to extract some specified text in pdf files and the text position.

Editorial Team

Asked: May 27, 2026

I want to extract some specified text from PDF files, together with the position of that text on the page.

I know xpdf and MuPDF can parse PDF files, so I think they may help me accomplish this task.

But how do I use these two libraries to get the text position?


1 Answer

Editorial Team · Answer 1 · 2026-05-27T07:15:44+00:00


MuPDF comes with a couple of command-line tools, one of which is pdfdraw.

If you run pdfdraw with the -tt option, it generates XML containing every character along with its exact position.
From there you should be able to search for the text you need and read off its coordinates.
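As a rough illustration of that last step, here is a minimal Python sketch that searches per-character XML for a string and returns a bounding box for each match. The element and attribute names used here (`span`, `char`, `ucs`, `bbox`) and the sample data are assumptions for illustration only; the actual XML layout depends on your MuPDF version, so check the real output and adjust the names accordingly.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample of per-character XML. The real element and
# attribute names depend on the MuPDF version that produced the file.
SAMPLE_XML = """<page>
<span font="Helvetica" size="12">
<char ucs="H" bbox="[72 700 80 712]" />
<char ucs="i" bbox="[80 700 83 712]" />
<char ucs=" " bbox="[83 700 86 712]" />
<char ucs="P" bbox="[86 700 94 712]" />
<char ucs="D" bbox="[94 700 102 712]" />
<char ucs="F" bbox="[102 700 109 712]" />
</span>
</page>"""


def parse_bbox(text):
    """Turn a string such as '[x0 y0 x1 y1]' into a tuple of floats."""
    return tuple(float(n) for n in text.strip().strip("[]").split())


def find_text(xml_text, needle):
    """Return one (x0, y0, x1, y1) box per occurrence of needle.

    Matching is done within a single <span>, so text that is split
    across spans is not found by this simple version.
    """
    if not needle:
        return []
    hits = []
    root = ET.fromstring(xml_text)
    for span in root.iter("span"):
        # Keep only well-formed entries so string indices line up with boxes.
        chars = []
        for c in span.iter("char"):
            ch = c.get("ucs", "")
            box = parse_bbox(c.get("bbox", ""))
            if len(ch) == 1 and len(box) == 4:
                chars.append((ch, box))
        line = "".join(ch for ch, _ in chars)
        start = line.find(needle)
        while start != -1:
            boxes = [b for _, b in chars[start:start + len(needle)]]
            # The union of the character boxes covers the whole match.
            hits.append((
                min(b[0] for b in boxes),
                min(b[1] for b in boxes),
                max(b[2] for b in boxes),
                max(b[3] for b in boxes),
            ))
            start = line.find(needle, start + 1)
    return hits


if __name__ == "__main__":
    print(find_text(SAMPLE_XML, "PDF"))
```

To use this on a real document, you would replace `SAMPLE_XML` with the XML that the tool wrote out. Whether the y axis runs from the top or the bottom of the page is also something to confirm against your own output.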
