What I want is a way to parse a PDF file into HTML with

Question

Asked: June 13, 20262026-06-13T01:48:17+00:00 2026-06-13T01:48:17+00:00

What I want is a way to parse a PDF file into HTML with the image map (the hyperlinks) and the images must be in jpg format.

I have a Magazine Reader and I need the images and the position, href and size of each hyperlink.

The solution needs to be to run into a linux server.

Any suggestions?
Many thanks!

You must login to add an answer.

Need An Account,

Editorial Team · Answer 1 · 2026-06-13T01:48:18+00:00

Editorial Team

You should take a look to the pdf2html project or pdf2htmlEX.

That needs some tweaks to convert png to jpg as well.

This is that simple as :

convert foo.png foo.jpg

The Archive Base Latest Questions