What I want is a way to parse a PDF file into HTML with the image map (the hyperlinks) and the images must be in jpg format.
I have a Magazine Reader and I need the images and the position, href and size of each hyperlink.
The solution needs to be to run into a linux server.
Any suggestions?
Many thanks!
You should take a look to the pdf2html project or pdf2htmlEX.
That needs some tweaks to convert png to jpg as well.
This is that simple as :
with ImageMagick tools.
See the README.