Is there a way to find the source from which a PDF was built? The reason I ask: if it was built from XML, I want to know whether I can get the XML back from the PDF and parse it.
No, in general there is no way to do that. PDF is a presentation format, not a data storage format, and it usually does not preserve the structure of the data it presents. A PDF file may not even contain words or phrases as such. Grossly oversimplified, a PDF can be thought of as a sequence of instructions like:
-Draw character ‘a’ at coordinates 10, 30
-Move the pen to the point 40, 40
-Draw a line from the current point to the point 50, 50
…
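To make this concrete, here is a rough Python sketch of what such instructions can look like in a page's content stream. The stream below is hand-written and heavily simplified for illustration (it is not taken from a real file), and the names `CONTENT_STREAM`, `parse_operations` and `extract_text` are made up for this example. The operators it uses (`BT`, `ET`, `Tf`, `Td`, `Tj`, `m`, `l`, `S`) are standard PDF operators, but real streams are normally compressed and use many more.

```python
import re

# Hand-written, simplified content stream (an assumption for illustration).
# PDF uses postfix notation: operands come first, then the operator.
CONTENT_STREAM = """
BT
/F1 12 Tf
10 30 Td
(a) Tj
ET
40 40 m
50 50 l
S
"""

# Only the operators used in this sketch; real PDFs define many more.
OPERATORS = {"BT", "ET", "Tf", "Td", "Tj", "m", "l", "S"}

# A token is either a simple (string) without nested parentheses,
# or a run of non-whitespace characters.
TOKEN_RE = re.compile(r"\([^)]*\)|\S+")


def parse_operations(stream):
    """Group tokens into (operator, operands) pairs."""
    operations = []
    operands = []
    for token in TOKEN_RE.findall(stream):
        if token in OPERATORS:
            operations.append((token, operands))
            operands = []
        else:
            operands.append(token)
    return operations


def extract_text(operations):
    """Collect the strings shown by Tj, in drawing order."""
    return [args[0][1:-1] for op, args in operations if op == "Tj"]


if __name__ == "__main__":
    ops = parse_operations(CONTENT_STREAM)
    for op, args in ops:
        print(op, args)
    print("Text fragments:", extract_text(ops))
```

All that comes back is positioned glyphs and line segments. Nothing in the stream says which XML element, if any, a given character came from, so the best a parser can do is collect text fragments in drawing order.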