I want parse pdf for form field names and types. Is it possible? Because when I tried one PDF, it gave me some strange characters e.g.:
…
?õ»â¢_¸ðO´×¢É]Ì|BQÔQClã(¢dVò¶~?ýg?þª í
pÅ2ÞÎÉÍ??Ú?wȳ.?d;k)*lÙ´¸(ò!ú©=ià??d?éPض2Èåäý?»p?nÜÈûÏ??M
õl:`Þ°Ã3£BíTCy5 ?ð?tN¿7fDõK
±¦?i¹vü~»X?s÷A~Ôê±4?ÕµX±¤?
…
Where could be the problem? I used tool http://support.persits.com/pdf/demo_formfields.asp and pdf https://www.drsr.sk//priznania/dpfoa2010.pdf
I want make some parser for iOS. Thanks for answer.
For PDF parsing on iOS, use the Quartz API.
For an example of an app which makes use of this API, see this reader.
To extract the specific information you’re interested in, you will need to read the PDF document structure specification and figure out which dictionaries it’s in (or, if you’re lucky find some sample code).