I am trying to use the AbcPdf .net component (version 7) to process some PDFs and generate metadata. I was wondering if there is anyway to list all the tags in a pdf document? As an example of a tagged pdf, I am using this file here
Are there any other components or tools available for listing or extracting pdf tags?
Thanks in advance for you help
Use iTextSharp. It’s free and you only need the “itextsharp.dll”.
http://sourceforge.net/projects/itextsharp/
Here is a simple function for reading the text out of a PDF.
ITextSharp also has methods for dealing with tags.