I am converting pdf files to text using iTextSharp, however I found that if a PDF has embedded fonts or OpenType fonts, I cannot get the text from the PDF. Is there solution for this? I just need to convert to text. Any help is appreciated. Thanks!
Share
As someone who processes thousands of random PDFs from all sorts of diverse clients each month, XpdfText is by far the best library for extracting text, in my experience. We also use iTextSharp for various tasks, but haven’t found it nearly as good for extracting text.