I am developing a C# application in which I am converting a PDF document to an image and then rendering that image in a custom viewer.
I’ve come across a bit of a brick wall when trying to search for specific words in the generated image and I was wondering what the best way to go about this would be. Should I find the x,y location of searched word?
You can use tessract OCR image for text recognition in console mode.
I don’t know about such SDK for pdf.
BUT, if you want to get all word coordinates and values, you can use next my not complex code, thank nguyenq for hocr hint: