I read Create a tiff with only text and no images from a postscript file with ghostscript and try to use KenS`s answer.
But this method remove only “black” images – image contain data only in black channel (PDF has colorspace CMYK). How can i remove all images in my case?
I read Create a tiff with only text and no images from a postscript
Share
This does a better job, but its incomplete. It doesn’t deal with images using multiple data sources for example. Its essentially untested, except that I did test your smaller file (pages.pdf) by using ps2write to convert to PostScript and then the PostScript program below, and teh pdfwrite device, to convert back to PDF.
One of the first things you will notice is that almost all the text has vanished from your document. That’s because the fonts you are using are bitmap fonts, and the program can’t tell the difference between a bitmap representing a character, and any other kind of bitmap. For this file you can solve that by removing the definition of imagemask because all the characters use imagemask, and the other images use ‘image’.
I have a sneaky suspicion the formatting of the program is going to get messed up here 🙁