I want to prevent google from indexing pdf’s on my website. I have modified

Question

Editorial Team

Asked: June 2, 2026

I want to prevent Google from indexing PDFs on my website.

I have modified my .htaccess file to include the following lines, as suggested by Google's Webmaster Tools:

<Files ~ "\.pdf$">   
    Header set X-Robots-Tag "noindex, nofollow" 
</Files>

I know that Apache is running properly and reading my .htaccess file, because I can block access to the file entirely, but I cannot tell whether the directive above is working.

Google Webmaster Tools reports that the crawlers can still see the PDFs, but those tools seem to be intended only for use with robots.txt. Is there a third-party tool (for Linux) that I can use to check the meta tags?

1 Answer

Editorial Team · Answer 1 · 2026-06-02T06:28:03+00:00

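You could run wget against a few of the PDFs and look at the response headers. The -S option prints the headers the server sends, so you can check whether X-Robots-Tag is among them: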

wget -S http://host/something.pdf
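
If you want to check several files without reading the full header dump each time, the output can be filtered. The following is only a rough sketch: the function name robots_tag is made up for illustration, and http://host/something.pdf is the same placeholder URL as above. It reads response headers on standard input, prints the value of any X-Robots-Tag header it finds, and exits non-zero if there is none.

```shell
#!/bin/sh
# robots_tag (hypothetical helper name): read HTTP response headers on stdin,
# print the value of every X-Robots-Tag header, and return 0 if at least one
# was found, 1 otherwise.
robots_tag() {
    # tr drops the carriage returns that end HTTP header lines.
    # The match is case-insensitive and tolerates the leading spaces
    # that wget -S puts in front of each header.
    tr -d '\r' | awk '
        tolower($0) ~ /^[ \t]*x-robots-tag:/ {
            sub(/^[^:]*:[ \t]*/, "")   # strip the header name, keep the value
            print
            found = 1
        }
        END { exit !found }
    '
}

# Usage (placeholder URL; wget writes the headers to stderr, hence 2>&1):
#   wget -S --spider http://host/something.pdf 2>&1 | robots_tag
#   curl -sI http://host/something.pdf | robots_tag
```

If the .htaccess rule is taking effect, this should print the value you configured (noindex, nofollow). If it prints nothing, the header is not being sent, and one thing worth checking is whether Apache's mod_headers module is enabled.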
