I own some webspace which is registered with a University. Google has unfortunately found my CV (resume) on the site, but has mis-indexed it as a scholarly publication, which is screwing up things like citation counts on Google Scholar. I tried to upload a robots.txt into my local subdirectory. The problem is that google ignores this file, and instead uses the rules listed for the school domain.
That is, the url looks like
I have uploaded a robots.txt, which can be found here
And Google is ignoring it and instead using the robots.txt for the school’s domain
How can I make Googlebot ignore my CV?
Sadly,
robots.txtis defined to be whatever you get when youGET /robots.txt, so you can’t use it for your subdirectory.What you can do is use the
X-Robots-TagHTTP header, if you can use custom.htaccessfiles. Here’s Google’s documentation onX-Robots-Tag.