This is my robots.txt. I want to allow only the base URL domain.com


Asked: June 2, 2026 (2026-06-02T19:16:13+00:00)


This is my robots.txt. I want to allow only the base URL domain.com to be indexed and disallow all sub-URLs such as domain.com/foo and domain.com/bar.html.

User-agent: *
Disallow: /*/

Because I am not sure whether this is valid syntax, I tested it using Google Webmaster Tools. It shows me this message:

robots.txt file is probably invalid.

Is my file valid? Is there a better way to allow only the base URL for indexing?

Update: Google downloaded my robots.txt 4 hours ago. I think that's why it doesn't work. I will wait some time, and if the problem persists I will update my question again.


1 Answer

Editorial Team · Answer 1 · 2026-06-02T19:16:15+00:00

Here is a link to a validator. It might help you work through any errors in the file.

Robots.txt Checker

I checked on another validator, robots.txt Checker, and this is what I got for the second line:

Wildcard characters (like “*”) are not allowed here
The line below must be an allow, disallow, comment or a blank line statement
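For context on that validator message: some crawlers, Google's among them, do accept `*` (any run of characters) and a trailing `$` (end of URL) in paths, while stricter validators reject them. Even where wildcards are accepted, `Disallow: /*/` would likely not do what the question wants, because it only matches paths that contain a second slash. Here is a minimal Python sketch of that matching logic; `rule_matches` is a hypothetical helper written for illustration, not part of any crawler or library.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Check a robots.txt path pattern against a URL path.

    Assumed semantics (wildcard-aware crawlers): '*' matches any run of
    characters, a trailing '$' anchors the end of the path, and otherwise
    the pattern only has to match a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += r"\Z"
    # re.match anchors at the start, which gives the prefix behaviour.
    return re.match(regex, path) is not None


for path in ["/", "/foo", "/bar.html", "/foo/", "/foo/bar"]:
    print(path, rule_matches("/*/", path))
```

Under these assumed semantics, domain.com/foo and domain.com/bar.html are not matched by `/*/`, so they would stay crawlable.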

This might be what you’re looking for:

User-Agent: *
Allow: /index.html
Disallow: /

This assumes your homepage is index.html.

If index.php is your homepage, you should be able to swap out index.html for index.php.

User-Agent: *
Allow: /index.php
Disallow: /

On my dynamic websites that run through index.php, going to mydomain.com/index.php still takes me to the homepage, so the above should work.
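One way to sanity-check rules like these locally is Python's standard-library `urllib.robotparser`. This is only a sketch: the `check` helper and the example paths are assumptions made for illustration, and this parser uses plain prefix matching, so real crawlers may treat some rules differently.

```python
from urllib.robotparser import RobotFileParser

# The rules suggested above, assuming index.html is the homepage.
ROBOTS_TXT = """\
User-agent: *
Allow: /index.html
Disallow: /
"""


def check(paths, robots_txt=ROBOTS_TXT, agent="*"):
    """Return {path: allowed} for each path under the given rules."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so nothing is fetched over the network.
    parser.parse(robots_txt.splitlines())
    return {p: parser.can_fetch(agent, "http://domain.com" + p) for p in paths}


results = check(["/", "/index.html", "/foo", "/bar.html"])
for path, allowed in results.items():
    print(path, "allowed" if allowed else "blocked")
```

With this parser, `/index.html` comes back as allowed, while `/`, `/foo` and `/bar.html` come back as blocked. So it is worth confirming how your target crawler treats the bare domain.com/ URL. For crawlers that support the `$` anchor, `Allow: /$` ahead of `Disallow: /` is a commonly suggested alternative, though that is an assumption to verify against the crawler's own documentation.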
