On a Linux, Apache, PHP site, I need to make certain that a subdirectory

Question

0

Asked: May 27, 20262026-05-27T00:30:14+00:00 2026-05-27T00:30:14+00:00

On a Linux, Apache, PHP site, I need to make certain that a subdirectory

0

On a Linux, Apache, PHP site, I need to make certain that a subdirectory /cms, on my website is not crawlable by the search engines.

See, in the root of the site, I have installed a product catalog called Pinnacle Cart. They wanted a News page that pulls content from a CMS. I brought WordPress online in a subdirectory called /cms, created some posts, and then used the following code to bring that into my Pinnacle Cart theme:

<?php require_once('../../../cms/wp-blog-header.php'); ?>
<?php $i = 1; $MAX_ARTICLES_TO_SHOW = 5; ?>
<?php while (have_posts()): the_post(); ?>
    <div <?php post_class() ?> id="post-<?php the_id(); ?>">
        <h2><?php the_title(); ?></h2>
        <div class="entry">
            <?php the_content(); ?>
        </div><!-- .entry -->
        <div style="clear:both;">&nbsp;</div>
        <small><?php the_time('F j, Y') ?></small>
    </div><!-- #post-... -->
<?php ++$i; if ($i > $MAX_ARTICLES_TO_SHOW) { break; } ?>
<?php endwhile; ?>

Note that some of the images used in the posts will pull from /cms, and I want those to load okay, but I don’t want Google or any search engine to follow anything under /cms.

Note also in WordPress in /cms, I checked off the setting “Do not let sites like Google, Technorati, etc. index this site.”

I’m thinking I’ll need to either adjust the default theme for the WordPress under /cms/wp-content/themes, or put some sort of .htaccess setting in the /cms or / (root) folder of the site.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-27T00:30:15+00:00

Editorial Team

2026-05-27T00:30:15+00:00Added an answer on May 27, 2026 at 12:30 am

You can add this to your robots.txt file.

Disallow: /cms/

Reads more about it at http://www.robotstxt.org/robotstxt.html

Search engines and scrapers can always ignore this though (Most large search engines will follow the rules). You could check the $_SERVER['HTTP_USER_AGENT'] too, but this can be faked. There is no 100% way of stopping scrapers.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

On a Linux, Apache, PHP site, I need to make certain that a subdirectory

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply