Tech Support > Computers & Technology > Is my robots.txt file set up properly?
Is my robots.txt file set up properly?
Posted by why? on September 20th, 2007



On Thu, 20 Sep 2007 09:38:25 -0700, Evan Platt wrote:

<snip>

Maybe something like
http://www.clockwatchers.com/robots_bad.html
robots.txt Tutorial - Block Bad Bots

Some bots will ignore robots.txt files as they don't care if you want
them on your web site or not.

These can be blocked by using a .htaccess file instead.


Another site also mentions blocking a named bot, as above.
http://www.thesitewizard.com/archive/robotstxt.shtml
If you have a particular robot in mind, such as the picsearch robot, you
may have lines like the following:

User-agent: psbot
Disallow: /

This means that the picsearch robot, "psbot", should not try to access
any file in the root directory "/" and all its subdirectories. This
effectively means that psbot is banned from the entire of your website.

You can have multiple Disallow lines for each user agent (ie, for each
spider). Here is an example of a longer robots.txt file:


Maybe adding the meta tag to pages automatically when requested?
http://www.askapache.com/seo/updated...wordpress.html
Robots Meta Tag


HTH
Me


Similar Posts