Banning spiders and agents
Posted by hassen1, 29th January 2007, 01:59 AM

If you notice entries like Teleport Pro and WebStripper in your traffic reports, someone's been busy attempting to download your web site. You don't have to just sit back and let this happen. If you are commercially hosted, you'll be able to add a couple of lines to your robots.txt file to ask repeat offenders to stay away. Bear in mind that robots.txt is only a request: agents that honor it will stop, but a ripper set up to ignore it will not be blocked by this file alone.
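If your host gives you raw access logs rather than a traffic report, a short script can tally the agents for you. The following is only a sketch: it assumes the log uses the common Apache "combined" format, where the user agent is the last quoted field on each line, and both the function name count_user_agents and the sample lines are made up for illustration.

```python
import re
from collections import Counter

# Assumption: "combined" log format, where the user agent is the
# last double-quoted field on the line.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')

def count_user_agents(log_lines):
    """Count how many requests each user agent made."""
    counts = Counter()
    for line in log_lines:
        match = UA_PATTERN.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts

# Made-up sample lines, for illustration only.
sample_log = [
    '203.0.113.5 - - [29/Jan/2007:01:59:00 -0400] "GET / HTTP/1.1" 200 512 "-" "Teleport Pro/1.29"',
    '203.0.113.5 - - [29/Jan/2007:01:59:01 -0400] "GET /a.html HTTP/1.1" 200 734 "-" "Teleport Pro/1.29"',
    '198.51.100.7 - - [29/Jan/2007:02:03:10 -0400] "GET / HTTP/1.1" 200 512 "-" "WebStripper/2.62"',
]

agent_counts = count_user_agents(sample_log)
for agent, hits in agent_counts.most_common():
    print(hits, agent)
```

Agents with an unusually high number of hits in a short period are the ones worth looking up before you decide to add an entry for them.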

The robots.txt file gives search engine spiders and agents direction by telling them which directories and files they are allowed to examine and retrieve. This convention is known as the Robots Exclusion Standard.

To prevent certain agents and spiders from accessing any part of your web site, simply enter the following lines into the robots.txt file:

User-agent: NameOfAgent
Disallow: /

Ensure that you enter the name of the agent as it appeared in your reports/logs, e.g. Teleport Pro/1.29, and that there is a separate entry for each agent. The standard recommends that robots match on the name without version information, so the bare name (Teleport Pro) is the more portable choice if an agent changes versions. Skip a line between entries. You could do the same to exclude search engine spiders, but somehow I don't think you'll really want to do this :0). The "/" in the above example means disallow access to the entire site. You can also disallow access by spiders and agents to certain directories only, e.g.

User-agent: *
Disallow: /cgi-bin/

In this example the asterisk (wildcard) in the User-agent line indicates "all agents". Don't use the asterisk in the Disallow statement to indicate "all"; use the forward slash instead.
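Before uploading a robots.txt file, it can be worth checking that the rules do what you expect. Here is a minimal sketch using Python's standard urllib.robotparser module. The agent names, version numbers and paths are only examples, and the helper name load_rules is made up. Note that this particular parser compares the name without the version, so the entries below use the bare agent names; other robots may match differently.

```python
from urllib.robotparser import RobotFileParser

# Example rules: two named agents banned from the whole site,
# everyone else kept out of /cgi-bin/ only.
ROBOTS_TXT = """\
User-agent: Teleport Pro
Disallow: /

User-agent: WebStripper
Disallow: /

User-agent: *
Disallow: /cgi-bin/
"""

def load_rules(text):
    """Parse robots.txt text and return a ready-to-query parser."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser

rules = load_rules(ROBOTS_TXT)

# The named agents are refused everywhere.
print(rules.can_fetch("Teleport Pro/1.29", "/index.html"))     # False
print(rules.can_fetch("WebStripper/2.62", "/index.html"))      # False

# Any other agent is only refused inside /cgi-bin/.
print(rules.can_fetch("SomeOtherBot", "/cgi-bin/form.cgi"))    # False
print(rules.can_fetch("SomeOtherBot", "/index.html"))          # True
```

This only shows how a rule-following robot would read the file. It says nothing about agents that never request robots.txt in the first place.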