Home Resources Forum Advertise Contact

Go Back   Webmaster Forums > Search Engines > Search Engine Spiders

Search Engine Spiders Exchange ideas and opinions about search engine spiders here. Discuss about identifying search engine spiders' spidering habits and make it to your advantage here.

Reply
 
LinkBack Thread Tools
  #1 (permalink)  
Old 22nd June 2006, 07:19 AM
varun1182 varun1182 is offline
WD Addict Poster
 
Join Date: 21st June 2006
Posts: 200
Default Explain robots.txt

I found following line in an article regarding robots.txt
"The robots.txt file should be created in Unix line ender mode! Do not attempt to use an HTML editor that does not specifically have a text mode to create a robots.txt file"

Does this restrict me to use text editors like editplus, editpad to create robots.txt??
Reply With Quote
  #2 (permalink)  
Old 22nd June 2006, 11:48 AM
athmane athmane is offline
WD Addict Poster
 
Join Date: 20th June 2006
Posts: 100
Default

Greetings Var,

The robots.txt file is a a text file so naturally you can use any text editor to create it. I think what the Article was trying to get across is that some HTML editors might not render text, so they shouldn't be used.

If you want a hassle-free robots.txt file created on the fly for all major search engines robots out there, then I recommend using the free Robots.txt Generator .
Reply With Quote
  #3 (permalink)  
Old 14th September 2006, 06:40 AM
backstage backstage is offline
WD Addict Poster
 
Join Date: 21st June 2006
Posts: 200
Default

Be careful with robots!

There are robots that visit your site with bad intentions!
There are many, robots whose sole purpose is to scan your website and extract your email address for spamming purposes!
Reply With Quote
  #4 (permalink)  
Old 16th September 2006, 04:01 PM
natasha85 natasha85 is offline
WD Newbie
 
Join Date: 9th June 2006
Posts: 36
Default

oh, that's scary, how do we prevent those from happening?
anti spam? anti virus?
are those helpful in dealing with those bad robots?
Reply With Quote
  #5 (permalink)  
Old 17th September 2006, 08:49 AM
backstage backstage is offline
WD Addict Poster
 
Join Date: 21st June 2006
Posts: 200
Default

www.robotstxt.org/wc/exclusion-admin.html

This site will help you with that.
Reply With Quote
  #6 (permalink)  
Old 3rd January 2007, 08:45 PM
fredilan fredilan is offline
Super Moderator
 
Join Date: 18th December 2006
Posts: 49
Default

one way on how to prevent using such malicious robot scripts is by using Robots from credible sources.
Reply With Quote
Reply



Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

Similar Threads
Thread Thread Starter Forum Replies Last Post
How to set up a robots.txt to control search engine spiders arpan911 Search Engine Spiders 3 20th June 2008 06:56 PM
Supporting wildcards in robots.txt hassen1 Yahoo! News 1 18th November 2006 12:06 AM
Yahoo! slurp now supports wildcards in robots.txt hassen1 Yahoo! News 2 16th November 2006 12:50 AM
Google ignores the meta robots noindex tag. dizyn1 Google Search Engine Optimization 0 21st June 2006 04:23 AM


All times are GMT -4. The time now is 01:03 AM.


Powered by vBulletin
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Search Engine Optimization by vBSEO 3.0.0 RC6
vB Ad Management by =RedTyger=