Harnessing the Power of Robots.txt

   
Once we have a website up and running, we need to make sure that all visiting search engines can access all the pages we want them to look at.

Sometimes, we may want search engines not to index certain parts of the site, or we may even want to ban particular search engines from the site altogether.

This is where a simple text file called robots.txt comes in. In its most basic form it is only two lines long.

Robots.txt resides in your website's main (root) directory, so that it is served at /robots.txt. On many Linux hosting accounts this is the /public_html/ directory. The file looks something like the following:

User-agent: *
Disallow:

The first line names the 'bot' the rule applies to (* means every bot). The second line controls whether that bot is allowed in, or which parts of the site it is not allowed to visit. An empty Disallow line, as above, blocks nothing, so the bot may visit everything.
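As a rough illustration of how a well-behaved bot reads these two lines, here is a minimal sketch using Python's standard urllib.robotparser module. The host www.example.com, the bot name "anybot", the /private/ directory and the helper load_rules are all made up for the example and are not from the article.

```python
from urllib.robotparser import RobotFileParser


def load_rules(robots_text):
    """Parse robots.txt content held in a string (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


# The two-line file from the article: an empty Disallow blocks nothing.
allow_all = load_rules("User-agent: *\nDisallow:\n")

# Assumed variation: keep every bot out of a made-up /private/ directory.
block_private = load_rules("User-agent: *\nDisallow: /private/\n")

print(allow_all.can_fetch("anybot", "http://www.example.com/page.html"))
print(block_private.can_fetch("anybot", "http://www.example.com/private/notes.html"))
print(block_private.can_fetch("anybot", "http://www.example.com/page.html"))
```

Only the page under /private/ is reported as off limits. Note that robots.txt is a request, not an access control: only bots that choose to honour the file will follow it.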

If you want to handle multiple 'bots', simply repeat the above lines for each one, leaving a blank line between the groups.
For example:

User-agent: googlebot
Disallow:

User-agent: askjeeves
Disallow: /

This will allow Google (user-agent name GoogleBot) to visit every page and directory, while at the same time banning Ask Jeeves from the site completely.
To find a 'reasonably' up-to-date list of robot user-agent names, visit http://www.robotstxt.org/wc/active/html/index.html
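The two-bot example above can be checked the same way. This is a sketch only: it uses Python's standard urllib.robotparser, takes the user-agent names exactly as they appear in the article, and assumes a placeholder host (www.example.com), a made-up third bot and a hypothetical helper called allowed.

```python
from urllib.robotparser import RobotFileParser

# The example file from the article, one record per bot.
ROBOTS_TXT = """\
User-agent: googlebot
Disallow:

User-agent: askjeeves
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(bot, path):
    """Return True if `bot` may fetch `path` on the placeholder host."""
    return parser.can_fetch(bot, "http://www.example.com" + path)


print(allowed("googlebot", "/any/page.html"))     # first record: nothing disallowed
print(allowed("askjeeves", "/any/page.html"))     # second record: whole site disallowed
print(allowed("someotherbot", "/any/page.html"))  # no record names this bot
```

A bot that is not named in any record, and for which there is no * record, is treated as unrestricted, which is why the third check passes.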

Even if you want to allow every robot to index every page of your site, it's still advisable to put a robots.txt file on your site. It will stop your error logs filling up with "not found" (404) entries from search engines requesting a robots.txt file that doesn't exist.
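If you would rather generate the file than type it by hand, something along these lines would do. It is a sketch under assumptions: the function name build_robots_txt and the shape of its rules argument are invented for this example, and the output still has to be saved as robots.txt in the site's main directory.

```python
def build_robots_txt(rules):
    """Build robots.txt text from a mapping (hypothetical helper).

    `rules` maps a user-agent name to a list of path prefixes that bot
    should stay out of. An empty list means the bot may visit everything.
    """
    records = []
    for agent, disallowed in rules.items():
        lines = [f"User-agent: {agent}"]
        # An empty list becomes a bare "Disallow:" line, which blocks nothing.
        lines += [f"Disallow: {path}" for path in disallowed] or ["Disallow:"]
        records.append("\n".join(lines))
    # Records are separated by a blank line, as in the article's example.
    return "\n\n".join(records) + "\n"


# The permissive file that lets every bot index every page.
allow_everything = build_robots_txt({"*": []})
print(allow_everything)

# The article's two-bot example, rebuilt from a mapping.
two_bots = build_robots_txt({"googlebot": [], "askjeeves": ["/"]})
print(two_bots)
```

The first call reproduces the basic two-line file, and the second reproduces the googlebot/askjeeves example.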

For more information on robots.txt, see the full list of resources at http://www.websitesecrets101.com/robotstxt-further-reading-resources

Author: Bruce Hearder
 
Author Bio:

Bruce Hearder owns and runs www.online-money101.com. Sign up for the Online-Money101 newsletter and learn the simple techniques that everyday people like yourself use to make money on the web, every single day. Visit www.online-money101.com today.

© 2006-2008 www.adamsarticles.com All Rights Reserved Worldwide.