Yaliasearch engine optimizationPHP integrationOS Commerceonline marketingsmall business web sitesintegrated marketingpay per clickonline advertisingkeyword optimizationGUI designmeta tagsweb graphicssearch engine optimizationpaid inclusionfree consultationweb designsearch engine optimizationweb designsearch engine optimisationaffordable web designsearch engine optimizationbusiness web designsearch engine optimizatione-commerce developmentSEO expertsPHP applicationscontent management systemweb designshopping cart softwaremarketing communicationmarcom expertsweb designPHPbb, bBlog, Open Realty, PostNuke, OS Commerce, Simple PollPostNukeBoulder Coloradoweb designsearch engine placementColorado web designlink management softwarepersonal organizerjob search softwareweb designsearch engine rankingweb designAsk Jeevesweb designsearch engine optimizationweb designGoogle optimizationweb designopen source toolsweb designsearch engine optimizationPHP applicationssearch engine submissionsPHP programmingkeyword analysisPHP application designsearch engine rankingcustom programmingGoogle PageRankgraphic designsearch engine optimisationweb designTeoma placement
 
 
home           what           who           why           when           how           where          contact          
 
 

Meta Tags 105: No Will Robinson! - The Robots Exclusion Protocol

The Robots Exclusion Protocol is a method that allows Web site administrators to indicate to visiting robots which parts of their site should not be visited by the robot. Simply put, when a Robot vists a Web site, say http://www.nederlandinternet.com/, it firsts checks for http://www.nederlandinternet.com/robots.txt. If it can find this document, it will analyze its contents for records like:

User-agent: *
Disallow: /

to see if it is allowed to retrieve the document.

The "/robots.txt" file usually contains a record looking like this:

User-agent: *
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /images/

In this example, three directories are excluded to all robots. The user-agent tag is where you specify whether all robots (*) or only specific robots are to follow these instructions.

One thing to remember is that you need a separate "Disallow" line for every URL prefix you want to exclude. You can exclude entire directories, as shown in the example above, or specific files, as in "Disallow: /images/mypicture.jpg."

Note also that regular expression are not supported in either the User-agent or Disallow lines. The '*' in the User-agent field is a special value meaning "any robot". Specifically, you cannot have lines like "Disallow: /tmp/*" or "Disallow: *.jpg".

What you want to exclude depends on your server. Everything not explicitly disallowed is considered fair game to retrieve. This brings us to a discussion of META TAGS [more].

View list of all articles.


Yalia Technology Design
Nederland, Colorado
v:(303) 258-0333
e: yaliadesign @ gmail.com

About Yalia Technology Design

Yalia Technology Design was formed to fill a specific gap in the marketplace, as a company that offers clients something more than a traditional web design company. Pulling from expertise in the areas of technology, project management, marketing communications and branding, Yalia provides a multi-disciplinary approach to web design and custom application development. More than just a technology company - we also offer a full range of marketing services including traditional collateral design, brand consulting, online and email marketing services, search engine optimization and search engine submissions.

SEO Services  php MySQL web design  Web Site Tools  custom programming  Portfolio  Nederland web design  FAQs  open source integration  Resources & Downloads  recommended reading  Books  open source tools  Links PHP based web site tools  Customers  PHP programming  Articles  PHP programming  Billing  PHP programming  Order

copyright 2003, Yalia Technology Design
Yaliasearch engine optimizationPHP integrationOS Commerceonline marketingsmall business web sitesintegrated marketingpay per clickonline advertisingkeyword optimizationGUI designmeta tagsweb graphicssearch engine optimizationpaid inclusionfree consultationweb designsearch engine optimizationweb designsearch engine optimisationaffordable web designsearch engine optimizationbusiness web designsearch engine optimizatione-commerce developmentSEO expertsPHP applicationscontent management systemweb designshopping cart softwaremarketing communicationmarcom expertsweb designPHPbb, bBlog, Open Realty, PostNuke, OS Commerce, Simple PollPostNukeBoulder Coloradoweb designsearch engine placementColorado web designlink management softwarepersonal organizerjob search softwareweb designsearch engine rankingweb designAsk Jeevesweb designsearch engine optimizationweb designGoogle optimizationweb designopen source toolsweb designsearch engine optimizationPHP applicationssearch engine submissionsPHP programmingkeyword analysisPHP application designsearch engine rankingcustom programmingGoogle PageRankgraphic designsearch engine optimisationweb designTeoma placement