Skip Navigation Links 

U of A University Information Technology Services

Was this page helpful?
 [+]





...Or log into AskIT
and request help.

 

Search Engines

How can I get search engines to find my Web site?

You can use meta tags, but they are not the entire solution. When a crawling robot finds your page, this will tell them to catalog it:
<META name="ROBOTS" content="INDEX">

Meta keywords also help search engines find and catalog your site. For example:
<META name="keywords" content="stamps, stamp collecting, stamp history, prices, stamps for sale">

You can also submit your site to search engines; this is usually the best way to be indexed.
For Google:
http://www.google.com/addurl/?continue=/addurl
For Yahoo:
http://search.yahoo.com/info/submit.html

Frames, splash screens, dynamically generated pages, and using images instead of text can cause your site to not be indexed.

How can I keep search engines from finding my web page?

The Robots Exclusion Protocol is a method that allows web site administrators to indicate to visiting robots which parts of their site should not be visited by the robot by placing a robots.txt file in the top level of the server which the robot checks first to see which documents it is allowed to retrieve. If you are not a server administrator, you will not be able to place rules in this file.

The Robots META tag allows HTML authors to indicate to visiting robots if a document may be indexed or used to harvest more links. No server administrator action is required.
Note: Currently only a few robots implement this.

In this example:

<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">

a robot should neither index this document nor analyse it for links.

Like other META tags, the Robots META tag should be placed in the HEAD of an HTML page:

<html>
<head>
<meta name="robots" content="noindex,nofollow">
<meta name="description" content="This page ....">
<title>...</title>
</head>
<body>
...

The content of the Robots META tag contains directives separated by commas. The currently defined directives are [NO]INDEX and [NO]FOLLOW. The INDEX directive specifies if an indexing robot should index the page. The FOLLOW directive specifies if a robot is to follow links on the page. The defaults are INDEX and FOLLOW. The values ALL and NONE set all directives on or off.

Some examples:

<meta name="robots" content="index,follow">
<meta name="robots" content="noindex,follow">
<meta name="robots" content="index,nofollow">
<meta name="robots" content="noindex,nofollow">

Note the "robots" name of the tag. The content are case insensitive. You should not specify conflicting or repeating directives.

For more information, see The Web Robots Pages.

 

 

Thank you for visiting UITS. This page can be found at:
http://uits.uark.edu/web/index_4770_ENG_HTML.htm