|
These tags should be placed within the HEAD element.
Keywords and phrases in the content attribute of the
tag with name="keywords"
can be separated by commas for readability, although
spiders usually ignore the separators. The maximum number of keywords
depends on the search engine; for some engines, 25 words or 200 characters have
been quoted as the upper limit.
If the engine honors these tags, the keywords are added to
the searchable representation of the document in the engine's
database, and the description is stored as the summary
displayed for the document in a list of results. In the absence
of a description, most search engines take the first lines of
text on the page instead.
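As a rough illustration of how a spider might read these tags, the sketch below uses Python's standard html.parser module to collect the name/content pairs of META tags found inside HEAD. The class name MetaExtractor, the helper functions, and the sample page are assumptions made for this example; real spiders differ in how they parse pages and weigh keywords.

```python
from html.parser import HTMLParser


class MetaExtractor(HTMLParser):
    """Collects name/content pairs from META tags that appear inside HEAD."""

    def __init__(self):
        super().__init__()
        self.in_head = False
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        # HTMLParser reports tag and attribute names in lowercase.
        if tag == "head":
            self.in_head = True
        elif tag == "meta" and self.in_head:
            attr = dict(attrs)
            name = (attr.get("name") or "").lower()
            if name:
                self.meta[name] = attr.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "head":
            self.in_head = False


def extract_meta(html):
    """Return a dict such as {"keywords": "...", "description": "..."}."""
    parser = MetaExtractor()
    parser.feed(html)
    parser.close()
    return parser.meta


def split_keywords(content):
    """Split a keywords string on commas, dropping empty entries."""
    return [k.strip() for k in content.split(",") if k.strip()]


# Hypothetical page used only for illustration.
SAMPLE_PAGE = """<HTML><HEAD>
<TITLE>Example page</TITLE>
<META name="keywords" content="search engines, spiders, indexing">
<META name="description" content="How spiders index web pages.">
</HEAD><BODY>First lines of text.</BODY></HTML>"""

meta = extract_meta(SAMPLE_PAGE)
print(split_keywords(meta.get("keywords", "")))
print(meta.get("description", ""))
```

Splitting on commas here is a convenience for display; as noted above, an engine may ignore the separators and treat the content as a plain run of words.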
Another use of the META tag is to exclude a page from
indexing by spiders. By adding the following tag,
<META name="robots" content="noindex">
you instruct any spiders that run into your page to bypass it
without indexing.
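A spider that follows this convention has to look for the tag before indexing a page. The sketch below shows one way that check could be written with Python's standard html.parser module; the names RobotsMetaParser and may_index are hypothetical and do not describe any particular spider.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Gathers the directives listed in <META name="robots" content="...">."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            # Directives are compared case-insensitively.
            self.directives.update(
                d.strip().lower() for d in content.split(",") if d.strip()
            )


def may_index(html):
    """Return False when the page asks robots not to index it."""
    parser = RobotsMetaParser()
    parser.feed(html)
    parser.close()
    return "noindex" not in parser.directives


# Hypothetical pages used only for illustration.
excluded_page = '<HTML><HEAD><META name="robots" content="noindex"></HEAD></HTML>'
ordinary_page = "<HTML><HEAD><TITLE>Welcome</TITLE></HEAD></HTML>"

print(may_index(excluded_page))
print(may_index(ordinary_page))
```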
However, not all spiders support this convention. A more reliable
solution is to add a robots.txt file to the root directory of
your web server, listing the files and directories that must be excluded from
indexing. For example, your robots.txt might contain these lines:
User-agent: *
Disallow: /dont_index_me.html
Disallow: /hidden_dir/
With these lines, no robot that honors the convention will scan the dont_index_me.html document,
nor any document under /hidden_dir/. For more information
on robots exclusion, refer to http://info.webcrawler.com/mak/projects/robots/exclusion.html.
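To check how such rules are interpreted, Python's standard urllib.robotparser module can evaluate them locally. The sketch below feeds it the three lines from the example above; the user-agent name "ExampleBot", the helper build_rules, and the test paths are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# The same rules as in the robots.txt example above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /dont_index_me.html
Disallow: /hidden_dir/
"""


def build_rules(robots_txt):
    """Parse robots.txt text without fetching anything over the network."""
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    return rules


rules = build_rules(ROBOTS_TXT)

# "ExampleBot" is a made-up robot name; "User-agent: *" applies to every robot.
for path in ("/index.html", "/dont_index_me.html", "/hidden_dir/notes.html"):
    verdict = "allowed" if rules.can_fetch("ExampleBot", path) else "excluded"
    print(path, verdict)
```

Note that this only shows what a well-behaved robot would conclude from the file; robots.txt is a request, and nothing in it prevents a robot from ignoring the rules.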
|