- Index term
An index term, or descriptor, in information retrieval is a term that captures the essence of the topic of a document. It is used as a keyword to retrieve documents in an information system, for instance a catalog or a search engine. A popular form of keywords on the web are tags, which are directly visible and can also be assigned by non-experts. Index terms can consist of a word, phrase, or alphanumerical term. They are created by analyzing the document either manually with subject indexing or automatically with automatic indexing or more sophisticated methods of keyword extraction. Index terms can either come from a controlled vocabulary or be freely assigned.

Keywords are stored in a search index. Common words like articles (a, an, the) and conjunctions (and, or, but) are often not treated as keywords because it is inefficient to do so: almost every English-language site on the Internet contains the article "the", so searching for it does nothing to narrow the results. The most popular search engine, Google, removed stop words such as "the" and "a" from its indexes for several years, but then re-introduced them, making certain types of precise search possible again.

The term "descriptor" was coined by Calvin Mooers in 1948.

The Simple Knowledge Organisation System language (SKOS) provides a way to express index terms with the Resource Description Framework for use in the context of the Semantic Web.

References
Svenonius, Elaine (2000). The Intellectual Foundation of Information Organization (1st ed.). The MIT Press. ISBN 0262194333.
Wikimedia Foundation. 2010.