Search engine (computing)

A search engine is an information retrieval system designed to help find information stored on a computer system. Search engines help to minimize the time required to find information and the amount of information that must be consulted, akin to other techniques for managing information overload.

The most publicly visible form of a search engine is a Web search engine, which searches for information on the World Wide Web.

How search engines work

Search engines provide an interface to a group of items that enables users to specify criteria about an item of interest and have the engine find the matching items. The criteria are referred to as a search query. In the case of text search engines, the search query is typically expressed as a set of words that identify the desired concept that one or more documents may contain [Voorhees, E.M. Natural Language Processing and Information Retrieval (http://www.nist.gov/itl/iad/894.02/works/papers/nlp_ir.ps). National Institute of Standards and Technology. March 2000.]. There are several styles of search query syntax that vary in strictness. Whereas some text search engines require users to enter two or three words separated by white space, other search engines may enable users to specify entire documents, pictures, sounds, and various forms of natural language. Some search engines modify search queries to increase the likelihood of returning a quality set of items, a process known as query expansion.
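As a rough illustration of query expansion, the sketch below adds synonyms to each query term before matching. The synonym table and the function name expand_query are assumptions made for this example; a real engine would typically derive such relations from a thesaurus, query logs, or statistical analysis.

```python
# Hypothetical synonym table, invented for this example only.
SYNONYMS = {
    "car": ["automobile", "vehicle"],
    "fast": ["quick", "rapid"],
}


def expand_query(query, synonyms=SYNONYMS):
    """Return the query terms plus any known synonyms, without duplicates."""
    expanded = []
    for term in query.lower().split():
        # Keep the original term first, then append its synonyms (if any).
        for candidate in [term] + synonyms.get(term, []):
            if candidate not in expanded:
                expanded.append(candidate)
    return expanded


print(expand_query("fast car"))
# ['fast', 'quick', 'rapid', 'car', 'automobile', 'vehicle']
```

Terms with no entry in the table pass through unchanged, so the expanded query always contains at least the words the user typed.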

The list of items that meet the criteria specified by the query is typically sorted, or ranked. Ranking items by relevance (from highest to lowest) reduces the time required to find the desired information. Probabilistic search engines rank items based on measures of similarity (between each item and the query, typically on a scale of 1 to 0, 1 being most similar) and sometimes popularity or authority (see Bibliometrics), or use relevance feedback. Boolean search engines typically return only items that match exactly, without regard to order, although the term "boolean search engine" may simply refer to the use of boolean-style syntax (the operators AND, OR, NOT, and XOR) in a probabilistic context.
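The difference between the two approaches can be sketched in a few lines. The example below uses cosine similarity over raw term counts as the similarity measure, which is one common choice and an assumption here, not a method named in the text; the function names and sample documents are likewise hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Similarity between two texts on a 0..1 scale, using raw term counts."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[t] * vb[t] for t in va)
    norm = math.sqrt(sum(c * c for c in va.values())) * math.sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0


def rank(query, documents):
    """Probabilistic style: score every document and sort from most to least similar."""
    scored = [(cosine_similarity(query, doc), doc) for doc in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


def boolean_and(query, documents):
    """Boolean style: keep only documents containing every query term, unranked."""
    terms = set(query.lower().split())
    return [doc for doc in documents if terms <= set(doc.lower().split())]


docs = ["web search engine", "desktop search tools for files", "cooking recipes"]
for score, doc in rank("web search", docs):
    print(round(score, 3), doc)
print(boolean_and("web search", docs))
```

The ranked list still includes partial matches with lower scores, whereas the boolean AND filter returns only the one document that contains both terms.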

To quickly provide a set of matching items sorted according to some criteria, a search engine will typically collect metadata about the group of items under consideration beforehand, through a process referred to as indexing. The index typically requires less computer storage than the items themselves, which is why some search engines store only the indexed information and not the full content of each item, and instead provide a method of navigating to the items from the search engine result page. Alternatively, the search engine may store a copy of each item in a cache so that users can see the state of the item at the time it was indexed, for archival purposes, or to make repetitive processes faster and more efficient.
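One widely used index structure is the inverted index (listed under See also), which maps each term to the set of items containing it. The following is a minimal sketch under simplifying assumptions: whitespace tokenization, lowercase folding, and hypothetical function names and sample documents.

```python
from collections import defaultdict


def build_index(documents):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing every query term (AND semantics)."""
    terms = query.lower().split()
    if not terms:
        return set()
    # Copy the first posting set so intersecting does not modify the index.
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


documents = {
    1: "search engines index documents",
    2: "an index speeds up search",
    3: "crawlers follow links",
}
index = build_index(documents)
print(sorted(search(index, "index search")))  # [1, 2]
```

Because the index is built once ahead of time, answering a query only requires looking up a few terms and intersecting their sets, not scanning every document.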

Other types of search engines do not store an index. Crawler- or spider-type search engines (also known as real-time search engines) may collect and assess items at the time of the search query, dynamically considering additional items based on the contents of a starting item (known as a seed, or seed URL in the case of an Internet crawler). Meta search engines store neither an index nor a cache and instead simply reuse the index or results of one or more other search engines to provide an aggregated, final set of results.
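Both ideas can be sketched without any network access. In the example below, the in-memory PAGES table stands in for the Web, and crawl_and_match follows links breadth-first from a seed while checking each page against the query term. The merge_results function combines ranked lists from several engines by summing reciprocal ranks; that scoring rule is an assumption chosen for illustration, not a method described in the text, and all names are hypothetical.

```python
from collections import deque

# Hypothetical in-memory "web": page id -> (page text, outgoing links).
PAGES = {
    "a": ("search engine basics", ["b", "c"]),
    "b": ("cooking recipes", ["d"]),
    "c": ("how a search index works", []),
    "d": ("search tips", ["a"]),
}


def crawl_and_match(seed, term, pages=PAGES, max_pages=10):
    """Visit pages breadth-first from the seed and return those containing the term."""
    seen = {seed}
    queue = deque([seed])
    matches = []
    visited = 0
    while queue and visited < max_pages:
        url = queue.popleft()
        visited += 1
        text, links = pages.get(url, ("", []))
        if term in text.split():
            matches.append(url)
        for link in links:
            if link not in seen:  # avoid revisiting pages in link cycles
                seen.add(link)
                queue.append(link)
    return matches


def merge_results(result_lists):
    """Aggregate ranked lists: each item scores the sum of 1/position across lists."""
    scores = {}
    for results in result_lists:
        for position, item in enumerate(results, start=1):
            scores[item] = scores.get(item, 0.0) + 1.0 / position
    # Highest score first; ties are broken alphabetically for a stable order.
    return sorted(scores, key=lambda item: (-scores[item], item))


print(crawl_and_match("a", "search"))                        # ['a', 'c', 'd']
print(merge_results([["x", "y", "z"], ["y", "x", "w"]]))     # ['x', 'y', 'w', 'z']
```

The max_pages limit bounds the work done per query, which matters for a crawler that assesses items at query time instead of consulting a prebuilt index.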

See also

*Automatic summarization
*Bibliographic database
*Desktop search
*Enterprise search
*Federated search
*Human search engine
*Image search
*Index (search engine)
*Inverted index
*List of search engines
*Medical literature retrieval
*Metasearch engine
*Search engine optimization
*Semantic search
*Spamdexing
*Text mining
*Vertical search
*Video search engine

Wikimedia Foundation. 2010.
