Article The article features the search engine as an efficient method to look for required information on the Internet and gives a detailed description of its operation.
Daily we use Internet services and search engines in particular when seeking information. The search results are usually called hits and are provided in the form of a list. The information may consist of web pages, images, information and other types of files. Some search tools also collect data available in databases or open directories. If compared with Web directories which are maintained by human editors, search tools operate automatically or are a mixture of algorithmic and human input.
Internet search tools work by storing information about countless web pages which they retrieve from the WWW. These pages are retrieved by An Internet crawler, also known as a spider. It is an automated Web browser which follows every link it finds. The content of each page is then analyzed to decide how to index it. Words, for instance, are taken from titles, headings and subheadings or special fields called meta tags. Data about web pages are stored in an index database for further use in queries. Some search tools, such as Google, store the entire or part of the source page (referred to as a cache) and information about web pages, while others, such as AltaVista, store every word of every page they discover. This cached page always holds the actual search text, because it is the one that was actually indexed. Hence, it can be very useful because it holds data that can no longer be available elsewhere.
When a web user types key words in the search field, the engine browse through its index and displays a list of the most suitable web pages according to its criteria, normally with a short summary containing the document's title and at times parts of the text. Some search tools offer an advanced feature called proximity search which allows users to determine the length between search words.
The usefulness of a search engine rests on the relevancy of the result set it provides. Since there may be millions of web pages that contain a particular key term or word combination, web pages can be grouped into relevant and irrelevant ones. The results can be ranked to display the "best" ones first.
How a search engine determines which pages are the best matches, and in what arrangement the results should be shown, varies from one engine to another. The methods also alter with time, since the use of Internet services undergoes alterations and new techniques emerge.