distributed search

Part of the TechTarget Network of Enterprise IT Web Sites

Search our IT-specific encyclopedia for:
 
Browse alphabetically:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z #
All Categories Internet Applications

distributed search

Distributed search is a search engine model in which the tasks of Web crawling, indexing and query processing are distributed among multiple computers and networks.

Originally, most search engines were supported by a single supercomputer . In recent years, however, most have moved to a distributed model. Google search, for example, relies upon thousands of computers crawling the Web from multiple locations all over the world.

In Google's distributed search system, each computer involved in indexing crawls and reviews a portion of the Web, taking a URL and following every link available from it (minus those marked for exclusion). The computer gathers the crawled results from the URLs and sends that information back to a centralized server in compressed format. The centralized server then coordinates that information in a database , along with information from other computers involved in indexing.

When a user types a query into the search field, Google's domain name server ( DNS ) software relays the query to the most logical cluster of computers, based on factors such as its proximity to the user or how busy it is. At the recipient cluster, the Web server software distributes the query to hundreds or thousands of computers to search simultaneously. Hundreds of computers scan the database index to find all relevant records. The index server compiles the results, the document server pulls together the titles and summaries and the page builder creates the search result pages.

Some projects, such as Wikia Search (formerly Grub ) are moving towards an even more decentralized search model. Similarly to distributed computing projects such as SETI@home , many distributed search projects are supported by a network of voluntary users whose computers run client software in the background.



Last updated on: Apr 24, 2008

>  Enterprise Software related Research & News
>  White Papers for the Retail Industry

Are you a Know-IT-All?
What technology is solid-state lighting based on?
Answer

word of the day Sign up for the Word of the Day
twitter Follow us on Twitter


WORD OF THE DAY...
above the fold
LEARN MORE ABOUT...
Mobile Web design and testing
wear leveling
write amplification
write endurance
decision management
business process governance
Profile-Driven Storage
Resilient File System (ReFS)
Security, Trust and Assurance Registry (STAR)
Windows Server 8
community cloud
managed storage
facial recognition
Shared serial-attached SCSI (SAS)
open compute project
BIOS password
dynamic BPM (business process management)
social BPM (business process management)
in-circuit emulator (ICE)
above the fold
logic simulator
photometric stereo
dynamic case management (DCM)
raw device mapping (RDM)
WhatIs.com RSS Feeds
About Us   |   Contact Us   |   For Advertisers   |   For Business Partners   |   Reprints   |   RSS   |   Awards
TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Reprints




All Rights Reserved, Copyright 2008, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts