Part of the Internet technologies glossary:

A spider is a program that visits Web sites and reads their pages and other information in order to create entries for a search engine index. The major search engines on the Web all have such a program, which is also known as a "crawler" or a "bot." Spiders are typically programmed to visit sites that have been submitted by their owners as new or updated. Entire sites or specific pages can be selectively visited and indexed. Spiders are called spiders because they usually visit many sites in parallel at the same time, their "legs" spanning a large area of the "web." Spiders can crawl through a site's pages in several ways. One way is to follow all the hypertext links in each page until all the pages have been read.

Next Steps

The spider for the AltaVista search engine and its Web site is called Scooter . Scooter adheres to the rules of politeness for Web spiders that are specified in the Standard for Robot Exclusion (SRE). It asks each server which files should be excluded from being indexed. It does not (or can not) go through firewall . And it uses a special algorithm for waiting between successive server requests so that it doesn't affect response time for other users.

This was last updated in April 2005
Posted by: Margaret Rouse

Related Terms

Definitions

  • arachnotaxis

    - Arachnotaxis is the use of a table or structured list of URLs for Web sites (or words that hyperlink to Web sites) in order to help locate them. (WhatIs.com)

  • Archie

    - Archie is a program that allows you to search the files of all the Internet FTP servers that offer anonymous FTP. (SearchSOA.com)

  • phantom page

    - A phantom page is a Web page that is optimized for search engines rather than for humans. (WhatIs.com)

Glossaries

  • Internet technologies

    - This WhatIs.com glossary contains terms related to Internet technologies, including definitions about port numbers, standards and protocols and words and phrases about how the Internet works.

  • Internet applications

    - This WhatIs.com glossary contains terms related to Internet applications, including definitions about Software as a Service (SaaS) delivery models and words and phrases about web sites, e-commerce ...

Ask a Question. Find an Answer.Powered by ITKnowledgeExchange.com

Ask An IT Question

Get answers from your peers on your most technical challenges

Ask Question
  • Basic Cisco Switch Setup for Home Network

    http://www.cisco.com/en/US/products/hw/switches/ps700/products_tech_note09186a00801aecbb.shtml Awesome! Thank you. That's just what I've been looking for.

  • RUNSQLSTM

    I don't know that you can prevent it, but you sure can hide it or kill it :-) One "hide it" option would be to: CRTOUTQ TEMP CRTPRTF TEMP OUTQ(TEMP) RUNSQLSTM ... PRTFILE(TEMP) and then periodica...

  • Search engine indexing

Tech TalkComment

Share
Comments

    Results

    Contribute to the conversation

    All fields are required. Comments will appear at the bottom of the article.