scatter plot

Contributor(s): Stan Gibilisco

A scatter plot is a set of points plotted on horizontal and vertical axes, where each point represents a pair of values for two variables.

Scatter plots are important in statistics because they can show the extent of correlation, if any, between the values of observed quantities or phenomena (called variables). If no correlation exists between the variables, the points appear randomly scattered on the coordinate plane. If a strong correlation exists, the points concentrate near a straight line. This makes scatter plots useful data visualization tools for illustrating a trend.

Besides showing the extent of correlation, a scatter plot shows the sense of the correlation:

  • If the vertical (or y-axis) variable increases as the horizontal (or x-axis) variable increases, the correlation is positive.
  • If the y-axis variable decreases as the x-axis variable increases or vice-versa, the correlation is negative.
  • If the points show neither tendency, the correlation is zero or close to zero.
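
The three cases above can be checked numerically. The following is a minimal sketch, not a standard library routine: the function name correlation_sense, the tolerance parameter and the sample data are all made up for illustration. It uses the sign of the summed products of deviations from the means (the unnormalized covariance) to decide the sense of the correlation.

```python
def correlation_sense(xs, ys, tolerance=1e-12):
    """Classify the sense of correlation as 'positive', 'negative' or 'zero'.

    Hypothetical helper for illustration: it looks only at the sign of the
    summed products of deviations from the means (unnormalized covariance).
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least two points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Positive terms: x and y deviate from their means in the same direction.
    # Negative terms: they deviate in opposite directions.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if cov > tolerance:
        return "positive"
    if cov < -tolerance:
        return "negative"
    return "zero"


# Made-up sample data, one list per axis.
x_values = [1, 2, 3, 4, 5]
print(correlation_sense(x_values, [52, 60, 71, 75, 88]))   # y rises with x
print(correlation_sense(x_values, [10, 8, 7, 4, 1]))       # y falls as x rises
print(correlation_sense([-2, -1, 0, 1, 2], [4, 1, 0, 1, 4]))  # no linear trend
```

The tolerance only guards against floating-point rounding; it is an assumption of this sketch rather than a statistical threshold.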

The maximum possible positive correlation is +1 or +100%, when all the points in a scatter plot lie exactly along a straight line with a positive slope. The maximum possible negative correlation is -1 or -100%, in which case all the points lie exactly along a straight line with a negative slope.
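
The +1 and -1 limits are commonly expressed with the Pearson correlation coefficient, which the definition above does not name; treating it as the intended measure is an assumption here. The sketch below computes it in plain Python. The function name pearson_r and the data are hypothetical, chosen so that the points lie exactly on a line.

```python
import math


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences.

    Hypothetical helper for illustration; assumes numeric input.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least two points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        # A constant variable has no spread, so the coefficient is undefined.
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


xs = [1, 2, 3, 4, 5]
rising = [2 * x + 1 for x in xs]    # points on a line with positive slope
falling = [10 - 3 * x for x in xs]  # points on a line with negative slope

print(pearson_r(xs, rising))   # at the positive limit, up to rounding
print(pearson_r(xs, falling))  # at the negative limit, up to rounding
```

Multiplying the result by 100 gives the percentage form mentioned above. Real data rarely falls exactly on a line, so values strictly between the two limits are the usual case.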

Correlation is often confused with causation, either accidentally (as a result of false or unproved hypotheses) or deliberately (with intent to deceive). However, in the pure sense, while a scatter plot can reveal the nature and extent of correlation, it says nothing about causation.

See also: graph theory, pictograph, bar graph, point-to-point graph, sparkline, time series chart

This was last updated in December 2012
