Browse Definitions :
Definition

scatter plot

Contributor(s): Stan Gibilisco

A scatter plot is a set of points plotted on a horizontal and vertical axes.

Scatter plots are important in statistics because they can show the extent of correlation, if any, between the values of observed quantities or phenomena (called variables). If no correlation exists between the variables, the points appear randomly scattered on the coordinate plane. If a large correlation exists, the points concentrate near a straight line. Scatter plots are useful data visualization tools for illustrating a trend. 

Besides showing the extent of correlation, a scatter plot shows the sense of the correlation:

  • If the vertical (or y-axis) variable increases as the horizontal (or x-axis) variable increases, the correlation is positive.
  • If the y-axis variable decreases as the x-axis variable increases or vice-versa, the correlation is negative.
  • If it is impossible to establish either of the above criteria, then the correlation is zero.

The maximum possible positive correlation is +1 or +100%, when all the points in a scatter plot lie exactly along a straight line with a positive slope. The maximum possible negative correlation is -1 or -100%, in which case all the points lie exactly along a straight line with a negative slope.

Correlation is often confused with causation, either accidentally (as a result of false or unproved hypotheses) or deliberately (with intent to deceive). However, in the pure sense, while a scatter plot can reveal the nature and extent of correlation, it says nothing about causation.

This video from RodCastMath explains more about negative and positive correlation on a scatter plot.

See also: graph theory, pictograph, bar graph, point-to-point graph, sparkline, time series chart

This was last updated in December 2012

Continue Reading About scatter plot

Start the conversation

Send me notifications when other members comment.

Please create a username to comment.

-ADS BY GOOGLE

File Extensions and File Formats

Powered by:

SearchCompliance

  • California Consumer Privacy Act (CCPA)

    The California Consumer Privacy Act (CCPA) is legislation in the state of California that supports an individual's right to ...

  • compliance audit

    A compliance audit is a comprehensive review of an organization's adherence to regulatory guidelines.

  • regulatory compliance

    Regulatory compliance is an organization's adherence to laws, regulations, guidelines and specifications relevant to its business...

SearchSecurity

  • BlueKeep (CVE-2019-0708)

    BlueKeep (CVE-2019-0708) is a vulnerability in the Remote Desktop (RDP) protocol that affects Windows 7, Windows XP, Server 2003 ...

  • endpoint detection and response (EDR)

    Endpoint detection and response (EDR) is a category of tools and technology used for protecting computer hardware devices–called ...

  • ransomware

    Ransomware is a subset of malware in which the data on a victim's computer is locked, typically by encryption, and payment is ...

SearchHealthIT

SearchDisasterRecovery

  • disaster recovery team

    A disaster recovery team is a group of individuals focused on planning, implementing, maintaining, auditing and testing an ...

  • cloud insurance

    Cloud insurance is any type of financial or data protection obtained by a cloud service provider. 

  • business continuity software

    Business continuity software is an application or suite designed to make business continuity planning/business continuity ...

SearchStorage

  • blockchain storage

    Blockchain storage is a way of saving data in a decentralized network which utilizes the unused hard disk space of users across ...

  • disk mirroring (RAID 1)

    RAID 1 is one of the most common RAID levels and the most reliable. Data is written to two places simultaneously, so if one disk ...

  • RAID controller

    A RAID controller is a hardware device or software program used to manage hard disk drives (HDDs) or solid-state drives (SSDs) in...

Close