Browse Definitions :
Definition

outlier

Contributor(s): Matthew Haughn

An outlier is a single data point that goes far outside the average value of a group of statistics. Outliers may be exceptions that stand outside individual samples of populations as well. In a more general context, an outlier is an individual that is markedly different from the norm in some respect.

Outliers are an important factor in statistics as they can have a considerable effect on overall results. In especially small sample sizes, a single outlier may dramatically affect averages and skew the study's final results.

An outlier can happen due to disinformation by a subject, errors in a subject's responses or in data entry. In some cases, it's clear that outliers should be removed as errors. In others, it may come down to standards or judgment calls where outliers are a natural deviation.

Statisticians, who often attempt to mitigate the effect of outliers, have come up with ways to identify what makes an outlier. For example, in a scatter plot where data points are graphed, outliers are visually identifiable. In a box plot, outliers are found by using equations to find if they exceed defined norms.

Outliers can sometimes indicate errors or poor methods of sample gathering. They can also indicate an anomaly or something of interest to study since it's not always possible to determine if outliers are in error. Although the effects of outliers can skew results of statistics, it is rare that they are entirely removed from results without observations.

This was last updated in February 2018

Continue Reading About outlier

Join the conversation

1 comment

Send me notifications when other members comment.

Please create a username to comment.

How challenging is it to statistically measure application performance?
Cancel

-ADS BY GOOGLE

File Extensions and File Formats

Powered by:

SearchCompliance

SearchSecurity

SearchHealthIT

SearchDisasterRecovery

  • business continuity plan (BCP)

    A business continuity plan (BCP) is a document that consists of the critical information an organization needs to continue ...

  • disaster recovery team

    A disaster recovery team is a group of individuals focused on planning, implementing, maintaining, auditing and testing an ...

  • cloud insurance

    Cloud insurance is any type of financial or data protection obtained by a cloud service provider. 

SearchStorage

  • RAID 6 (redundant array of independent disks)

    RAID 6, also known as double-parity RAID, uses two parity stripes on each disk. It allows for two disk failures within the RAID ...

  • hard disk drive (HDD)

    A computer hard disk drive (HDD) is a non-volatile memory hardware device that controls the positioning, reading and writing of ...

  • byte

    In most computer systems, a byte is a unit of data that is eight binary digits long. Bytes are often used to represent a ...

Close