Browse Definitions :
Definition

data lineage

Contributor(s): Matthew Haughn

Data lineage is the history of data, including where the data has traveled through-out the its existence within an organization. Data lineage is a required part of corporate and government data policy compliance. Tracking the history of data is achieved through data lineage documentation and software. Without a way to identify where data errors are introduced into the environment, it is difficult for data stewards to identify and fix data quality issues.

With effective tools, data governance can be eased through the documentation of data’s entire journey through the organization. The documentation of data lineage helps simplify two of the main data governance concerns in for the effects of changes in data: root cause analysis and business impact analysis (BIA). Clear understanding of root causes and impacts of issues with data is aided by knowing everything that happened to the data since it came to be.

In software development, the tracking of data lineage can help with reconciling the difficulties between Agile development best practices, data governance regulations and company data policy. Data lineage tools and procedures help track where data flaws were introduced, which can ease diagnoses and correction. Implementing the tracking of data lineage can be difficult and often seen as a low priority, however, earlier correction means less error propagation, which means the implementation of data lineage tools early in the process often proves worth the effort.

This was last updated in January 2019

Continue Reading About data lineage

Start the conversation

Send me notifications when other members comment.

Please create a username to comment.

-ADS BY GOOGLE

File Extensions and File Formats

SearchCompliance

  • compliance audit

    A compliance audit is a comprehensive review of an organization's adherence to regulatory guidelines.

  • regulatory compliance

    Regulatory compliance is an organization's adherence to laws, regulations, guidelines and specifications relevant to its business...

  • Whistleblower Protection Act

    The Whistleblower Protection Act of 1989 is a law that protects federal government employees in the United States from ...

SearchSecurity

  • orphan account

    An orphan account, also referred to as an orphaned account, is a user account that can provide access to corporate systems, ...

  • voice squatting (skill squatting)

    Voice squatting is an attack vector for voice user interfaces (VUIs) that exploits homonyms (words that sound the same but are ...

  • WPA3

    WPA3, also known as Wi-Fi Protected Access 3, is the third version of the security certification program developed by the Wi-Fi ...

SearchHealthIT

SearchDisasterRecovery

  • business continuity policy

    Business continuity policy is the set of standards and guidelines an organization enforces to ensure resilience and proper risk ...

  • business continuity and disaster recovery (BCDR)

    Business continuity and disaster recovery (BCDR) are closely related practices that describe an organization's preparation for ...

  • warm site

    A warm site is a type of facility an organization uses to recover its technology infrastructure when its primary data center goes...

SearchStorage

  • cache memory

    Cache memory, also called CPU memory, is high-speed static random access memory (SRAM) that a computer microprocessor can access ...

  • enterprise storage

    Enterprise storage is a centralized repository for business information that provides common data management, protection and data...

  • disk array

    A disk array, also called a storage array, is a data storage system used for block-based storage, file-based storage or object ...

Close