data lineage

Data lineage is the history of data, including where the data has traveled throughout its existence within an organization. Data lineage is often a required part of corporate and government data policy compliance. Tracking the history of data is achieved through data lineage documentation and software. Without a way to identify where data errors are introduced into the environment, it is difficult for data stewards to identify and fix data quality issues.

With effective tools, data governance can be eased by documenting data’s entire journey through the organization. Data lineage documentation helps simplify two of the main data governance concerns about the effects of changes in data: root cause analysis and business impact analysis (BIA). Knowing everything that has happened to the data since it came to be makes the root causes and impacts of data issues easier to understand.
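One way to picture how lineage supports both kinds of analysis is to treat datasets as nodes in a directed graph, with an edge from each source to whatever is derived from it. The sketch below is a minimal illustration, not a description of any particular lineage product: the `LineageGraph` class, its method names and the dataset names are all hypothetical. Walking the graph upstream lists candidates for root cause analysis, and walking it downstream lists what a change would affect.

```python
from collections import defaultdict, deque


class LineageGraph:
    """Hypothetical in-memory lineage graph (illustrative only).

    An edge source -> target means "target is derived from source".
    """

    def __init__(self):
        self.downstream = defaultdict(set)  # source -> datasets derived from it
        self.upstream = defaultdict(set)    # target -> datasets it was built from

    def add_edge(self, source, target):
        self.downstream[source].add(target)
        self.upstream[target].add(source)

    def _reach(self, start, neighbors):
        # Breadth-first walk; the seen set also guards against cycles.
        seen = set()
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for nxt in neighbors.get(node, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        return seen

    def root_cause_candidates(self, dataset):
        """Everything upstream: places where a flaw could have been introduced."""
        return self._reach(dataset, self.upstream)

    def impacted(self, dataset):
        """Everything downstream: what a change or error here would affect."""
        return self._reach(dataset, self.downstream)


# Hypothetical dataset names, chosen only for the example.
graph = LineageGraph()
graph.add_edge("crm_export", "customers_clean")
graph.add_edge("web_orders", "orders_clean")
graph.add_edge("customers_clean", "sales_report")
graph.add_edge("orders_clean", "sales_report")
graph.add_edge("sales_report", "exec_dashboard")

print(sorted(graph.root_cause_candidates("sales_report")))
print(sorted(graph.impacted("customers_clean")))
```

Storing both directions of each edge keeps the two questions symmetric: the same traversal answers "where could this have come from?" and "what does this touch?".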

In software development, tracking data lineage can help reconcile Agile development best practices with data governance regulations and company data policy. Data lineage tools and procedures help track where data flaws were introduced, which can ease diagnosis and correction. Implementing data lineage tracking can be difficult and is often seen as a low priority. However, earlier correction means less error propagation, so implementing data lineage tools early in the process often proves worth the effort.
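As a rough illustration of tracking where a flaw was introduced, the sketch below records a small lineage entry after every step of a toy pipeline and then reports the first step whose output fails a validity check. Everything here is an assumption made for the example: the `LineageRecord` fields, the helper functions, the step names and the deliberately buggy discount step do not come from any specific tool.

```python
from dataclasses import dataclass


@dataclass
class LineageRecord:
    """Hypothetical per-step lineage entry (illustrative only)."""
    step: str
    input_rows: int
    output_rows: int
    valid: bool


def run_with_lineage(rows, steps, is_valid):
    """Apply each (name, transform) pair in order, recording lineage as we go."""
    records = []
    for name, transform in steps:
        before = len(rows)
        rows = transform(rows)
        records.append(
            LineageRecord(name, before, len(rows), all(is_valid(r) for r in rows))
        )
    return rows, records


def first_flawed_step(records):
    """Return the earliest step whose output failed validation, or None."""
    for rec in records:
        if not rec.valid:
            return rec.step
    return None


# Toy pipeline; apply_discount is intentionally wrong and can go negative.
steps = [
    ("parse_amount", lambda rows: [{**r, "amount": int(r["amount"])} for r in rows]),
    ("apply_discount", lambda rows: [{**r, "amount": r["amount"] - 20} for r in rows]),
    ("tag_currency", lambda rows: [{**r, "currency": "USD"} for r in rows]),
]

raw = [{"amount": "10"}, {"amount": "25"}]
result, records = run_with_lineage(raw, steps, lambda r: r["amount"] >= 0)

print(first_flawed_step(records))
```

Because each step is checked as it runs, the flaw is attributed to the step that introduced it rather than to the later steps that merely passed it along, which is the error propagation the paragraph above describes.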

This was last updated in January 2019
