What is data hygiene? - Definition from WhatIs.com
Part of the Data and data management glossary:

Data hygiene is the collective processes conducted to ensure the cleanliness of data. Data is considered clean if it is relatively error-free. Dirty data can be caused by a number of factors including duplicate records, incomplete or outdated data, and the improper parsing of record fields from disparate systems. Errors can be introduced at any stage as data is entered, stored and managed.

Data quality is crucial to operational and transactional processes within the enterprise and to the reliability of business analytics (BA) / business intelligence (BI) reporting.

Data scrubbing, also called data cleansing, is the process of amending or removing data in a database that is incorrect, incomplete, improperly formatted, or duplicated. Typically the process involves updating it, standardizing it, and de-duplicating records to create a single view of the data, even even if it is stored in multiple disparate systems.

This was last updated in April 2013
Contributor(s): Ivy Wigmore
Posted by: Margaret Rouse

Related Terms

Definitions

  • Microsoft Azure Data Lake

    - Microsoft Azure Data Lake is a highly scalable data storage and analytics service hosted in Azure, Microsoft's public cloud. The service is largely intended for big data storage and analysis. (searchCloudComputing.com)

  • data-driven decision management (DDDM)

    - Data-driven decision management (DDDM) is an approach to business governance that values actions that can be backed up with verifiable data. The success of a data-driven approach is reliant upon th... (WhatIs.com)

  • fast data

    - Fast data is the application of big data analytics to smaller data sets in near-real or real-time in order to solve a problem or create business value. The goal of fast data is to quickly gather an... (WhatIs.com)

Glossaries

  • Data and data management

    - Terms related to data, including definitions about data warehousing and words and phrases about data management.

  • Internet applications

    - This WhatIs.com glossary contains terms related to Internet applications, including definitions about Software as a Service (SaaS) delivery models and words and phrases about web sites, e-commerce ...

Ask a Question. Find an Answer.Powered by ITKnowledgeExchange.com

Ask An IT Question

Get answers from your peers on your most technical challenges

Ask Question

Tech TalkComment

Share
Comments

    Results

    Contribute to the conversation

    All fields are required. Comments will appear at the bottom of the article.