Browse Definitions:


With its combination of news, learning guides, expert advice, white papers, Webcasts and customized research, offers a rich collection of insight on how to efficiently manage the data supply chain. The site also offers tips on vendors, product selection and expert advice.

View the complete archive of Data Management and Warehousing news, research and expert advice.

Go to:  SearchDataManagement

Recently on  SearchDataManagement

Kubernetes gains momentum in big data implementation process

Big data vendors and users are looking to Kubernetes-managed containers to help accelerate system and application deployments and enable more flexible use of computing resources.

More Highlights
  • Hadoop Distributed File System (HDFS)

    The Hadoop Distributed File System (HDFS) is the primary data storage system used by Hadoop applications.

  • TensorFlow

    TensorFlow is an open source framework developed by Google researchers to run machine learning, deep learning and other statistical and predictive analytics workloads.

  • Hadoop data lake

    A Hadoop data lake is a data management platform comprising one or more Hadoop clusters.

Browse Data Management Topics


Find a wide selection of enterprise data integration (EDI) articles, tutorials, information and resources appropriate for business or technical backgrounds. Read tutorials about different corporate data integration methods, find out about industry trends and get personal advice from our team of EDI experts. Also, check out best practices for evaluating and deploying enterprise data integration software platforms and get tips and tricks through case studies that cover a variety of industries and specific applications, such as business intelligence data integration. Learn definitions and applications of common integration methods such as extract, transform and load (ETL), enterprise application integration (EAI), enterprise information integration (EII), enterprise data replication (EDR) and hand coding, and find out about data-as-a-service and information-as-a-service in a service-oriented architecture.

Recent Definitions

  • data preparation

    Data preparation is the process of gathering, combining, structuring and organizing data so it can be analyzed as part of data visualization, analytics and machine learning applications.

  • data integration

    Data integration is the process of retrieving data from multiple source systems and combining it in such a way that it can yield consistent, comprehensive, current and correct information for business reporting and analysis.

  • data ingestion

    Data can be ingested in real time or in batches. When data is ingested in real time, each data item is imported as it is emitted by the source. When data is ingested in batches, data items are imported in discrete chunks at periodic intervals.


More Integration Topics

Back to Top


Access a full collection of data management resources here. There's a lot going on in the data management industry beyond just technology innovations. Both business and technical professionals can benefit from these exclusive data management resources including tutorials, podcasts and articles about data management jobs, trends and the industry. Stay up-to-date on data management best practices and key industry trends that could affect organizational strategies.

Recent Definitions

  • data janitor (data wrangler)

    A data janitor is an IT employee that cleans up big data sources to prepare them for data analysts and data scientists.

  • data management

    Data management is the practice of organizing and maintaining data processes to meet ongoing information lifecycle needs.

  • data scientist

    A data scientist is a professional responsible for collecting, analyzing and interpreting large amounts of data to identify ways to help a business improve operations and gain a competitive edge over rivals.


More Resources Topics

Back to Top

Quality / governance

This topic section features a comprehensive collection selection of data governance and data quality management software and strategy resources, articles and templates. Get advice on data quality software management planning and implementation, data governance, data stewardship, data profiling and data cleansing from analysts and experts -- even pose specific questions to a panel of experts. Learn how other companies are managing data quality and governance through exclusive case studies that span industries. And, listen to exclusive podcasts with help and data quality management best practices straight from renowned experts.

Recent Definitions

  • data management

    Data management is the practice of organizing and maintaining data processes to meet ongoing information lifecycle needs.

  • Data Protection Bill 2017

    The Data Protection Bill 2017 is legislation that will replace the Data Protection Act of 1998. It is designed to balance the privacy needs of United Kingdom (UK) and European Union (EU) citizens with the interests of business.

  • data citizen

    A data citizen is an employee who relies on data to make decisions and perform job responsibilities. In both a government context and a data context, citizenship offers both rights and responsibilities.


More Quality / governance Topics

Back to Top


A database management system (DBMS) is a critical IT infrastructure component. In this section, you’ll find news, advice and other information resources on database technology and database management issues. Learn about DBMS concepts, database best practices and the potential advantages of DBMS technology as well as database problems to watch out for. Get tips on designing enterprise database management systems, evaluating DBMS software and managing a database implementation. Stay up to date on the latest database trends and market-leading relational database technologies, such as Oracle databases, IBM DB2 and Microsoft SQL Server. Also, read about different types of database management systems, including open source databases. Get personalized advice and answers to your database questions from our panel of database experts, read book excerpts with DBMS examples and use our DBMS tutorials to help you prepare for your next database project.

Recent Definitions

  • MariaDB

    MariaDB is an open source relational database management system (DBMS) that is a compatible drop-in replacement for the widely used MySQL database technology.

  • Microsoft Azure Cosmos DB

    Azure Cosmos DB is a Microsoft cloud database that supports multiple ways of storing and processing data. As such, it is classified as a multimodel database.

  • Microsoft SQL Server

    Microsoft SQL Server is a relational database management system, or RDBMS, that supports a wide variety of transaction processing, business intelligence and analytics applications in corporate IT environments


More DBMS Topics

Back to Top


Get the latest information about enterprise business intelligence (BI) software tools and market trends in articles, news and resources. Quickly learn how to develop or update a business intelligence strategy and roadmap, and find out what trends could affect an organization's business intelligence strategy. Get tutorials on business intelligence fundamentals and read many business intelligence case studies with tips, tricks and best practices from analysts, experts and end users. Find out more about where to find business intelligence jobs, training and the best certifications. And, read about related disciplines such as corporate performance management, business intelligence search, data analytics, predictive analytics, text mining, data mining, executive dashboards, performance scorecards and more.

Recent Definitions

  • consumer privacy (customer privacy)

    Consumer privacy, also known as customer privacy, involves the handling and protection of sensitive personal information that individuals provide in the course of everyday transactions.

  • dark data

    Dark data is digital information that is not being used. Consulting and market research company Gartner Inc. describes dark data as "information assets that an organization collects, processes and stores in the course of its regular business activity, but generally fails to use for other purposes."

  • business intelligence (BI)

    Business intelligence (BI) is a technology-driven process for analyzing data and presenting actionable information to help executives, managers and other corporate end users make informed business decisions.


More BI Topics

Back to Top


Learn about the potential benefits and challenges of deploying enterprise content management systems and using enterprise content management software. Get information and expert advice on creating an enterprise content management strategy, selecting ECM software and managing ECM systems. Find resources on topics such as ECM training, ECM vendors and open source enterprise content management, plus news and best-practices guidance on unstructured data management and unstructured data analysis. Also, get answers to your ECM questions and ECM problems from our enterprise content management expert.


More ECM Topics

Back to Top

Data warehouse

In this section, you’ll find news articles, case studies and other information resources on enterprise data warehousing and data warehouse management. Learn about data warehousing concepts, data warehousing best practices and data warehousing methodologies. Get expert advice on data warehousing basics, such as evaluating enterprise data warehouse software, selecting data warehouse vendors and managing a data warehouse system. Read case studies on data warehouse implementation projects, and get up to speed on technologies such as data warehousing tools and data warehouse appliance hardware. Also, get tips on data warehouse project management and answers to your questions on data warehouse architecture and data warehouse design issues, such as creating a data warehouse schema.

Recent Definitions

  • U-SQL

    U-SQL is a Microsoft query language that combines a declarative SQL-like syntax with C# programming, enabling it to be used to process both structured and unstructured data in big data environments.

  • Hadoop Distributed File System (HDFS)

    The Hadoop Distributed File System (HDFS) is the primary data storage system used by Hadoop applications.

  • TensorFlow

    TensorFlow is an open source framework developed by Google researchers to run machine learning, deep learning and other statistical and predictive analytics workloads.


More Data warehouse Topics

Back to Top


Enterprise master data management (MDM) promises to help organize data to achieve a "single view of the truth" -- but it's an emerging discipline that still presents many technical and organizational challenges. Learn how to design a master data management strategy with the expert advice in this topic section and read about the latest principles and trends in master data management (MDM) systems and best practices with a wide collection of articles, resources and tutorials. Read analyst firm studies about master data management (MDM) software and learn more about systems from vendors such as IBM and Kalido. Check out master data management (MDM) case studies or ask our expert panel for MDM advice. Also, see how companies use MDM practices like product information management (PIM) to manage catalogs or customer data integration (CDI) to reduce churn and improve customer service. And, find resources on related disciplines such as data governance for master data management projects, data quality management and enterprise data integration.

Recent Definitions

  • data management

    Data management is the practice of organizing and maintaining data processes to meet ongoing information lifecycle needs.

  • metadata management

    Metadata management is the oversight of data associated with data assets to ensure that information can be integrated, accessed, shared, linked, analyzed and maintained to best effect across an organization.

  • golden record

    A golden record is a single, well-defined version of all the data entities in an organizational ecosystem. Sometimes it's called the "single version of the truth," where "truth" is understood to mean the reference to which data users can turn when they want to ensure that they have the correct version of a piece of information.


More MDM Topics

Back to Top


Your data management needs are often dependent on your industry, so we have dedicated a topic section to vertical data management strategies. Find resources for data management in the financial services, healthcare, manufacturing and retail industry. Explore industry-specific advice and case studies to help you design a strategy for managing data that meets the needs of your organization.

Recent Definitions

  • HIPAA (Health Insurance Portability and Accountability Act)

    HIPAA (Health Insurance Portability and Accountability Act of 1996) is United States legislation that provides data privacy and security provisions for safeguarding medical information.

  • edge analytics

    Edge analytics applies algorithms to data at the point of collection in order to trigger actions and determine what should be sent back to a central data repository and what should be discarded.

  • reference data

    Reference data, in the context of data management, are the data objects relevant to transactions, consisting of sets of values, statuses or classification schema.


More Verticals Topics

Back to Top



  • risk map (risk heat map)

    A risk map, also known as a risk heat map, is a data visualization tool for communicating specific risks an organization faces. A...

  • internal audit (IA)

    An internal audit (IA) is an organizational initiative to monitor and analyze its own business operations in order to determine ...

  • pure risk (absolute risk)

    Pure risk, also called absolute risk, is a category of threat that is beyond human control and has only one possible outcome if ...


  • cloud ecosystem

    A cloud ecosystem is a complex system of interdependent components that all work together to enable cloud services.

  • cloud services

    Cloud services is an umbrella term that may refer to a variety of resources provided over the internet, or to professional ...

  • uncloud (de-cloud)

    The term uncloud describes the action or process of removing applications and data from a cloud computing platform.


  • federated identity management (FIM)

    Federated identity management (FIM) is an arrangement that can be made among multiple enterprises to let subscribers use the same...

  • cross-site scripting (XSS)

    Cross-site scripting (XSS) is a type of injection security attack in which an attacker injects data, such as a malicious script, ...

  • firewall

    In computing, a firewall is software or firmware that enforces a set of rules about what data packets will be allowed to enter or...




  • bad block

    A bad block is an area of storage media that is no longer reliable for storing and retrieving data because it has been physically...

  • all-flash array (AFA)

    An all-flash array (AFA), also known as a solid-state storage disk system, is an external storage array that uses only flash ...

  • volume manager

    A volume manager is software within an operating system (OS) that controls capacity allocation for storage arrays.


  • hybrid hard disk drive (HDD)

    A hybrid hard disk drive is an electromechanical spinning hard disk that contains some amount of NAND Flash memory.