What is document metadata? - Definition from WhatIs.com


document metadata

Part of the Content management glossary:

Document metadata is information attached to a text-based file that may not be visible on the face of the document; documents may also contain supporting elements such as graphic images, photographs, tables and charts, each of which can have its own metadata.

Metadata summarizes basic information about data, which can make finding and working with particular instances of data easier. Having the ability to filter through that metadata makes it much easier for someone to locate a specific document or other data asset in a variety of different ways. 

Document metadata in Microsoft Word, for example, includes the file size, date of document creation, the names of the author and most recent modifier, the dates of any changes and the total edit time. Further metadata can be added, including title, tags and comments. 

Editing features like the "Track changes" option in Word also generate metadata such as text that has been deleted and comments between authors and editors. Because that content can contain sensitive information, it's important to be aware of metadata security measures and take appropriate steps to protect corporate data assets from unauthorized access. Document sanitization, for example, is the process of ensuring that only the intended information can be accessed from a text-based file. 

See also: metadata management

This was last updated in August 2014
Contributor(s): Ivy Wigmore
Posted by: Margaret Rouse

Related Terms


  • data feed

    - A data feed is an ongoing stream of structured data that provides users with updates of current information from one or more sources. (WhatIs.com)

  • federated search (universal search)

    - Federated search is an approach to information retrieval that aggregates query results from multiple information sources. Federated search may also be called universal search. (WhatIs.com)

  • predictive coding

    - Predictive coding software can be used to automate portions of an e-discovery document review. The goal of predictive coding is to reduce the number of irrelevant and non-responsive documents that ... (SearchCompliance.com)


  • Content management

    - Terms related to content management, including definitions about enterprise content management and words and phrases about content management applications (CMA) and content management systems (CMS).

  • Internet applications

    - This WhatIs.com glossary contains terms related to Internet applications, including definitions about Software as a Service (SaaS) delivery models and words and phrases about web sites, e-commerce ...

Ask a Question About document metadataPowered by ITKnowledgeExchange.com

Get answers from your peers on your most technical challenges

Tech TalkComment



    Contribute to the conversation

    All fields are required. Comments will appear at the bottom of the article.