Browse Definitions :
Definition

automatic content classification

Contributor(s): Corinne Bernstein

Automatic content classification is a process for managing text and unstructured information by categorizing or clustering text. By labeling natural language texts with relevant categories from a predefined set, automatic document classification enables users to organize content quickly and efficiently.

While manual document classification may be highly detailed and accurate, it is time-consuming and subjective. Automatic document classification is faster, scalable and more objective. It provides organizations with a more systematic and consistent classification and can be useful in more complex, nuanced contexts, such as business-specific content. Machine learning and artificial intelligence can boost the speed and efficiency of automatic document classification.

The automated classification of texts into predefined categories has gained attention in the past 10 to 15 years due to the increased availability of documents in digital form and the need to get them organized. Today, text classification is applied in many contexts, including document filtering, email spam filtering, automated document metadata generation, word sense disambiguation and hierarchical catalogs of web resources.

Because automatic document classification software defines the requirements for organizing content at the outset, there needs to be a clear, objective configuration of the categories and classification rules before testing, customization  and  refinement can be performed. Key elements of text classification include the ability to analyze the intent, emotion and sentiment of textual data.

Text classification helps companies understand customer behavior by categorizing conversations on social networks, comment sections  and other web sources. Having an effective and consistent automatic content classification system can provide better customer relationship management (CRM), enhance findability for key audiences and improve and organization's ability to monetize customer-generated information.

This was last updated in June 2018

Continue Reading About automatic content classification

Start the conversation

Send me notifications when other members comment.

Please create a username to comment.

-ADS BY GOOGLE

File Extensions and File Formats

SearchCompliance

  • Whistleblower Protection Act

    The Whistleblower Protection Act of 1989 is a law that protects federal government employees in the United States from ...

  • smart contract

    A smart contract, also known as a cryptocontract, is a computer program that directly controls the transfer of digital currencies...

  • risk map (risk heat map)

    A risk map, also known as a risk heat map, is a data visualization tool for communicating specific risks an organization faces. A...

SearchSecurity

  • challenge-response authentication

    In information security, challenge-response authentication is a type of authentication protocol where one entity presents a ...

  • Secure Shell (SSH)

    SSH, also known as Secure Shell or Secure Socket Shell, is a network protocol that gives users, particularly system ...

  • honeypot (computing)

    A honeypot is a network-attached system set up as a decoy to lure cyberattackers and to detect, deflect or study hacking attempts...

SearchHealthIT

SearchDisasterRecovery

  • virtual disaster recovery

    Virtual disaster recovery is a type of DR that typically involves replication and allows a user to fail over to virtualized ...

  • tabletop exercise (TTX)

    A tabletop exercise (TTX) is a disaster preparedness activity that takes participants through the process of dealing with a ...

  • risk mitigation

    Risk mitigation is a strategy to prepare for and lessen the effects of threats faced by a data center.

SearchStorage

  • exbibyte (EiB)

    An exbibyte (EiB) is a unit used to measure data capacity.

  • zebibyte (ZiB)

    A zebibyte (ZiB) is a unit used to measure computing and storage capacity.

  • tiered storage

    Tiered storage is a way to assign different categories of data to various types of storage media with the objective of reducing ...

Close