Browse Definitions :
Definition

automatic content classification

Contributor(s): Corinne Bernstein

Automatic content classification is a process for managing text and unstructured information by categorizing or clustering text. By labeling natural language texts with relevant categories from a predefined set, automatic document classification enables users to organize content quickly and efficiently.

While manual document classification may be highly detailed and accurate, it is time-consuming and subjective. Automatic document classification is faster, scalable and more objective. It provides organizations with a more systematic and consistent classification and can be useful in more complex, nuanced contexts, such as business-specific content. Machine learning and artificial intelligence can boost the speed and efficiency of automatic document classification.

The automated classification of texts into predefined categories has gained attention in the past 10 to 15 years due to the increased availability of documents in digital form and the need to get them organized. Today, text classification is applied in many contexts, including document filtering, email spam filtering, automated document metadata generation, word sense disambiguation and hierarchical catalogs of web resources.

Because automatic document classification software defines the requirements for organizing content at the outset, there needs to be a clear, objective configuration of the categories and classification rules before testing, customization  and  refinement can be performed. Key elements of text classification include the ability to analyze the intent, emotion and sentiment of textual data.

Text classification helps companies understand customer behavior by categorizing conversations on social networks, comment sections  and other web sources. Having an effective and consistent automatic content classification system can provide better customer relationship management (CRM), enhance findability for key audiences and improve and organization's ability to monetize customer-generated information.

This was last updated in June 2018

Continue Reading About automatic content classification

Start the conversation

Send me notifications when other members comment.

Please create a username to comment.

-ADS BY GOOGLE

File Extensions and File Formats

Powered by:

SearchCompliance

  • regulatory compliance

    Regulatory compliance is an organization's adherence to laws, regulations, guidelines and specifications relevant to its business...

  • privacy compliance

    Privacy compliance is a company's accordance with established personal information protection guidelines, specifications or ...

  • data governance policy

    A data governance policy is a documented set of guidelines for ensuring that an organization's data and information assets are ...

SearchSecurity

SearchHealthIT

  • telemedicine (telehealth)

    Telemedicine is the remote delivery of healthcare services, such as health assessments or consultations, over the ...

  • Project Nightingale

    Project Nightingale is a controversial partnership between Google and Ascension, the second largest health system in the United ...

  • medical practice management (MPM) software

    Medical practice management (MPM) software is a collection of computerized services used by healthcare professionals and ...

SearchDisasterRecovery

SearchStorage

  • zettabyte

    A zettabyte is a unit of measurement used by technology professionals and the general public to describe a computer or other ...

  • hybrid flash array

    A hybrid flash array is a solid-state storage system that contains a mix of flash memory drives and hard disk drives.

  • NOR flash memory

    NOR flash memory is one of two types of non-volatile storage technologies.

Close