Definition

automatic content classification

Contributor(s): Corinne Bernstein

Automatic content classification is a process for managing text and unstructured information by categorizing or clustering text. By labeling natural language texts with relevant categories from a predefined set, automatic document classification enables users to organize content quickly and efficiently.

While manual document classification may be highly detailed and accurate, it is time-consuming and subjective. Automatic document classification is faster, more scalable and more objective. It provides organizations with a more systematic and consistent classification and can be useful in more complex, nuanced contexts, such as business-specific content. Machine learning and artificial intelligence can boost the speed and efficiency of automatic document classification.

The automated classification of texts into predefined categories has gained attention in the past 10 to 15 years due to the increased availability of documents in digital form and the need to get them organized. Today, text classification is applied in many contexts, including document filtering, email spam filtering, automated document metadata generation, word sense disambiguation and hierarchical catalogs of web resources.
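As an illustration of the spam-filtering use case, the following is a minimal sketch of a multinomial naive Bayes classifier with add-one smoothing. Naive Bayes is only one common technique and is chosen here as an assumption for illustration; the training sentences, the "spam" and "ham" labels and the function names `tokenize`, `train` and `predict` are all hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train(examples):
    """Collect per-label document and word counts from (text, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return label_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest log-probability for the text."""
    label_counts, word_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in label_counts.items():
        # Log prior: share of training documents carrying this label.
        score = math.log(n_docs / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokenize(text):
            if tok not in vocab:
                continue  # Ignore words never seen during training.
            # Add-one (Laplace) smoothing avoids zero probabilities.
            score += math.log(
                (word_counts[label][tok] + 1) / (total_words + len(vocab))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy training set, for illustration only.
EXAMPLES = [
    ("win a free prize now", "spam"),
    ("free money offer", "spam"),
    ("meeting agenda for monday", "ham"),
    ("project status report attached", "ham"),
]

model = train(EXAMPLES)
print(predict(model, "free prize offer"))
print(predict(model, "status report for monday"))
```

A production filter would need far more training data and evaluation on held-out messages; the sketch only shows how labeled examples can be turned into a statistical classifier.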

Because automatic document classification software defines the requirements for organizing content at the outset, there needs to be a clear, objective configuration of the categories and classification rules before testing, customization and refinement can be performed. Key elements of text classification include the ability to analyze the intent, emotion and sentiment of textual data.
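The idea of configuring categories and classification rules up front can be sketched with a simple keyword-based classifier. This is a minimal illustration, not how any particular product works; the category names, the keyword sets and the `classify` function are hypothetical.

```python
import re

# Hypothetical predefined categories, each with its own keyword rule.
RULES = {
    "billing": {"invoice", "refund", "payment"},
    "support": {"error", "crash", "password"},
}


def classify(text, rules=RULES, default="uncategorized"):
    """Return the category whose keywords match the most words in the text."""
    tokens = set(re.findall(r"[a-z']+", text.lower()))
    # Score each category by the number of its keywords found in the text.
    scores = {category: len(tokens & keywords) for category, keywords in rules.items()}
    best = max(scores, key=scores.get)
    # Fall back to the default label when no rule matches at all.
    return best if scores[best] > 0 else default


print(classify("I need a refund for my last invoice"))
print(classify("The app shows an error after I reset my password"))
print(classify("Hello there"))
```

Because the rules live in one explicit table, they can be tested and refined without touching the matching logic. On a tie, this sketch returns whichever category appears first in `RULES`.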

Text classification helps companies understand customer behavior by categorizing conversations on social networks, comment sections and other web sources. Having an effective and consistent automatic content classification system can provide better customer relationship management (CRM), enhance findability for key audiences and improve an organization's ability to monetize customer-generated information.

This was last updated in June 2018
