noisy channel model

The noisy channel model is a framework used in natural language processing (NLP) to infer the intended word when the observed input is ambiguous or corrupted. The framework helps spell checkers, virtual assistants, translation programs, question answering systems and speech-to-text software work out which word was meant.

Noise, in this context, is anything that obscures signals and data. The noisy channel model is so named because the original signal (the intended word) is obscured in transmission when disruptions or errors create noise in the channel. In written language, noise could be a misspelling; in spoken language, it could be ambient sound, mispronunciation or slurred speech.
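The idea is usually written as a probability calculation. In the sketch below, the symbols are notation introduced here for illustration rather than taken from this definition: x is the observed (noisy) input, w is a candidate intended word and V is the vocabulary of known words. The model picks the candidate that is most probable given what was observed, which by Bayes' rule amounts to weighing how likely the channel is to turn w into x against how common w is to begin with. The term P(x) is dropped because it is the same for every candidate.

```latex
\hat{w} \;=\; \arg\max_{w \in V} P(w \mid x)
        \;=\; \arg\max_{w \in V} P(x \mid w)\, P(w)
```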

Here’s a basic example of how the noisy channel model might work with a spell check program:

When a word is not found in the spell check dictionary, it is flagged as a misspelling, and candidate words are suggested according to their probability of being the intended word. That probability typically depends on how close each candidate is to the misspelling and on how common the candidate is. As a rule, the most likely candidates differ from the misspelling by a single change, and there are four types of single-change errors: deletion, insertion, substitution and reversal (also called transposition). If the misspelled word is acress, for example, acres could have been intended, with an extra s inserted; actress could have been intended, with the t deleted; across could have been intended, with e substituted for o; caress could have been intended, with the first two letters reversed. Thus, if the user types acress, the spell check program might suggest those four words.
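The acress example can be sketched in a few lines of Python. This is a minimal illustration, not how any particular spell checker is implemented: the names WORD_COUNTS, single_edits and suggest are hypothetical, the word counts are made up, and the sketch assumes every single-character error is equally likely, so candidates end up ranked by word frequency alone.

```python
import string

# Hypothetical toy dictionary; the counts are invented for illustration only.
WORD_COUNTS = {"across": 100, "acres": 30, "actress": 10, "caress": 2, "address": 40}


def single_edits(word):
    """Return every string that is one single-character change away from word."""
    letters = string.ascii_lowercase
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    # Remove one character (undoes an accidental insertion).
    deletes = {a + b[1:] for a, b in splits if b}
    # Swap two adjacent characters (undoes a reversal).
    transposes = {a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1}
    # Replace one character (undoes a substitution).
    substitutions = {a + c + b[1:] for a, b in splits if b for c in letters}
    # Add one character (undoes an accidental deletion).
    inserts = {a + c + b for a, b in splits for c in letters}
    return (deletes | transposes | substitutions | inserts) - {word}


def suggest(typo):
    """Rank dictionary words one edit away from typo, most frequent first."""
    candidates = [w for w in single_edits(typo) if w in WORD_COUNTS]
    total = sum(WORD_COUNTS.values())
    # Assumption: all single edits are equally likely, so only the
    # relative word frequency distinguishes the candidates.
    return sorted(candidates, key=lambda w: WORD_COUNTS[w] / total, reverse=True)


print(suggest("acress"))
```

With these invented counts, the four words from the example are returned and address is not, because it is more than one change away from acress. A fuller model would also estimate how likely each specific typing error is, rather than treating all of them as equal.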

Because of the vagaries of human speech and the potential for actual noise, speech recognition software faces challenges beyond those of text-based systems. However, the basic framework is similar.

This was last updated in August 2019
