Browse Definitions :
Definition

speech synthesis

Speech synthesis is the computer-generated simulation of human speech. It is used to translate written information into aural information where it is more convenient, especially for mobile applications such as voice-enabled e-mail and unified messaging . It is also used to assist the vision-impaired so that, for example, the contents of a display screen can be automatically read aloud to a blind user. Speech synthesis is the counterpart of speech or voice recognition . The earliest speech synthesis effort was in 1779 when Russian Professor Christian Kratzenstein created an apparatus based on the human vocal tract to demonstrate the physiological differences involved in the production of five long vowel sounds. The first fully functional voice synthesizer, Homer Dudley's VODER (Voice Operating Demonstrator), was shown at the 1939 World's Fair. The VODER was based on Bell Laboratories' vocoder (voice coder) research of the mid-thirties.

Speech prosthesis is computer-generated speech for people with physical disabilities that make it difficult to speak intelligibly. Much of the research in this area integrates text and speech generation both, since the disabilities that create problems with speech frequently make text entry difficult as well. Given the speed and fluidity of human conversation, the challenge of speech prosthesis is to circumvent these difficulties. The main research goal is to create a prosthetic system that will as closely as possible resemble natural speech, with the least required input from the user. Speech prosthesis systems also make it possible for visually-impaired people to use computers.

Multimodal speech synthesis (sometimes referred to as audio-visual speech synthesis) incorporates an animated face synchronized to complement the synthesized speech. The same difficulties underlying an individual's speech impairment often hinder their ability to communicate through facial expressions. Although synthesized speech is increasingly life-like, it may be quite some time before it approaches the capacity for nuances of natural speech. Multimodal systems incorporate a means of adding non-verbal cues to speech (such as head-shaking, smiling, and winking, for example) to make the user's meaning as clear as possible.

This was last updated in September 2005

Continue Reading About speech synthesis

Start the conversation

Send me notifications when other members comment.

Please create a username to comment.

-ADS BY GOOGLE

File Extensions and File Formats

SearchCompliance

  • smart contract

    A smart contract, also known as a cryptocontract, is a computer program that directly controls the transfer of digital currencies...

  • risk map (risk heat map)

    A risk map, also known as a risk heat map, is a data visualization tool for communicating specific risks an organization faces. A...

  • internal audit (IA)

    An internal audit (IA) is an organizational initiative to monitor and analyze its own business operations in order to determine ...

SearchSecurity

SearchHealthIT

  • Health IT (health information technology)

    Health IT (health information technology) is the area of IT involving the design, development, creation, use and maintenance of ...

  • fee-for-service (FFS)

    Fee-for-service (FFS) is a payment model in which doctors, hospitals, and medical practices charge separately for each service ...

  • biomedical informatics

    Biomedical informatics is the branch of health informatics that uses data to help clinicians, researchers and scientists improve ...

SearchDisasterRecovery

  • risk mitigation

    Risk mitigation is a strategy to prepare for and lessen the effects of threats faced by a data center.

  • ransomware recovery

    Ransomware recovery is the process of resuming options following a cyberattack that demands payment in exchange for unlocking ...

  • natural disaster recovery

    Natural disaster recovery is the process of recovering data and resuming business operations following a natural disaster.

SearchStorage

  • RAID 5

    RAID 5 is a redundant array of independent disks configuration that uses disk striping with parity.

  • non-volatile storage (NVS)

    Non-volatile storage (NVS) is a broad collection of technologies and devices that do not require a continuous power supply to ...

  • petabyte

    A petabyte is a measure of memory or data storage capacity that is equal to 2 to the 50th power of bytes.

Close