Text-to-Speech Synthesis using Phoneme Concatenation
Journal: International Journal of Scientific Engineering and Technology (IJSET) (Vol.3, No. 2)Publication Date: 2014-02-01
Authors : Mahwash Ahmed Shibli Nisar;
Page : 193-197
Keywords : Text-To-Speech; Phonemes; Time Domain Pitch Synchronous Overlap-Add (TD-PSOLA); concatenative synthesis;
Abstract
Text-to-speech (TTS) synthesis transforms any linguistic information stored as data or text into speech. It is widely used in audio reading devices for blind people. In the last few years however, the use of TTS technology has grown far beyond the disabled community and become a major adjunct to the rapidly growing use of digital voice storage for voice mail and voice response systems. Concatenative TTS synthesis system has gained in popularity in recent years, due to its more natural sounding synthesized speech. It concatenates pre-recorded speech units into the word sequences according to the pronunciation dictionary or set of rules [i]. For general purpose TTS, it must be able to read unrestricted text [ii]. Thus it is desirable to have the basic speech units much smaller, like for example phonemes or diaphones, in order to be able to synthesize all possible phonetic and prosodic variation in the language with a limited database size.
Other Latest Articles
- Condition Monitoring of Ball Bearings Using Statistical Analysis
- Convective Heat and Mass Transfer Flow Over A Vertical Plate With Nth Order Chemical Reaction In A Porous Medium
- Convective Heat and Mass Transfer Flow Over A Vertical Plate With Nth Order Chemical Reaction In A Porous Medium
- A Brief Review on Advance Manufacturing Process of Automobile seat Production
- Categories for Housing Performance Evaluation
Last modified: 2014-02-04 19:56:59