Text Normalization for Telugu Text-to-Speech Synthesis
Journal: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY (Vol.11, No. 2)Publication Date: 2013-12-06
Authors : Sunitha; P.Sunitha Devi;
Page : 2241-2249
Keywords : Speech Synthesis; Classification; Token Sense Disambiguation; Text Normalization.;
Abstract
Most areas related to language and speech technology, directly or indirectly, require handling of unrestricted text, and Text-to-speech systems directly need to work on real text. To build a natural sounding speech synthesis system, it is essential that the text processing component produce an appropriate sequence of phonemic units corresponding to an arbitrary input text. A novel approach is used, where the input text is tokenized, and classification is done based on token type. The token sense disambiguation is achieved by the semantic nature of the language and then the expansion rules are applied to get the normalized text. However, for Telugu language not much work is done on text normalization. In this paper we discuss our efforts for designing a rule based system to achieve text normalization in the context of building Telugu text-to-speech system.
Other Latest Articles
- Study of Vulnerability Diagnosis and Sustaining Integrity of the Embedded Devices
- A Novel Technique for Trust Delivery in the Cloud
- Aggregating IDS Alerts Based on Time Threshold: Testing and Results
- Software Defect Prevention through Orthogonal Defect Classification (ODC)
- Circuit Optimization For Transmission Gate Master Slave Flip-Flops
Last modified: 2016-06-29 18:44:27