ACOUSTIC MODELING FOR KAZAKH SPEECH SYNTHESIS
Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol. 19, No. 5)
Publication Date: 2019-10-01
Authors: A.K. Kaliyev; S.V. Rybin
Pages: 951-954
Keywords: acoustic model; speech synthesis; Kazakh language; generative adversarial network (GAN); speech corpus
Abstract
We present a new generative adversarial network (GAN) framework for training an acoustic model for speech synthesis. The proposed network consists of a generator and a pair of agent discriminators; the generator predicts acoustic features from the linguistic representation. Training and testing were carried out on a Kazakh speech corpus containing 5.6 hours of recorded speech. In the experiments, the model obtained a mean opinion score of 3.46, which indicates acceptable synthesis quality. This approach to acoustic model development can also be applied in speech synthesis systems for other languages.
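For readers unfamiliar with the metric, a mean opinion score (MOS) is the arithmetic mean of listener ratings given on a 1 to 5 scale. The abstract does not describe the listening test, so the following is only a minimal sketch of how such a score might be computed: the function name `mean_opinion_score`, the normal-approximation 95% confidence interval, and the example ratings are all assumptions made for illustration and are not taken from the paper.

```python
import math
import statistics


def mean_opinion_score(ratings):
    """Return (mos, half_width) for a list of 1-5 listener ratings.

    half_width is the half-width of a 95% confidence interval under a
    normal approximation (an assumption; the paper does not report one).
    """
    if len(ratings) < 2:
        raise ValueError("need at least two ratings")
    if any(r < 1 or r > 5 for r in ratings):
        raise ValueError("ratings must be on the 1-5 scale")
    mos = statistics.fmean(ratings)
    # Standard error of the mean from the sample standard deviation.
    std_err = statistics.stdev(ratings) / math.sqrt(len(ratings))
    return mos, 1.96 * std_err


# Hypothetical ratings for illustration only; not data from the paper.
example_ratings = [4, 3, 4, 3, 3, 4, 4, 3, 3, 4]
mos, ci = mean_opinion_score(example_ratings)
print(f"MOS = {mos:.2f} +/- {ci:.2f}")
```

On the usual five-point scale (1 = bad, 5 = excellent), a score of 3.46 falls between "fair" and "good", which is consistent with the abstract's description of the quality as acceptable.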
Last modified: 2019-10-23 21:02:06