Automatic Phonemes Segmentation for Quran Verses Using Kaldi ToolkitJournal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.10, No. 12)
Publication Date: 2021-12-30
Authors : Alaa Ehab Sakran; Mohsen Rashwan; Sherif Mahdy Abdou;
Page : 39-45
Keywords : Automatic phonetic segmentation; ASSS; AM; LM; GMM-HMM; DNN-HMM; KALDI;
In this paper, automatic segmentation system was built using the Kaldi toolkit at phoneme level for Quran verses data set with a total speech corpus of (80 hours) and its corresponding text corpus respectively, with a size of 1100 recorded Quran verses of 100 non-Arab reciters. Initiated with the extraction of Mel Frequency Cepstral Coefficients MFCCs, the proceedings of the building of Language Model LM and Acoustic Model AM training phase continued until the Deep Neural Network DNN level by selecting 770 waves (70 reciters). The testing of the system was done using 220 waves (20 reciters), and concluded with the selection of the development data set which was 280 waves (10 reciters). Comparison was implemented between automatic and manual segmentation, and the results obtained for the test set was 99% and for the Development set was 99% with Time Delay Neural Networks TDNN based acoustic modelling.
Other Latest Articles
Last modified: 2021-12-30 17:27:29