Title :
Minimally balanced corpus for speech recognition
Author :
Irtza, S. ; Hussain, Shiraz
Author_Institution :
Electr. Eng. Dept., Univ. of Eng. & Technol., Lahore, Pakistan
Abstract :
This paper reports a method for collecting a minimally balanced corpus for speech recognition. Generally, balanced corpora are used for training speech recognition systems. However, these balanced corpora are not optimal. The paper demonstrates that such corpora can be reduced by varying degrees for different phonemes to develop a minimally balanced corpus. The experiments were conducted on speech data from ten speakers. Recognition accuracy and the amount of training data per phoneme were analyzed. The results for these speakers show that different phonemes require different amounts of training data for optimal training.
Keywords :
speech recognition; training; minimally balanced corpus; optimal training data; recognition accuracy; speaker speech data; training speech recognition systems; Accuracy; Acoustics; Error analysis; Speech; Training data; Urdu speech corpus
Conference_Title :
Communications, Signal Processing, and their Applications (ICCSPA), 2013 1st International Conference on
Conference_Location :
Sharjah
Print_ISBN :
978-1-4673-2820-3
DOI :
10.1109/ICCSPA.2013.6487286