DocumentCode
3636203
Title
Dialect distance assessment method based on comparison of pitch pattern statistical models
Author
Mahnoosh Mehrabani;Hynek Bořil;John H.L. Hansen
Author_Institution
Center for Robust Speech Systems, Erik Jonsson School of Engineering & Computer Science, The University of Texas at Dallas, USA
fYear
2010
Firstpage
5158
Lastpage
5161
Abstract
Dialect variations of a language have a severe impact on the performance of speech systems. Therefore, knowing how close or diverse dialects are in a given language space provides useful information to predict, or improve, system performance when there is a mismatch between train and test data. Distance measures have been used in several applications of speech processing. However, apart from phonetic measures, little if any work has been done on dialect distance measurement. This study explores differences in pitch movement microstructure among dialects. A method of dialect distance assessment based on pitch patterns modeled progressively from pitch contour primitives is proposed. The presented method does not require any manual labeling and is text-independent. The KL divergence is employed to compare the resulting statistical models. The proposed scheme is evaluated on a corpus of Arabic dialects, and shown to be consistent with the results from the spectral-based dialect classification system. Finally, it is also shown using a perceptive evaluation that the proposed objective approach correlates well with subjective distances.
Keywords
"Natural languages","Speech processing","Testing","Distortion measurement","System performance","Speech recognition","Speech coding","Distance measurement","Feature extraction","Speech synthesis"
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
2379-190X
Type
conf
DOI
10.1109/ICASSP.2010.5495019
Filename
5495019
Link To Document