DocumentCode :
2009556
Title :
Dialect-based speaker classification using speaker-invariant dialect features
Author :
Ma, Xuebin ; Xu, Ruiyuan ; Minematsu, Nobuaki ; Qiao, Yu ; Hirose, Keikichi ; Li, Aijun
Author_Institution :
Grad. Sch. of Frontier Sci., Univ. of Tokyo, Tokyo, Japan
fYear :
2010
fDate :
Nov. 29 2010-Dec. 3 2010
Firstpage :
171
Lastpage :
176
Abstract :
In our previous works, a structural pronunciation representation was proposed to extract the linguistic features from dialect pronunciation and classify speakers based on their dialects. In this paper, in order to prove that the structural method can extract the purely speaker-invariant dialectal features, several new experiments are carried out. First, using the data of 19 speakers from different dialect and sub-dialect regions, a dialect-based speaker classification experiment is carried out and satisfactory result is achieved. Then, one Chinese dialectologist transcribes all the data and reads the linguistic content of each original utterance in her voice through looking at the transcript and listening to the original utterance. So a new data set with minimum speaker differences (fixed speaker identity) is created. Using the new data, similar classification experiment is carried out and the result is very similar to the result of last experiment. It means that our method can extract the purely speaker-invariant dialectal features and classify speakers based on their dialects very well. After that, for the original and mimicked data sets, data sets with maximum speaker differences are simulated using high-quality voice morphing techniques. Using the original dialect data and the simulated versions together, classification experiments are carried out based two criteria, spectral comparison and structural comparison. By comparing these results, we can find that unlike the method of spectral comparison, the structural method can purely classify speakers based on their dialects, which shows the proposed dialect structures are speaker-independent and linguistic enough features.
Keywords :
feature extraction; pattern classification; speaker recognition; Chinese dialectologist; dialect based speaker classification; linguistic feature; mimicked data set; speaker independent feature; speaker invariant dialect feature; speaker invariant dialectal feature; spectral comparison; structural comparison; structural pronunciation representation; voice morphing technique; Acoustics; Data models; Feature extraction; Gallium nitride; Handwriting recognition; Pragmatics; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
Type :
conf
DOI :
10.1109/ISCSLP.2010.5684491
Filename :
5684491
Link To Document :
بازگشت