DocumentCode :
2308556
Title :
Language Identification Using Pitch Contour Information in the Ergodic Markov Model
Author :
Lin, Chi-yueh ; Wang, Hsiao-Chuan
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
It had been shown that a segment of pitch contour represented by a set of Legendre polynomial coefficients was successful to the pair-wise language identification task. Feature vectors comprising these polynomial coefficients were formerly modeled by a Gaussian mixture model (GMM) for each language. However, the static model like GMM does not take advantage of the temporal information across several pitch contours. It is intuitive that the temporal information of prosodic features should be used for capturing the characteristics of a specific language. In this paper, a novel dynamic model in ergodic topology is proposed. The experiments show that the proposed method significantly improves the identification rate, even for stress-timed and syllable-timed languages
Keywords :
Gaussian processes; Legendre polynomials; Markov processes; natural languages; speech processing; speech recognition; Gaussian mixture model; Legendre polynomial; ergodic Markov model; ergodic topology; pair-wise language identification; pitch contour information; stress-timed languages; syllable-timed languages; Autocorrelation; Automatic speech recognition; Frequency; Labeling; Natural languages; Polynomials; Speech processing; Statistics; Stress; Topology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1659990
Filename :
1659990
Link To Document :
بازگشت