DocumentCode
2308556
Title
Language Identification Using Pitch Contour Information in the Ergodic Markov Model
Author
Lin, Chi-yueh ; Wang, Hsiao-Chuan
Author_Institution
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
It had been shown that a segment of pitch contour represented by a set of Legendre polynomial coefficients was successful to the pair-wise language identification task. Feature vectors comprising these polynomial coefficients were formerly modeled by a Gaussian mixture model (GMM) for each language. However, the static model like GMM does not take advantage of the temporal information across several pitch contours. It is intuitive that the temporal information of prosodic features should be used for capturing the characteristics of a specific language. In this paper, a novel dynamic model in ergodic topology is proposed. The experiments show that the proposed method significantly improves the identification rate, even for stress-timed and syllable-timed languages
Keywords
Gaussian processes; Legendre polynomials; Markov processes; natural languages; speech processing; speech recognition; Gaussian mixture model; Legendre polynomial; ergodic Markov model; ergodic topology; pair-wise language identification; pitch contour information; stress-timed languages; syllable-timed languages; Autocorrelation; Automatic speech recognition; Frequency; Labeling; Natural languages; Polynomials; Speech processing; Statistics; Stress; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1659990
Filename
1659990
Link To Document