DocumentCode
2970541
Title
A study on Hidden Structural Model and its application to labeling sequences
Author
Qiao, Yu ; Suzuki, Masayuki ; Minematsu, Nobuaki
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear
2009
fDate
Nov. 13 2009-Dec. 17 2009
Firstpage
118
Lastpage
123
Abstract
This paper proposes hidden structure model (HSM) for statistical modeling of sequence data. The HSM generalizes our previous proposal on structural representation by introducing hidden states and probabilistic models. Compared with the previous structural representation, HSM not only can solve the problem of misalignment of events, but also can conduct structure-based decoding, which allows us to apply HSM to general speech recognition tasks. Different from HMM, HSM accounts for the probability of both locally absolute and globally contrastive features. This paper focuses on the fundamental formulation and theories of HSM. We also develop methods for the problems of state inference, probability calculation and parameter estimation of HSM. Especially, we show that the state inference of HSM can be reduced to a quadratic programming problem. We carry out two experiments to examine the performance of HSM on labeling sequences. The first experiment tests HSM by using artificially transformed sequences, and the second experiment is based on a Japanese corpus of connected vowel utterances. The experimental results demonstrate the effectiveness of HSM.
Keywords
parameter estimation; speech coding; speech recognition; Japanese corpus; hidden structural model; labeling sequences; parameter estimation; probability calculation; speech recognition; state inference; structural representation; structure-based decoding; Decoding; Hidden Markov models; Information science; Labeling; Natural languages; Paper technology; Parameter estimation; Probability; Robustness; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location
Merano
Print_ISBN
978-1-4244-5478-5
Electronic_ISBN
978-1-4244-5479-2
Type
conf
DOI
10.1109/ASRU.2009.5373239
Filename
5373239
Link To Document