DocumentCode :
2017439
Title :
Phone boundary refinement using ranking methods
Author :
Lo, Hung-Yi ; Wang, Hsin-Min
fYear :
2010
fDate :
Nov. 29 2010-Dec. 3 2010
Firstpage :
488
Lastpage :
492
Abstract :
The HMM/SVM-based two-stage framework has been widely used for automatic phone alignment. The two-stage method uses SVM classifiers to refine the hypothesized boundaries given by the HMM-based Viterbi forced alignment. However, there are two drawbacks in using the classification model for detecting the phone boundaries. First, the training data contains only information about the boundary and far away non-boundary signal characteristics. Second, the classification model suffers from the class-imbalanced training problem. To overcome these drawbacks, we propose using ranking methods to refine the hypothesized boundaries. We train multiple phone-transition-dependent rankers by using K-means-based and decision-tree-based clustering. Both Ranking SVM and RankBoost are evaluated. The results of experiments on the TIMIT corpus demonstrate that the proposed ranking method outperforms the classification method. The best accuracy achieved is 84.20% within a tolerance of 10 ms. The mean boundary distance is 6.66 ms.
Keywords :
decision trees; hidden Markov models; maximum likelihood estimation; pattern classification; speech processing; support vector machines; HMM based Viterbi forced alignment; HMM-SVM based two stage framework; K-mean based clustering; SVM classifier; TIMIT corpus; automatic phone alignment; class imbalanced training problem; decision tree based clustering; hypothesized boundary; multiple phone transition dependent ranker; nonboundary signal characteristics; phone boundary refinement; ranking method; Feature extraction; Hidden Markov models; Speech; Speech recognition; Support vector machines; Training; Training data; RankBoost; automatic phone segmentation; ranking SVM; ranking method;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
Type :
conf
DOI :
10.1109/ISCSLP.2010.5684874
Filename :
5684874
Link To Document :
بازگشت