DocumentCode :
1686194
Title :
Phonetic segmentation using statistical correction and multi-resolution fusion
Author :
Sixuan Zhao ; Ing Yann Soon ; Soo Ngee Koh ; Luke, Kang Kwong
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2013
Firstpage :
6694
Lastpage :
6698
Abstract :
This paper focuses on the generation of accurate phonetic segmentations. Statistical methods based on absolute and relative correction are discussed and experimented on both monophone and biphone models to improve the segmentation results. The influence of search range on the statistical correction process is studied and a state selection technique is used to enhance the correction results. This paper also explores the influence of resolution (stepsize) of HMMs and proposes a multi-resolution fusion process to further refine the statistically corrected results. Improvements of segmentation results in terms of segmentation accuracy, mean absolute error (MAE), and root mean square error (RMSE) can be observed by applying the proposed refinement methods.
Keywords :
hidden Markov models; speech processing; statistical analysis; HMM resolution; MAE; RMSE; absolute correction; biphone models; mean absolute error; monophone models; multiresolution fusion process; phonetic segmentations; relative correction; root mean square error; search range; segmentation accuracy; segmentation result improvement; state selection technique; statistical correction process; statistical methods; Accuracy; Acoustics; Context modeling; Hidden Markov models; Speech; Speech processing; Training; multi-resolution; phonetic segmentation; state selection; statistical correction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6638957
Filename :
6638957
Link To Document :
بازگشت