Title :
Enhanced lengthening cancellation using bidirectional pitch similarity alignment for spontaneous speech
Author :
Po-Yi Shih ; Bo-Wei Chen ; Jhing-Fa Wang ; Jhing-Wei Wu
Author_Institution :
Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Abstract :
In this work, an enhanced lengthening cancellation method is proposed to detect and cancel the lengthened part of vowels. The proposed method consists of an autocorrelation function, cosine similarity-based lengthening detection, and bidirectional pitch contour alignment. The autocorrelation function is used to obtain the reference pitch contour. The cosine similarity-based method measures the similarity between the reference and the next adjacent pitch contours. Because periodic segments vary in length, fixed-size frames may cause cumulative errors; therefore, bidirectional pitch contour alignment is adopted in this study. Experiments indicate that the proposed method achieves accuracy rates of 91.4% and 88.7% on a 60-keyword and a 50-sentence database, respectively. Moreover, the proposed approach runs about three times faster than the baseline. These results demonstrate the effectiveness of the proposed method.
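The first two components named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the lag search range, and the 0.9 similarity threshold are all assumptions made for illustration. It also uses a fixed period length for every segment, which is the simplification the abstract says accumulates error; the bidirectional alignment step is not reproduced here because the abstract does not specify it.

```python
import math


def autocorr_period(signal, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the largest autocorrelation.

    Hypothetical helper: a plain (unnormalized) autocorrelation search,
    used here as a stand-in for the reference pitch period estimate.
    """
    best_lag, best_val = min_lag, float("-inf")
    n = len(signal)
    for lag in range(min_lag, max_lag + 1):
        val = sum(signal[i] * signal[i + lag] for i in range(n - lag))
        if val > best_val:
            best_lag, best_val = lag, val
    return best_lag


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length sequences (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def count_similar_periods(signal, start, period, threshold=0.9):
    """Count consecutive period-length segments after the reference segment
    whose cosine similarity to the reference stays at or above `threshold`.

    The threshold value is an assumption, not a figure from the paper.
    """
    reference = signal[start:start + period]
    count = 0
    pos = start + period
    while pos + period <= len(signal):
        segment = signal[pos:pos + period]
        if cosine_similarity(reference, segment) < threshold:
            break
        count += 1
        pos += period
    return count


if __name__ == "__main__":
    # Synthetic "vowel": a sine wave with a 20-sample period, then silence.
    voiced = [math.sin(2 * math.pi * i / 20) for i in range(100)]
    signal = voiced + [0.0] * 100
    period = autocorr_period(voiced, 10, 40)
    repeats = count_similar_periods(signal, 0, period)
    print(period, repeats)
```

A long run of highly similar adjacent periods is the kind of cue a lengthening detector could act on; how many repeats count as lengthening, and how the excess periods are removed, would have to come from the paper itself.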
Keywords :
speech processing; 50-sentence database; 60-keyword database; bidirectional pitch contour alignment; bidirectional pitch similarity alignment; cosine similarity-based lengthening detection; enhanced lengthening cancellation; spontaneous speech; vowels; Abstracts; Europe; Speech; keyword spotting; lengthening cancellation; pitch contour alignment; speech preprocessing; spontaneous speech recognition
Conference_Title :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423517