DocumentCode :
2023548
Title :
Ranknet based English stressed syllable detection
Author :
Yang, Xiaohong ; Chen, Qingcai ; Wan, Ling ; Wang, Xiaolong
Author_Institution :
Dept. of Comput. Sci. & Technol., Harbin Inst. of Technol., Shenzhen, China
fYear :
2010
fDate :
23-25 Nov. 2010
Firstpage :
1084
Lastpage :
1089
Abstract :
A lot of time or frequency domain speech features had been applied to address the problem of English stressed syllable detection. Researchers had proved that the combination of multiple features is necessary to get better performance. But up to now, the tasks of seeking new feasible speech features and innovative feature fusion approaches are still open. This paper proposes a detection-by-ranking approach to address the stressed syllable detection problem based on the RankNet technique. The approach is able to find out the stressed syllable through one by one comparison of feature vectors corresponding to vowels of syllables in a multi-syllable word. This paper also introduces the fractal dimensions of each vowel as one type of the stress features. Experiments conducted on the corpus TIMIT show that the proposed feature fusion method reaches high performance and the introducing of fractal dimension is helpful for improving the detection correct rate.
Keywords :
speech recognition; Ranknet; detection-by-ranking approach; english stressed syllable detection; feature fusion approaches; Cepstrum; Error analysis; Feature extraction; Fractals; Speech; Stress; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Audio Language and Image Processing (ICALIP), 2010 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-5856-1
Type :
conf
DOI :
10.1109/ICALIP.2010.5685107
Filename :
5685107
Link To Document :
بازگشت