DocumentCode
261645
Title
An approach to lexical stress detection from transcribed continuous speech using acoustic features
Author
Domokos, Jozsef ; Stan, Adriana ; Giurgiu, Mircea
Author_Institution
Dept. of Commun., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear
2014
fDate
25-27 Nov. 2014
Firstpage
525
Lastpage
528
Abstract
This paper presents a first approach to the unsupervised learning and prediction of primary lexical stress starting from continuous speech data and its orthographic transcript. The approach is intended to be used in the development of text-to-speech synthesis systems for under-resourced languages. Our method is based on syllable nuclei approximation and stress detection using simple acoustic features. The evaluation is performed on 3.5 hours of speech uttered by a Romanian female speaker and results show an accuracy of 47.20% at word level and 58.61% at syllable level.
Keywords
acoustic signal processing; approximation theory; natural language processing; speech synthesis; unsupervised learning; Romanian female speaker; acoustic features; continuous speech data; lexical stress detection; orthographic transcript; syllable nuclei approximation; text-to-speech synthesis systems; transcribed continuous speech; under-resourced languages; unsupervised learning; Accuracy; Acoustics; Error analysis; Feature extraction; Harmonic analysis; Speech; Stress; lexical stress; stress detection; stress prediction; text-to-speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunications Forum Telfor (TELFOR), 2014 22nd
Conference_Location
Belgrade
Print_ISBN
978-1-4799-6190-0
Type
conf
DOI
10.1109/TELFOR.2014.7034462
Filename
7034462
Link To Document