Automatic detection of prominent words in Russian speech

Author

Kocharov, Daniil

Author_Institution

St.-Petersburg State Univ., St. Petersburg, Russia

fYear

2010

fDate

18-20 Oct. 2010

Firstpage

435

Lastpage

438

Abstract

An experimental research with a goal to automatically detect prominent words in Russian speech is presented in this paper. The proposed automatic prominent word detection system could be further used as a module of an automatic speech recognition system or as a tool to highlight prominent words within a speech corpus for unit selection text-to-speech synthesis. The detection procedure is based on the use of prosodic features such as speech signal intensity, fundamental frequency and speech segment duration. A large corpus of Russian speech of over 200 000 running words was used to evaluate the proposed prosodic features and statistical method of speech data processing. The proposed system is speaker-independent and achieves an efficiency of 84.2 %.

Keywords

feature extraction; natural language processing; speech recognition; speech synthesis; Russian speech; automatic prominent words detection; automatic speech recognition system; speech corpus; speech data processing; statistical method; text-to-speech synthesis; Acoustics; Feature extraction; Hidden Markov models; Speech; Speech processing; Speech recognition; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on

Conference_Location

Wisla

ISSN

2157-5525

Print_ISBN

978-1-4244-6432-6

Type

conf

DOI

10.1109/IMCSIT.2010.5679943

Filename

5679943