• DocumentCode
    538075
  • Title

    Automatic detection of prominent words in Russian speech

  • Author

    Kocharov, Daniil

  • Author_Institution
    St.-Petersburg State Univ., St. Petersburg, Russia
  • fYear
    2010
  • fDate
    18-20 Oct. 2010
  • Firstpage
    435
  • Lastpage
    438
  • Abstract
    An experimental research with a goal to automatically detect prominent words in Russian speech is presented in this paper. The proposed automatic prominent word detection system could be further used as a module of an automatic speech recognition system or as a tool to highlight prominent words within a speech corpus for unit selection text-to-speech synthesis. The detection procedure is based on the use of prosodic features such as speech signal intensity, fundamental frequency and speech segment duration. A large corpus of Russian speech of over 200 000 running words was used to evaluate the proposed prosodic features and statistical method of speech data processing. The proposed system is speaker-independent and achieves an efficiency of 84.2 %.
  • Keywords
    feature extraction; natural language processing; speech recognition; speech synthesis; Russian speech; automatic prominent words detection; automatic speech recognition system; speech corpus; speech data processing; statistical method; text-to-speech synthesis; Acoustics; Feature extraction; Hidden Markov models; Speech; Speech processing; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
  • Conference_Location
    Wisla
  • ISSN
    2157-5525
  • Print_ISBN
    978-1-4244-6432-6
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2010.5679943
  • Filename
    5679943