• DocumentCode
    1693680
  • Title

    Speaker and language independent voice quality classification applied to unlabelled corpora of expressive speech

  • Author

    Kane, John ; Scherer, Stefan ; Aylett, Matthew ; Morency, Louis-Philippe ; Gobl, Christer

  • Author_Institution
    Phonetics & Speech Lab., Trinity Coll. Dublin, Dublin, Ireland
  • fYear
    2013
  • Firstpage
    7982
  • Lastpage
    7986
  • Abstract
    Voice quality plays a pivotal role in speech style variation. Therefore, control and analysis of voice quality is critical for many areas of speech technology. Until now, most work has focused on small purpose built corpora. In this paper we apply state-of-the-art voice quality analysis to large speech corpora built for expressive speech synthesis. A fuzzy-input fuzzy-output support vector machine classifier is trained and validated using features extracted from these corpora. We then apply this classifier to freely available audiobook data and demonstrate a clustering of the voice qualities that approximates the performance of human perceptual ratings. The ability to detect voice quality variation in these widely available unlabelled audiobook corpora means that the proposed method may be used as a valuable resource in expressive speech synthesis.
  • Keywords
    fuzzy set theory; pattern classification; speech processing; speech synthesis; support vector machines; audiobook corpora; expressive speech; fuzzy input fuzzy output support vector machine classifier; human perceptual ratings; language independent voice quality classification; speaker independent voice quality classification; speech corpora; speech style variation; speech synthesis; speech technology; state-of-the-art voice quality analysis; unlabelled corpora; Analysis of variance; Educational institutions; Feature extraction; Hidden Markov models; Speech; Speech synthesis; Support vector machines; Voice quality; audiobooks; expressive speech; glottal source; speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639219
  • Filename
    6639219