• DocumentCode
    3145399
  • Title

    Statistical approach to voice quality control in esophageal speech enhancement

  • Author

    Yamamoto, Kenzo ; Toda, Tomoki ; Doi, Hironori ; Saruwatari, Hiroshi ; Shikano, Kiyohiro

  • Author_Institution
    Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4497
  • Lastpage
    4500
  • Abstract
    This paper describes a voice quality control method in statistical esophageal speech enhancement. Esophageal speech is produced by one of the alternative speaking methods for laryngectomees. Its naturalness and intelligibility are much lower than those of natural voices and its voice quality sounds similar even if uttered by different laryngectomees. These issues are alleviated by a statistical voice conversion method from esophageal speech into normal speech (ES-to-Speech) based on eigenvoices. This method is capable of determining converted voice quality using a few target voice samples. In this paper, we propose ES-to-Speech using regression techniques to make it possible to manually control the converted voice quality by manipulating a few intuitively controllable parameters even if no target voice sample is available. The effectiveness of the proposed method is confirmed by experimental evaluations.
  • Keywords
    acoustic signal processing; medical signal processing; regression analysis; speech processing; alternative speaking methods; converted voice quality; eigenvoices; laryngectomees; normal speech; regression techniques; statistical esophageal speech enhancement; statistical voice conversion method; target voice samples; voice quality control method; Acoustics; Estimation; Kernel; Quality control; Speech; Speech enhancement; Vectors; Esophageal speech; kernel regression; speech enhancement; voice conversion; voice quality control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6287949
  • Filename
    6287949