• DocumentCode
    1997401
  • Title
    Pitch-synchronous time alignment of speech signals for prosody transplantation
  • Author
    Latsch, Vagner L.; Netto, Sergio L.
  • Author_Institution
    DEL, Fed. Univ. of Rio de Janeiro, Rio de Janeiro, Brazil
  • fYear
    2011
  • fDate
    15-18 May 2011
  • Firstpage
    2405
  • Lastpage
    2408
  • Abstract
    Prosody transplantation is a speech signal modification procedure usually used for voice transformation or for evaluating the quality of speech synthesizers. In practice, the pitch contour is mapped onto a common segmental content, and the target signal is modified by adjusting the position and length of its speech frames to achieve the desired pitch contour and time duration taken from a reference speech signal. A new algorithm for prosody transplantation is presented, based on pitch-synchronous feature extraction of the speech signal, which unifies the time-alignment and pitch-modification stages. The result is a computationally efficient algorithm for prosody transplantation that maximizes the spectral similarity between the target and reference signals.
  • Keywords
    feature extraction; speech synthesis; pitch contour; pitch-synchronous feature extraction; pitch-synchronous time alignment; prosody transplantation; spectral similarity; speech frames; speech reference; speech signal modification procedure; speech synthesizers; time duration; voice transformation; Approximation algorithms; Heuristic algorithms; Interpolation; Labeling; Partitioning algorithms; Speech; Speech processing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Circuits and Systems (ISCAS), 2011 IEEE International Symposium on
  • Conference_Location
    Rio de Janeiro
  • ISSN
    0271-4302
  • Print_ISBN
    978-1-4244-9473-6
  • Electronic_ISBN
    0271-4302
  • Type
    conf
  • DOI
    10.1109/ISCAS.2011.5938088
  • Filename
    5938088