• DocumentCode
    2802814
  • Title

    Preserving the character of perturbations in scaled pitch contours

  • Author

    Baran, Thomas ; Malyska, Nicolas ; Quatieri, Thomas F.

  • Author_Institution
    MIT Lincoln Lab., Lincoln, MA, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    417
  • Lastpage
    420
  • Abstract
    The global and fine dynamic components of a pitch contour in voice production, as in the speaking and singing voice, are important for both the meaning and character of an utterance. In speech, for example, slow pitch inflections, rapid pitch accents, and irregular regions all comprise the pitch contour. In applications where all components of a pitch contour are stretched or compressed in the same way, as for example in time-scale modification, an unnatural scaled contour may result. In this paper, we develop a framework for scaling pitch contours, motivated by the goal of maintaining naturalness in time-scale modification of voice. Specifically, we develop a multi-band algorithm to independently modify the slow trajectory and fast perturbation components of a contour for a more natural synthesis, and we present examples where pitch contours representative of speaking and singing voice are lengthened. In the speaking voice, the frequency content of flutter or irregularity is maintained, while slow pitch inflection is simply stretched or compressed. In the singing voice, rapid vibrato is preserved while slower note-to-note variation is scaled as desired.
  • Keywords
    acoustic signal processing; musical acoustics; speech synthesis; fast perturbation components; multiband algorithm; note-to-note variation; pitch inflections; rapid pitch accents; rapid vibrato; scaled pitch contours; singing voice; speaking voice; time-scale modification; utterance; voice production; Bandwidth; Frequency; Jitter; Laboratories; Scattering; Signal synthesis; Sociotechnical systems; Speech synthesis; Vocoders; Time-scale modification; pitch perturbation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495774
  • Filename
    5495774