• DocumentCode
    3390935
  • Title

    Application of the phase vocoder to pitch-preserving synchronization of an audio stream to an external clock

  • Author

    Sussman, Rob ; LaRoche, Julie

  • Author_Institution
    Joint E-mu/Creative Technol. Center, Scotts Valley, CA, USA
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    75
  • Lastpage
    78
  • Abstract
    The phase vocoder is usually presented as a high-quality solution for time-scale modification of signals, Its main advantages versus the cheaper time-domain techniques include the high-quality of the output for a wide range of types of input signals (speech, music, noise), and the possibility to perform very large factor modifications (e.g., four-fold time-stretching or more). In this paper, we present two applications that require such extreme modification factors: we call the first one pitch-preserving audio scrubbing, in which a user can move a pointer along an audio track and hear the sound at the corresponding location without any pitch alteration. Because the user controls the playback location (and therefore the playback speed), and can very well stop at a given location, the required time-scale modification can involve a very large-factor. The second application consists of synchronizing an audio stream to a video stream, while avoiding pitch alteration. For extreme slow-motion playback, the time-scaling operation required to preserve the pitch can also involve a very large factor. We address theoretical and practical issues related to pitch-preserving synchronization of an audio track. Techniques are discussed to allow freezing time in the phase-vocoder and avoid problems associated with very large factor modifications
  • Keywords
    synchronisation; vocoders; audio stream; audio track; external clock; four-fold time-stretching; input signals; music; noise; phase vocoder; pitch-preserving audio scrubbing; pitch-preserving synchronization; playback location; playback speed; slow-motion playback; speech; time-scale modification; very large factor modifications; video stream; Acoustic noise; Audio recording; Clocks; Digital-analog conversion; Multiple signal classification; Speech enhancement; Streaming media; Synchronization; Time domain analysis; Vocoders;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics, 1999 IEEE Workshop on
  • Conference_Location
    New Paltz, NY
  • Print_ISBN
    0-7803-5612-8
  • Type

    conf

  • DOI
    10.1109/ASPAA.1999.810853
  • Filename
    810853