• DocumentCode
    1511738
  • Title

    Speech Analysis and Synthesis Based on Dynamic Modes

  • Author

    Vargas, J. ; McLaughlin, Steve

  • Author_Institution
    Escuela de Ing. Electr., Univ. de Los Andes, Merida, Venezuela
  • Volume
    19
  • Issue
    8
  • fYear
    2011
  • Firstpage
    2566
  • Lastpage
    2578
  • Abstract
    In this paper, the source-filter model of speech production is adapted to represent the speech signal as the superposition and convolution of a dynamic source and resonant modes. The aim is to increase the resolution of the time-instantaneous-frequency representation of each of the individual contributions of different sections of the human phonatory system. We present a framework based on dynamic mode predictors and filters, which are adapted, using gradient-based techniques, to track the modal dynamics of speech yielding a representation which is free from quasi-stationary assumptions thus allowing flexible manipulation of the speech signal. Several examples are offered including intonation modifications to illustrate the potential of the proposed approach.
  • Keywords
    filtering theory; gradient methods; prediction theory; signal representation; speech processing; speech synthesis; dynamic mode filters; dynamic mode predictors; gradient-based techniques; human phonatory system; quasi-stationary assumptions; source filter model; speech production; speech signal processing; speech synthesis; time instantaneous-frequency representation; Modal analysis; Speech analysis; Speech processing; Speech synthesis; Dynamic features; instantaneous frequency; instantaneous pitch tracking; intonation modification; modal dynamics; nonstationary models; speech resonances;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2151859
  • Filename
    5764822