• DocumentCode
    1323869
  • Title
    Time-Varying Autoregressions in Speech: Detection Theory and Applications
  • Author
    Rudoy, Daniel; Quatieri, Thomas F.; Wolfe, Patrick J., Sr.
  • Author_Institution
    Stat. & Inf. Sci. Lab., Harvard Univ., Cambridge, MA, USA
  • Volume
    19
  • Issue
    4
  • fYear
    2011
  • fDate
    5/1/2011
  • Firstpage
    977
  • Lastpage
    989
  • Abstract
    This paper develops a general detection theory for speech analysis based on time-varying autoregressive models, which themselves generalize the classical linear predictive speech analysis framework. This theory leads to a computationally efficient decision-theoretic procedure that may be applied to detect the presence of vocal tract variation in speech waveform data. A corresponding generalized likelihood ratio test is derived and studied both empirically for short data records, using formant-like synthetic examples, and asymptotically, leading to constant false alarm rate hypothesis tests for changes in vocal tract configuration. Two in-depth case studies then serve to illustrate the practical efficacy of this procedure across different time scales of speech dynamics: first, the detection of formant changes on the scale of tens of milliseconds of data, and second, the identification of glottal opening and closing instants on time scales below ten milliseconds.
  • Keywords
    autoregressive processes; decision theory; maximum likelihood detection; prediction theory; speech processing; time-varying systems; decision-theoretic procedure; detection theory; false alarm rate; generalized likelihood ratio test; glottal opening; linear predictive speech analysis; speech dynamics; speech waveform data; time-varying autoregressions; vocal tract variation; glottal airflow; likelihood ratio test; linear prediction; nonstationary time series
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TASL.2010.2073704
  • Filename
    5570952