• DocumentCode
    2801887
  • Title

    Audio-based context awareness acoustic modeling and perceptual evaluation

  • Author

    Eronen, Antti ; Tuomi, Jarkko ; Klapuri, Anssi ; Fagerlund, Seppo

  • Author_Institution
    Tampere Univ. of Technol., Finland
  • fYear
    2003
  • fDate
    19-22 Oct. 2003
  • Firstpage
    50
  • Abstract
    Summary form only given. The paper concerns the development of a system for the recognition of a context or an environment based on acoustic information only. Our system uses Mel-frequency cepstral coefficients and their derivatives as features, and continuous density hidden Markov models (HMM) as acoustic models. We evaluate different model topologies and training methods for HMMs and show that discriminative training can yield a 10% reduction in error rate compared to maximum-likelihood training. A listening test is made to study the human accuracy in the task and to obtain a base-line for the assessment of the performance of the system. Direct comparison to human performance indicates that the system performs somewhat worse than human subjects do in the recognition of 18 everyday contexts and almost comparably in recognizing six higher level categories.
  • Keywords
    acoustic signal processing; audio signal processing; error statistics; hearing; hidden Markov models; learning (artificial intelligence); pattern recognition; Mel-frequency cepstral coefficients; acoustic modeling; audio-based context recognition; continuous density hidden Markov models; continuous density hidden Markov models HMM; discriminative training; error rate; maximum-likelihood training; model topologies; perceptual evaluation; Acoustic noise; Amplitude estimation; Context awareness; Context modeling; Error analysis; Frequency domain analysis; Hidden Markov models; Humans; Predictive models; Psychoacoustic models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on.
  • Print_ISBN
    0-7803-7850-4
  • Type

    conf

  • DOI
    10.1109/ASPAA.2003.1285814
  • Filename
    1285814