DocumentCode
3461115
Title
Audio-based context awareness - acoustic modeling and perceptual evaluation
Author
Eronen, Antri ; Tuom, Juha ; Klapuri, Anssi ; Fagerlund, Seppo ; Sorsa, Timo ; Lorho, Gaëtan ; Huopaniemi, Jyri
Author_Institution
Inst. of Signal Process., Tampere Univ. of Technol., Finland
Volume
5
fYear
2003
fDate
6-10 April 2003
Abstract
The paper concerns the development of a system for the recognition of a context or an environment based on acoustic information only. Our system uses Mel-frequency cepstral coefficients and their derivatives as features, and continuous density hidden Markov models (HMM) as acoustic models. We evaluate different model topologies and training methods for HMMs and show that discriminative training can yield a 10% reduction in error rate compared to maximum-likelihood training. A listening test is made to study the human accuracy in the task and to obtain a baseline for the assessment of the performance of the system. Direct comparison to human performance indicates that the system performs somewhat worse than human subjects do in the recognition of 18 everyday contexts and almost comparably in recognizing six higher level categories.
Keywords
acoustic signal processing; audio signal processing; error statistics; feature extraction; hearing; hidden Markov models; learning (artificial intelligence); HMM; MFCC; Mel-frequency cepstral coefficients; acoustic information; acoustic modeling; context awareness; continuous density hidden Markov models; feature extraction; maximum-likelihood training; perceptual evaluation; Acoustic testing; Audio recording; Cepstral analysis; Context awareness; Context modeling; Hidden Markov models; Humans; Mel frequency cepstral coefficient; Statistics; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1200023
Filename
1200023
Link To Document