Title :
Class-specific classifiers in audio-visual speech recognition
Author :
Estellers, Virginia ; Baggenstoss, Paul M. ; Thiran, Jean-Philippe
Author_Institution :
Signal Process. Lab. (LTS5), Ecole Polytech. Fed. de Lausanne, Lausanne, Switzerland
Abstract :
In this paper, class-specific classifiers for audio, visual and audiovisual speech recognition systems are developed and compared with traditional Bayes classifiers. We use state-of-the-art feature extraction methods and develop traditional and class-specific classifiers for speech recognition, showing the benefits of a class-specific method on each modality for speaker dependent and independent set-ups. Experiments with a reference audio-visual database show a general increase in the systems performance by the introduction of class-specific techniques on both visual and audio-visual modalities.
Keywords :
audio-visual systems; feature extraction; pattern classification; speech recognition; audio-visual modality; audio-visual speech recognition; class specific classifier; feature extraction method; reference audio-visual database; speaker dependent setup; speaker independent setup; Hidden Markov models; Mel frequency cepstral coefficient; Principal component analysis; Signal to noise ratio; Speech recognition; Transforms; Visualization;
Conference_Titel :
Signal Processing Conference, 2010 18th European
Conference_Location :
Aalborg