DocumentCode :
3494920
Title :
Multimodal speech recognition of a person with articulation disorders using AAM and MAF
Author :
Miyamoto, Chikoto ; Komai, Yuto ; Takiguchi, Tetsuya ; Ariki, Yasuo ; Li, Ichao
Author_Institution :
Grad. Sch. of Eng., Kobe Univ., Kobe, Japan
fYear :
2010
fDate :
4-6 Oct. 2010
Firstpage :
517
Lastpage :
520
Abstract :
We investigate speech recognition for a person with articulation disorders resulting from athetoid cerebral palsy. The articulation of speech tends to become unstable due to strain on speech-related muscles, which degrades speech recognition performance. To address this problem, we use multiple acoustic frames (MAF) as an acoustic feature. Furthermore, current speech recognition systems do not perform sufficiently well in real environments because of the influence of noise, so visual features are used in addition to acoustic features to increase noise robustness. However, people with cerebral palsy tend to move their heads erratically, which causes recognition problems. To solve this problem, we investigate a pose-robust audio-visual speech recognition method using an Active Appearance Model (AAM) for people with articulation disorders resulting from athetoid cerebral palsy. AAMs are used for face tracking to extract pose-robust facial feature points. The effectiveness of the method is confirmed by word recognition experiments on noisy speech of a person with articulation disorders.
Keywords :
diseases; face recognition; feature extraction; pose estimation; speech recognition; AAM; MAF; acoustic feature; active appearance model; articulation disorder; athetoid cerebral palsy; multimodal speech recognition; multiple acoustic frame; pose-robust audio-visual speech recognition method; pose-robust facial feature point extraction; visual feature; Acoustics; Active appearance model; Face; Feature extraction; Hidden Markov models; Speech; Speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Multimedia Signal Processing (MMSP), 2010 IEEE International Workshop on
Conference_Location :
Saint Malo
Print_ISBN :
978-1-4244-8110-1
Electronic_ISBN :
978-1-4244-8111-8
Type :
conf
DOI :
10.1109/MMSP.2010.5662075
Filename :
5662075