DocumentCode :
3674647
Title :
An image based approach for speech perception
Author :
Nguyen Quang Trung;Bui The Duy;Ma Thi Chau
Author_Institution :
Human Machine Interaction Laboratory, University of Engineering &
fYear :
2015
Firstpage :
208
Lastpage :
213
Abstract :
Classification of speech signal is one of the most vital problems in speech perception and spoken word recognition. Although, there have been many studies on the classification of speech signals but the results are still limited. In this paper, we propose an image based approach for speech signal classification based on the combination of Local Naïve Bayes Nearest Neighbor (LNBNN) and Scale-invariant Feature Transform (SIFT) features. The proposed approach allows training feature vectors to have different sizes and no re-training is needed for additional training data after training phase. With this approach, achieved classification results are very satisfactory. They are 72.8%, 100% and 95.0% on the ISOLET, Digits and Places databases, respectively.
Keywords :
"Speech","Speech recognition","Feature extraction","Databases","Hidden Markov models","Mel frequency cepstral coefficient","Training"
Publisher :
ieee
Conference_Titel :
Information and Computer Science (NICS), 2015 2nd National Foundation for Science and Technology Development Conference on
Print_ISBN :
978-1-4673-6639-7
Type :
conf
DOI :
10.1109/NICS.2015.7302192
Filename :
7302192
Link To Document :
بازگشت