DocumentCode :
3233265
Title :
Voice command interpretation for robot control
Author :
Punchihewa, Amal ; Arshad, Zuriawati Mohd
Author_Institution :
Sch. of Eng. & Adv. Technol., Massey Univ., Palmerston North, New Zealand
fYear :
2011
fDate :
6-8 Dec. 2011
Firstpage :
90
Lastpage :
95
Abstract :
This paper presents some initial results from an analysis of performance of a voice command interpretation and authorisation system using voiceprint to identify the human-commander. Two approaches based on human voice related algorithms are proposed. Mel-frequency cepstral coefficient (MFCC) and perceptual linear predictive (PLP) are two feature extraction methods that are closely mimic the human auditory system. The two methods were applied to the proposed system to determine their suitability for use in a commander recognition system. Vector Quantization (VQ) with Linde-Buzo-Gray (LBG) iterative algorithm was used for clustering for the classification of commanders. The performance of the algorithms was evaluated to compare between two methods in MATLAB simulation environment based on, false rejection rate (rejecting an authorised commander), false acceptance rate (accepting unauthorised commander) and the execution time. Based on the initial results, both methods achieved accurate classification and PLP method has shown better execution time and lower false-acceptance rate compared to the MFCC. The combined approach (MFCC-PLP) did not show considerable improved performances to the individual feature models PLP and MFCC without incurring high computational costs that will compromise the performance of the speaker recognition tasks. Therefore, PLP method is the best candidate for command-recognition system to be developed in the second phase of this research.
Keywords :
authorisation; cepstral analysis; feature extraction; hearing; iterative methods; robots; speaker recognition; vector quantisation; Linde-Buzo-Gray iterative algorithm; MATLAB simulation environment; Mel-frequency cepstral coefficient; authorisation system; commander classification; commander recognition system; false rejection rate; false-acceptance rate; feature extraction methods; human auditory system; human voice related algorithms; human-commander identification; perceptual linear predictive; robot control; speaker recognition tasks; vector quantization; voice command interpretation; Auditory system; Feature extraction; Hidden Markov models; Humans; Speaker recognition; Speech; Speech recognition; identification; interpretation; recognition; robot control; voice command;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automation, Robotics and Applications (ICARA), 2011 5th International Conference on
Conference_Location :
Wellington
Print_ISBN :
978-1-4577-0329-4
Type :
conf
DOI :
10.1109/ICARA.2011.6144862
Filename :
6144862
Link To Document :
بازگشت