مرکز منطقه ای اطلاع رساني علوم و فناوري - Voice command interpretation for robot control

DocumentCode :

3233265

Title :

Voice command interpretation for robot control

Author :

Punchihewa, Amal ; Arshad, Zuriawati Mohd

Author_Institution :

Sch. of Eng. & Adv. Technol., Massey Univ., Palmerston North, New Zealand

fYear :

2011

fDate :

6-8 Dec. 2011

Firstpage :

Lastpage :

Abstract :

This paper presents some initial results from an analysis of performance of a voice command interpretation and authorisation system using voiceprint to identify the human-commander. Two approaches based on human voice related algorithms are proposed. Mel-frequency cepstral coefficient (MFCC) and perceptual linear predictive (PLP) are two feature extraction methods that are closely mimic the human auditory system. The two methods were applied to the proposed system to determine their suitability for use in a commander recognition system. Vector Quantization (VQ) with Linde-Buzo-Gray (LBG) iterative algorithm was used for clustering for the classification of commanders. The performance of the algorithms was evaluated to compare between two methods in MATLAB simulation environment based on, false rejection rate (rejecting an authorised commander), false acceptance rate (accepting unauthorised commander) and the execution time. Based on the initial results, both methods achieved accurate classification and PLP method has shown better execution time and lower false-acceptance rate compared to the MFCC. The combined approach (MFCC-PLP) did not show considerable improved performances to the individual feature models PLP and MFCC without incurring high computational costs that will compromise the performance of the speaker recognition tasks. Therefore, PLP method is the best candidate for command-recognition system to be developed in the second phase of this research.

Keywords :

authorisation; cepstral analysis; feature extraction; hearing; iterative methods; robots; speaker recognition; vector quantisation; Linde-Buzo-Gray iterative algorithm; MATLAB simulation environment; Mel-frequency cepstral coefficient; authorisation system; commander classification; commander recognition system; false rejection rate; false-acceptance rate; feature extraction methods; human auditory system; human voice related algorithms; human-commander identification; perceptual linear predictive; robot control; speaker recognition tasks; vector quantization; voice command interpretation; Auditory system; Feature extraction; Hidden Markov models; Humans; Speaker recognition; Speech; Speech recognition; identification; interpretation; recognition; robot control; voice command;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automation, Robotics and Applications (ICARA), 2011 5th International Conference on

Conference_Location :

Wellington

Print_ISBN :

978-1-4577-0329-4

Type :

conf

DOI :

10.1109/ICARA.2011.6144862

Filename :

6144862

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3233265