مرکز منطقه ای اطلاع رساني علوم و فناوري - Hardware/software co-design for fast-trainable speaker identification system based on SMO

DocumentCode :

2380580

Title :

Hardware/software co-design for fast-trainable speaker identification system based on SMO

Author :

Wang, Jhing-Fa ; Peng, Shiang ; Wang, Jia-Ching ; Lin, Po-Chuan ; Kuan, Ta-Wen

Author_Institution :

Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan

fYear :

2011

fDate :

9-12 Oct. 2011

Firstpage :

1621

Lastpage :

1625

Abstract :

Embedded speaker identification system is a popular research, but most of current systems can not provide fast training ability. Because of the low computational ability in the embedded environment, a large amount of waiting time usually makes the human-machine interface not friendly. This paper presents a hardware and software (HW/SW) co-design solution for fast-trainable speaker identification system. Fast training ability makes this embedded speaker identification system possess high flexibility and enhances the convenience to a wide range of real-world applications. The proposed system consists of a training phase and a multiclass identification phase. The sequential minimal optimization (SMO) training algorithm occupies the heaviest computational load and is realized as a dedicated VLSI module, i.e., the hardware component. The other processes such as speech preprocess, speech feature extraction, and SVM voting strategy are implemented by software. Moreover, a data-packed mechanism is presented to improve the bandwidth utilization. Compared with the embedded C code based on ARM processor, our system reduces 90% of the training time and achieves 89.9% identification rate with the NIST 2010 speaker recognition database. The proposed system was tested and found to be fully functional working on a Socle CDK prototype system with an AMBA based Xilinx FPGA and an ARM926EJ processor.

Keywords :

VLSI; computational complexity; embedded systems; hardware-software codesign; optimisation; speaker recognition; support vector machines; ARM processor; ARM926EJ processor; NIST 2010 speaker recognition database; SVM voting strategy; Socle CDK prototype; VLSI module; a AMBA based Xilinx FPGA; computational load; data-packed mechanism; embedded C code; embedded system; fast-trainable speaker identification system; hardware-software codesign; human-machine interface; multiclass identification phase; sequential minimal optimization; sequential minimal optimization training algorithm; speech feature extraction; speech preprocess; training phase; Algorithm design and analysis; Feature extraction; Hardware; Speaker recognition; Speech; Support vector machines; Training; Hardware/Software Co-design; Sequential Minimal Optimization (SMO); Speaker Identification;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on

Conference_Location :

Anchorage, AK

ISSN :

1062-922X

Print_ISBN :

978-1-4577-0652-3

Type :

conf

DOI :

10.1109/ICSMC.2011.6083903

Filename :

6083903

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2380580