مرکز منطقه ای اطلاع رساني علوم و فناوري - Text Independent Composite Speaker Identification/Verification Using Multiple Features

DocumentCode :

2620868

Title :

Text Independent Composite Speaker Identification/Verification Using Multiple Features

Author :

Revathi, A. ; Venkataramani, Y.

Author_Institution :

Dept. of ECE, Nat. Inst. of Technol., Trichy, India

Volume :

fYear :

2009

fDate :

March 31 2009-April 2 2009

Firstpage :

257

Lastpage :

261

Abstract :

The main objective of this paper is to explore the effectiveness of feature selection for performing composite speaker identification/verification. We propose features such as line spectral frequency (LSF), differential line spectral frequency (DLSF), mel frequency cepstral coefficients (MFCC), discrete cosine transform cepstrum (DCTC), perceptual linear predictive cepstrum (PLP) and mel frequency perceptual linear predictive cepstrum (MF-PLP). These features are captured and training models are developed by K-means clustering procedure. A speaker identification system is evaluated on noise added test speeches and the experimental results reveal the performance of the proposed algorithm in identifying speakers based on minimum distance between test features and clusters and also highlight the best choice of feature set among all the proposed features for 50 speakers chosen randomly from "TIMIT" database. Analysis is performed on the identification results to emphasize the choice of features which produce better results for speaker verification with respect to equal error rate. In this work, F-ratio is computed as a theoretical measure to validate the experimental results for both identification and verification.

Keywords :

cepstral analysis; discrete cosine transforms; pattern clustering; speaker recognition; K-means clustering; differential line spectral frequency; discrete cosine transform; mel frequency cepstral coefficients; speaker identification; speaker verification; Cepstrum; Clustering algorithms; Data analysis; Discrete cosine transforms; Mel frequency cepstral coefficient; Performance analysis; Spatial databases; Speech analysis; Speech enhancement; System testing; Clustering methods; Discrete cosine transform; Frequency response; Noise; Pseudorandom sequence; Speaker recognition; Spectral analysis; Speech analysis; Speech processing; Vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Science and Information Engineering, 2009 WRI World Congress on

Conference_Location :

Los Angeles, CA

Print_ISBN :

978-0-7695-3507-4

Type :

conf

DOI :

10.1109/CSIE.2009.926

Filename :

5170321

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2620868