DocumentCode :
173182
Title :
Sparsification of voice data using Discrete Rajan Transform and its applications in speaker recognition
Author :
Prashanthi, G. ; Singh, Sushil ; Rajan, E.G. ; Krishnan, Prasad
Author_Institution :
Pentagram Res. Centre Pvt Ltd., Hyderabad, India
fYear :
2014
fDate :
5-8 Oct. 2014
Firstpage :
429
Lastpage :
434
Abstract :
This paper proposes a novel technique of sparsing speech data and compressing it in spectral domain. Discrete Rajan Transform is applied to voice data and the spectrum is sparsed by retaining the first component CPI (Cumulative Point Index) of the spectrum and forcing the other spectral components to zero. Thus the spectrum could be compressed to a maximum of 12.5% of the original data. As and when required the compressed spectrum could be synthesized using Inverse Discrete Rajan Transform and the reconstructed speech data analyzed for speaker recognition. Speaker recognition accuracy to a maximum of 93.5% has been obtained in this case.
Keywords :
Hadamard transforms; data compression; discrete transforms; encoding; inverse transforms; speaker recognition; spectral analysis; CPI; compressed spectrum synthesis; cumulative point index; inverse discrete Rajan transform; reconstructed speech data analysis; speaker recognition accuracy; spectral components; spectral domain; spectrum sparsing; speech data compression; speech data sparsing technique; voice data sparsification; Accuracy; Databases; Mel frequency cepstral coefficient; Speaker recognition; Speech; Speech recognition; Transforms; Discrete Rajan Transform; MFCC and Fuzzy Vector Quantization; Speech Processing and Speaker Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on
Conference_Location :
San Diego, CA
Type :
conf
DOI :
10.1109/SMC.2014.6973945
Filename :
6973945
Link To Document :
بازگشت