Title :
A Particle Swarm Optimization-Based Approach to Speaker Segmentation Based on Independent Component Analysis on GSM Digital Speech
Author :
Mirrezaie, S.M. ; Faez, Karim ; Asnaashari, Amir ; Ziaei, Ali
Author_Institution :
Dept. of Electr. Eng., Amirkabir Univ. of Technol., Tehran
Abstract :
Adaptive Multi-Rate (AMR) codec was standardized for GSM in 1999. AMR offers substantial improvement over previous GSM speech codecs in error robustness by adapting speech and channel coding depending on channel conditions. The Adaptive Multi-Rate speech codec is adopted as a standard for IMT-2000 by ETSI and 3GPP and consists of eight source codecs with bit rates from 4.75 to 12.2 kbit/s. In this paper, we present an approach comprising of particle swarm optimization (PSO), which encodes possible segmentations of an audio record, and measures mutual information between these segments and the audio data. This measure is used as the fitness function for the PSO. A compact encoding of the solution for PSO which decreases the length of the PSO individuals and enhances the PSO convergence properties is adopted. The algorithm has been tested on two actual sets of data with AMR format for speaker segmentation, obtaining very good results in all test problems. The results have been compared to the widely used a genetic algorithm-based in several practical situations. No assumptions have been made about prior knowledge of speech signal characteristics. However, we assume that the speakers do not speak simultaneously and that we have no real-time constraints.
Keywords :
adaptive codes; audio coding; cellular radio; channel coding; independent component analysis; particle swarm optimisation; speech codecs; speech coding; GSM digital speech codec; PSO convergence; adaptive multirate speech codec; audio record segmentation; channel coding; independent component analysis; particle swarm optimization; speaker segmentation; Channel coding; Code standards; GSM; Independent component analysis; Particle swarm optimization; Robustness; Speech analysis; Speech codecs; Speech coding; Testing; Adaptive multirate (AMR); genetic algorithm; mutual information; particle swarm optimization (PSO); speaker segmentation;
Conference_Titel :
Signal Processing and Information Technology, 2008. ISSPIT 2008. IEEE International Symposium on
Conference_Location :
Sarajevo
Print_ISBN :
978-1-4244-3554-8
Electronic_ISBN :
978-1-4244-3555-5
DOI :
10.1109/ISSPIT.2008.4775731