Title :
A new algorithm for high speed speech and audio coding
Author :
Guz, Umit ; Gurkan, Hakan ; Yarman, B. Siddik
Author_Institution :
Speech Group, Int. Comput. Sci. Inst., Berkeley, CA
Abstract :
In this work, a new mathematical modeling approach is proposed for the representation of speech and audio signals. The approach is based on the generation of so-called predefined signature sequence (PSS) and predefined envelope sequence (PES) sets. Once generated, the PSS and PES sets are clustered with the k-means clustering algorithm, and the PSS and PES are redefined using the centroids of the clusters. This largely eliminates the drawbacks of our previously proposed methods, namely the size of the sets and the speed of the reconstruction process (computational complexity). Despite these reductions, initial results show that the quality of the reconstructed signals remains within the limits of acceptable hearing quality.
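The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the PSS and PES sets are generated, so `candidate_set` below is random placeholder data, and the frame length (16), set size (500), cluster count (32), and the helper names `kmeans` and `nearest` are all assumptions chosen for illustration. The sketch only shows the general idea of replacing a large set of sequences with the centroids of its k-means clusters.

```python
import numpy as np

def kmeans(vectors, k, iters=50, seed=0):
    """Plain Lloyd's k-means; returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct input vectors.
    centroids = vectors[rng.choice(len(vectors), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared Euclidean distance from every vector to every centroid.
        dists = ((vectors[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        new_centroids = centroids.copy()
        for j in range(k):
            members = vectors[labels == j]
            if len(members):  # keep the old centroid if a cluster empties
                new_centroids[j] = members.mean(axis=0)
        converged = np.allclose(new_centroids, centroids)
        centroids = new_centroids
        if converged:
            break
    # Final assignment against the centroids that are returned.
    dists = ((vectors[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return centroids, dists.argmin(axis=1)

def nearest(codebook, vector):
    """Index of the codebook entry closest to `vector`."""
    return int(((codebook - vector) ** 2).sum(axis=1).argmin())

# Hypothetical stand-in for a large sequence set: 500 sequences of 16 samples.
rng = np.random.default_rng(1)
candidate_set = rng.standard_normal((500, 16))

# The reduced set consists of the 32 cluster centroids.
codebook, labels = kmeans(candidate_set, k=32)
print(candidate_set.shape, "->", codebook.shape)
```

In a scheme of this kind, a smaller set means fewer candidates to search per frame and fewer bits to index an entry, at the cost of approximating each original sequence by its cluster centroid.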
Keywords :
audio coding; computational complexity; signal reconstruction; speech coding; audio signal representation; hearing quality; k-means clustering algorithm; mathematical modeling; predefined envelope sequence sets; predefined signature sequence sets; speech signal representation; Auditory system; Bandwidth; Bit rate; Clustering algorithms; Pulse modulation; Signal to noise ratio; Speech synthesis
Conference_Title :
Circuit Theory and Design, 2007. ECCTD 2007. 18th European Conference on
Conference_Location :
Seville
Print_ISBN :
978-1-4244-1341-6
Electronic_ISBN :
978-1-4244-1342-3
DOI :
10.1109/ECCTD.2007.4529566