DocumentCode
2960309
Title
Benefits of prior speech segmentation for best time-frequency visualisation using Renyi´s entropy
Author
Boutana, Daoud ; Benidir, Messaoud
Author_Institution
Univ. de Jijel, Jijel
fYear
2006
fDate
10-13 Dec. 2006
Firstpage
688
Lastpage
691
Abstract
In this paper, a new approach that operates in the joint time-frequency domain for speech segmentation is presented. Segmentation is an important application in speech and audio processing. The segmentation in time domain is based on Renyi entropy especially on Renyi marginal entropy (RME) properties. Experiments were conducted using real-life speech signal as consonant-vowel (CV) transition that consists of two different events. They demonstrated the ability of the method for segmentation of speech signal made of CV transition. This technique is also useful for best time-frequency visualization with appropriate parameters. Because of the simplicity and effectiveness of proposed segmentation technique, it can be applied in many applications such as speaker identification/verification, estimation of the duration of the plosives, feature extraction, and classification.
Keywords
audio signal processing; entropy; speech processing; time-frequency analysis; Renyi entropy; Renyi marginal entropy properties; audio processing; consonant-vowel transition; real-life speech signal; speech processing; speech segmentation technique; time-frequency domain; time-frequency visualisation; Entropy; Feature extraction; Frequency estimation; Frequency measurement; Speech enhancement; Speech processing; Speech recognition; Time frequency analysis; Time measurement; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Electronics, Circuits and Systems, 2006. ICECS '06. 13th IEEE International Conference on
Conference_Location
Nice
Print_ISBN
1-4244-0395-2
Electronic_ISBN
1-4244-0395-2
Type
conf
DOI
10.1109/ICECS.2006.379882
Filename
4263460
Link To Document