DocumentCode
3411118
Title
Segmentation of a speech spectrogram using mathematical morphology
Author
Steinberg, Raphael ; O'Shaughnessy, Douglas
Author_Institution
INRS-Telecommun., Montreal, QC
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
1637
Lastpage
1640
Abstract
It has been shown that trained experts can read speech spectrograms. In this work, we treat the speech spectrogram image as written text in an unknown language and segment it in order to capture the energy associated with each formant. We propose an algorithm based on mathematical morphology operators, chiefly the watershed transform. The result is a robust segmentation of wideband speech spectrograms that can later be used for automatic speech recognition. We show results of experimental runs for different phoneme classes.
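The abstract names the building blocks (morphological operators and a watershed transform) but not the concrete pipeline. The following is a minimal sketch of how such blocks can fit together, not the authors' algorithm. Everything specific in it is an assumption made for illustration: the toy 7x7 "spectrogram" (rows as frequency bins, columns as time frames), the horizontal 1x3 structuring element, the marker threshold of 7, and the simplified marker-based priority flooding that assigns every pixel to a region without drawing watershed lines.

```python
import heapq

NEIGHBORS = ((-1, 0), (1, 0), (0, -1), (0, 1))  # 4-connectivity

def _window_filter(img, ry, rx, pick):
    """Apply min or max over a flat (2*ry+1) x (2*rx+1) window, clipped at borders."""
    h, w = len(img), len(img[0])
    return [[pick(img[y][x]
                  for y in range(max(0, i - ry), min(h, i + ry + 1))
                  for x in range(max(0, j - rx), min(w, j + rx + 1)))
             for j in range(w)] for i in range(h)]

def opening(img, ry=0, rx=1):
    """Grayscale opening (erosion, then dilation): removes bright details
    narrower than the window while keeping wider structures."""
    eroded = _window_filter(img, ry, rx, min)
    return _window_filter(eroded, ry, rx, max)

def label_markers(img, thresh):
    """Label 4-connected components of pixels >= thresh as 1, 2, ...; 0 elsewhere."""
    h, w = len(img), len(img[0])
    labels = [[0] * w for _ in range(h)]
    count = 0
    for i in range(h):
        for j in range(w):
            if img[i][j] >= thresh and labels[i][j] == 0:
                count += 1
                labels[i][j] = count
                stack = [(i, j)]
                while stack:
                    y, x = stack.pop()
                    for dy, dx in NEIGHBORS:
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and labels[ny][nx] == 0 and img[ny][nx] >= thresh):
                            labels[ny][nx] = count
                            stack.append((ny, nx))
    return labels, count

def watershed(energy, markers):
    """Simplified marker-based flooding: grow each marker's label outward,
    always expanding from the highest-energy labeled pixel first."""
    h, w = len(energy), len(energy[0])
    labels = [row[:] for row in markers]
    heap, order = [], 0
    for i in range(h):
        for j in range(w):
            if labels[i][j]:
                heapq.heappush(heap, (-energy[i][j], order, i, j))
                order += 1
    while heap:
        _, _, y, x = heapq.heappop(heap)
        for dy, dx in NEIGHBORS:
            ny, nx = y + dy, x + dx
            if 0 <= ny < h and 0 <= nx < w and labels[ny][nx] == 0:
                labels[ny][nx] = labels[y][x]
                heapq.heappush(heap, (-energy[ny][nx], order, ny, nx))
                order += 1
    return labels

# Hypothetical toy data: two horizontal energy bands ("formants") plus one
# isolated bright speck at row 3, column 5.
spec = [
    [0, 0, 0, 0, 0, 0, 0],
    [1, 5, 5, 5, 5, 5, 1],
    [1, 5, 9, 9, 9, 5, 1],
    [0, 1, 1, 1, 1, 9, 0],
    [1, 4, 8, 8, 8, 4, 1],
    [1, 4, 4, 4, 4, 4, 1],
    [0, 0, 0, 0, 0, 0, 0],
]

_, n_raw = label_markers(spec, thresh=7)         # speck counts as a marker
opened = opening(spec, ry=0, rx=1)               # speck is suppressed
markers, n_open = label_markers(opened, thresh=7)
seg = watershed(opened, markers)

for row in seg:
    print(" ".join(str(v) for v in row))
```

In this sketch the opening step matters because every marker seeds one region: on the raw toy image the speck would create a third region, whereas after opening only the two bands seed the flooding. A real spectrogram would need a structuring element and threshold chosen for its time and frequency resolution.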
Keywords
image segmentation; mathematical morphology; speech processing; speech recognition; automatic speech recognition; mathematical morphology operator; phoneme class; speech spectrogram image segmentation; watershed transform; wideband speech spectrogram; Data mining; Frequency; Morphology; Optical filters; Skeleton; Spectrogram; Wideband; Morphological operations; Optical character recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4517940
Filename
4517940