• DocumentCode
    3411118
  • Title
    Segmentation of a speech spectrogram using mathematical morphology
  • Author
    Steinberg, Raphael ; O'Shaughnessy, Douglas
  • Author_Institution
    INRS-Telecommun., Montreal, QC
  • fYear
    2008
  • fDate
    March 31, 2008 - April 4, 2008
  • Firstpage
    1637
  • Lastpage
    1640
  • Abstract
    It has been shown that speech spectrograms can be read by trained experts. In this work, we regard the speech spectrogram image as written text in an unknown language and segment it in order to capture the energy associated with each formant. We propose an algorithm based on Mathematical Morphology operators, chiefly the watershed transform. The result is a robust segmentation of wideband speech spectrograms that can later be used for automatic speech recognition. We show results of experimental runs for different phoneme classes.
  • Keywords
    image segmentation; mathematical morphology; speech processing; speech recognition; automatic speech recognition; mathematical morphology operator; phoneme class; speech spectrogram image segmentation; watershed transform; wideband speech spectrogram; Data mining; Frequency; Morphology; Optical filters; Skeleton; Spectrogram; Wideband; Morphological operations; Optical character recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2008.4517940
  • Filename
    4517940