• DocumentCode
    2212122
  • Title

    Fractal dimension segmentation: isolated speech recognition

  • Author

    Fekkai, S. ; Al-Akaidi, M. ; Blackledge, J.

  • Author_Institution
    Fac. of Comput. Sci. & Eng., De Montfort Univ., Leicester, UK
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    42461
  • Lastpage
    42465
  • Abstract
    This paper investigates the use of fractal geometry for segmenting digital signals. A method of texture segmentation is introduced which is based on the fractal dimension. Using this approach, variations in texture across a signal or image can be characterized in terms of variations in the fractal dimension. By analyzing the spatial fluctuations in fractal dimension obtained using a conventional moving window approach, a digital signal or image can be texture segmented; this is the principle of fractal dimension segmentation (FDS). In this paper, we apply this form of texture segmentation to isolated speech signals. An overview of methods for computing the fractal dimension is presented focusing on an approach that makes use of the characteristic power density function (PSDF) of a random scaling fractal signal. FDS is applied to a number of different speech signals and the results discussed for isolated words and the components (e.g. fricatives) from which these words are composed. In particular, it is shown that by pre-filtering speech signals with a low-pass filter of the form 1/k. This provides confidence in the approach to speech segmentation considered in this paper and in principle, allows a template matching scheme to be designed that is based exclusively on FDS
  • Keywords
    speech recognition; fractal dimension segmentation; fractal geometry; fricatives; isolated speech recognition; isolated speech signals; isolated words; low-pass filter; moving window approach; power density function; pre-filtering speech signals; random scaling fractal signal; spatial fluctuations; speech segmentation; template matching; texture segmentation;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Speech Coding for Algorithms for Radio Channels (Ref. No. 2000/012), IEE Seminar
  • Conference_Location
    London
  • Type

    conf

  • DOI
    10.1049/ic:20000042
  • Filename
    855147