• DocumentCode
    1396480
  • Title

    Enhanced approach for latent semantic indexing using wavelet transform

  • Author

    Jaber, Tareq ; Amira, Abbes ; Milligan, P.

  • Author_Institution
    Fac. of Comput. & Inf. Technol., King Abdulaziz Univ., Jeddah, Saudi Arabia
  • Volume
    6
  • Issue
    9
  • fYear
    2012
  • fDate
    12/1/2012 12:00:00 AM
  • Firstpage
    1236
  • Lastpage
    1245
  • Abstract
    Latent semantic indexing (LSI) is a technique used for intelligent information retrieval (IR). It can be used as an alternative to traditional keyword matching IR and is attractive in this respect because of its ability to overcome problems with synonymy and polysemy. This study investigates various aspects of LSI: the effect of the Haar wavelet transform (HWT) as a preprocessing step for the singular value decomposition (SVD) in the key stage of the LSI process; and the effect of different threshold types in the HWT on the search results. The developed method allows the visualisation and processing of the term document matrix, generated in the LSI process, using HWT. The results have shown that precision can be increased by applying the HWT as a preprocessing step, with better results for hard thresholding than soft thresholding, whereas standard SVD-based LSI remains the most effective way of searching in terms of recall value.
  • Keywords
    Haar transforms; information retrieval; singular value decomposition; wavelet transforms; Haar wavelet transform; intelligent information retrieval; keyword matching IR; latent semantic indexing; polysemy problem; preprocessing step; singular value decomposition; synonymy problem; term document matrix; visualisation;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IET
  • Publisher
    iet
  • ISSN
    1751-9659
  • Type

    jour

  • DOI
    10.1049/iet-ipr.2011.0498
  • Filename
    6407283