• DocumentCode
    538306
  • Title

    Cross-language identification using the wavelet transform and artificial neural network

  • Author

    Al-Dubaee, Shawki A. ; Ahmad, Nesar

  • Author_Institution
    Dept. of Comput. Eng., Aligarh Muslim Univ., Aligarh, India
  • fYear
    2010
  • fDate
    13-15 Dec. 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    With the advent of the Internet, search engines were developed for English language because English language was a lingua franca. Currently, most of popular search engines such as Google and Yahoo! are available in more than 50 languages. However, these search engines have received less attention in South Asian languages especially, Urdu language. In this paper, we propose a novel approach for feature extraction and classification of queries in cross-language search engines. This novel approach presents an automatic method for classification of English and Urdu languages identification. The classifier used is a three-layered feedforward artificial neural network and the feature vector is formed by calculating the wavelet coefficients. Three wavelet decomposition functions (filters), namely Haar, Bior 2.2 and Bior 3.1 have been used to extract the feature vector set and their performance results have been compared. The performance results of the Haar filter have given superior results than other filters.
  • Keywords
    Haar transforms; feedforward neural nets; natural language processing; search engines; wavelet transforms; Bior 2.2 wavelet decomposition function; Bior 3.1 wavelet decomposition function; English language; Haar wavelet decomposition function; Urdu language; cross-language identification; cross-language search engines; feedforward artificial neural network; query feature classification; query feature extraction; wavelet transform; Artificial neural networks; Feature extraction; Internet; Multiresolution analysis; Search engines; Wavelet transforms; Unicode; Wavelet transforms; artificial neural network; cross-language; language identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Kaleidoscope: Beyond the Internet? - Innovations for Future Networks and Services, 2010 ITU-T
  • Conference_Location
    Pune
  • Print_ISBN
    978-1-4244-8272-6
  • Electronic_ISBN
    978-92-61-13171-5
  • Type

    conf

  • Filename
    5682132