• DocumentCode
    3206948
  • Title

    ACE: improving search engines via Automatic Concept Extraction

  • Author

    Ramirez, Paul M. ; Mattmann, Chris A.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2004
  • fDate
    8-10 Nov. 2004
  • Firstpage
    229
  • Lastpage
    234
  • Abstract
    The proliferation of the Internet has caused the process of browsing and searching for information to become extremely cumbersome. While many search engines provide reasonable information., they still fall short by overwhelming users with a multitude of often irrelevant results. This problem has several causes but most notably is the inability for the user to be able to convey the context of their search. Unfortunately, search engines must assume a general context when looking for matching pages, causing users to visit each page in the result list to ultimately find or not find their desired result. We believe that the necessity of visiting each page could be removed if the concepts, i.e. over-arching ideas of the underlying page, could be revealed to the end user. This would require mining the concepts from each referenced page. It is our contention that this could be done automatically, rather than relying on the current convention of mandating that the searcher extract these concepts manually through examination of result links. This ability to mine concepts would not only be useful to finding the appropriate result but in further identifying relevant pages. We present the Automatic Concept Extraction (ACE) algorithm, which can aid users performing searches using search engines. We discuss ACE both theoretically, and in the context of a graphical user interface and implementation which we have constructed in Java to aid in qualitatively evaluating our algorithm. ACE is found to perform at least as well or better than 4 other related algorithms, which we survey in the literature.
  • Keywords
    Internet; Java; graphical user interfaces; information retrieval; online front-ends; search engines; ACE algorithm; Automatic Concept Extraction algorithm; Internet; Java; graphical user interface; information browsing; search engines; Computer science; Data mining; Graphical user interfaces; Information filtering; Information filters; Internet; Java; Search engines; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration, 2004. IRI 2004. Proceedings of the 2004 IEEE International Conference on
  • Print_ISBN
    0-7803-8819-4
  • Type

    conf

  • DOI
    10.1109/IRI.2004.1431465
  • Filename
    1431465