• DocumentCode
    1940427
  • Title

    A compact memory space of dynamic full-text search using Bi-gram index

  • Author

    Atlam, El-Sayed ; Ghada, El-Marhomy ; Fuketa, Masao ; Morita, Kazuhiro ; Aoe, Jun-Ichi

  • Author_Institution
    Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan
  • Volume
    1
  • fYear
    2004
  • fDate
    28 June-1 July 2004
  • Firstpage
    104
  • Abstract
    Full-text search is widely used by many Internet services. Faster and more efficient full-text search technology is needed because the number of documents handled, and the volume of associated document data, grows every day. This work proposes an adaptive block management algorithm that is efficient for dynamic data management and applies it to inverted file searching. The new method speeds up character string retrieval by combining a Uni-gram full-text search, performed first, with a Bi-gram full-text search. This work also proposes a method for extending a static Bi-gram full-text search system into a dynamic one. Moreover, it presents an efficient implementation of the dynamic Bi-gram full-text search system that exploits the adaptive block management structure.
  • Keywords
    Internet; document handling; full-text databases; information retrieval; storage management; string matching; Bi-gram; Bi-gram index; Uni-gram; adaptive block management algorithm; character string retrieval; compact memory space; dynamic data management method; dynamic full-text search system; inverted file searching; Electronic mail; Indexing; Information retrieval; Information science; Intelligent systems; Microcomputers; Proposals; Web and internet services; Writing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computers and Communications, 2004. Proceedings. ISCC 2004. Ninth International Symposium on
  • Print_ISBN
    0-7803-8623-X
  • Type

    conf

  • DOI
    10.1109/ISCC.2004.1358389
  • Filename
    1358389
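
The abstract describes a Bi-gram inverted file that supports dynamic updates. As a rough illustration of that general idea only, the sketch below builds a character bi-gram inverted index in Python that allows documents to be added and removed after construction. It is not the paper's adaptive block management algorithm, whose details the abstract does not give; the class name `BigramIndex`, its methods, and the in-memory dictionary layout are all hypothetical choices made for this example.

```python
from collections import defaultdict


class BigramIndex:
    """Toy dynamic inverted index keyed by character bi-grams.

    Illustrative assumption only: this is a generic bi-gram index,
    not the adaptive block management structure from the paper.
    """

    def __init__(self):
        # bi-gram -> {doc_id -> list of start positions in that document}
        self.postings = defaultdict(dict)
        # doc_id -> original text, kept so a document can be removed later
        self.docs = {}

    def add(self, doc_id, text):
        """Index a document; re-adding an existing id replaces it."""
        if doc_id in self.docs:
            self.remove(doc_id)
        self.docs[doc_id] = text
        for pos in range(len(text) - 1):
            gram = text[pos:pos + 2]
            self.postings[gram].setdefault(doc_id, []).append(pos)

    def remove(self, doc_id):
        """Delete a document and drop posting entries that become empty."""
        text = self.docs.pop(doc_id)
        for gram in {text[i:i + 2] for i in range(len(text) - 1)}:
            entry = self.postings[gram]
            del entry[doc_id]
            if not entry:
                del self.postings[gram]

    def search(self, query):
        """Return sorted ids of documents containing query as a substring."""
        if len(query) < 2:
            # A single character has no bi-gram, so scan the stored texts.
            return sorted(d for d, t in self.docs.items() if query in t)
        grams = [query[i:i + 2] for i in range(len(query) - 1)]
        hits = []
        for doc_id, starts in self.postings.get(grams[0], {}).items():
            for start in starts:
                # The k-th bi-gram of the query must occur at start + k.
                if all(
                    start + k in self.postings.get(g, {}).get(doc_id, ())
                    for k, g in enumerate(grams)
                ):
                    hits.append(doc_id)
                    break
        return sorted(hits)
```

A query is split into overlapping bi-grams, and a document matches when those bi-grams occur at consecutive positions. Because each posting entry is keyed by document id, removing a document touches only the bi-grams that document contains, which is what makes the index updatable without a full rebuild.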