• DocumentCode
    1886405
  • Title

    Enhanced CAFÉ indexing algorithm using hashing function

  • Author

    Rashid, NurAini Binti Abdul ; Ghadban, Rana ; Hamdani, Hazrina Yusof ; A-Abdulrazaq, Atheer

  • Author_Institution
    Sch. of Comput. Sci., Univ. Sains Malaysia, Minden, Malaysia
  • Volume
    3
  • fYear
    2010
  • fDate
    15-17 June 2010
  • Firstpage
    1468
  • Lastpage
    1472
  • Abstract
    The rapid growth of genomic databases and the increase in queries against those databases have led to the need for new and efficient search and comparison techniques. Researchers in bioinformatics have concentrated on exploring different approaches to reduce the cost associated with exhaustive search techniques. One of these is the CAFE indexing algorithm, which is considered a fast indexing algorithm in genomic information retrieval. However, there is still room for improvement in the CAFE indexing structure. This research aims to enhance the structure of the CAFE inverted index by using a proper hash function to speed up the retrieval process. The results indicate that retrieval using the enhanced index is faster than retrieval using the original index (CAFÉ). The benefit ratio of the retrieval time using the enhanced CAFE index compared to the retrieval time using the original CAFE index is between 62.8 and 74.9 for one query. However, we found that the memory space for storing the indexes is the same for both algorithms. The reason is that although the interval size decreases, each interval now has an increased number of posting lists.
  • Keywords
    bioinformatics; database management systems; file organisation; indexing; query processing; CAFE indexing algorithm; bioinformatics; genomic databases; genomic information retrieval; hashing function; queries; Bioinformatics; Buildings; DNA; Genomics; Indexing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Technology (ITSim), 2010 International Symposium in
  • Conference_Location
    Kuala Lumpur
  • ISSN
    2155-897
  • Print_ISBN
    978-1-4244-6715-0
  • Type

    conf

  • DOI
    10.1109/ITSIM.2010.5561622
  • Filename
    5561622