• DocumentCode
    1571189
  • Title

    Deep into web general vs vertical search engine design based on secure and QoS

  • Author

    Da-quan, Wang ; Tian, Wang ; Lin, Zhang ; Ai-ping, Wu ; Qi-li, Zhou ; Xiao-kai, Wu

  • Author_Institution
    Comput. Coll., Hangzhou Dianzi Univ., Hangzhou, China
  • Volume
    1
  • fYear
    2011
  • Firstpage
    847
  • Lastpage
    851
  • Abstract
    Vertical search engines target specific subject areas, where their coverage of network information is relatively high and is backed by reliable technical and information resources. With clearly targeted search, they effectively compensate for the low expertise and low information coverage of general (comprehensive) search engines on specific topics. A vertical search engine consists mainly of four components: a focused crawler module, an index module, a search module, and a user interface. The crawler module first starts crawling from the specified seed URLs. The content of the crawled web pages is then analyzed, and the required information is extracted into structured data. Chinese word segmentation and indexing are applied to the structured data to generate an index database. Finally, web pages are created through which users submit queries to the search module. Database storage is a prerequisite for building the search function. The foreground is the search engine system's user interface.
  • Keywords
    Internet; Web sites; indexing; information retrieval; natural language processing; quality of service; search engines; Chinese words segmentation; QoS; URL reptiles; Web general search engine design; Web page content analysis; Web pages; Web vertical search engine design; clear targeting search; crawler module; index database; indexing; information extraction; information resources; network information; search engine system; structured data; user interface; Economics; HTML; Indexing; Information filters; Message systems; Web; crawling; database; engine; index;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cross Strait Quad-Regional Radio Science and Wireless Technology Conference (CSQRWC), 2011
  • Conference_Location
    Harbin
  • Print_ISBN
    978-1-4244-9792-8
  • Type

    conf

  • DOI
    10.1109/CSQRWC.2011.6037083
  • Filename
    6037083
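
  • Example
    The pipeline described in the abstract (crawl from seed URLs, extract structured data, segment words, build an index, then serve queries) might be sketched roughly as follows. This is a minimal illustration and not the authors' implementation. The in-memory PAGES table, the function names, and the record fields are all hypothetical, and the regex tokenizer is only a stand-in for a real Chinese word segmenter.

```python
import re
from collections import defaultdict, deque

# Hypothetical in-memory pages standing in for the web: URL -> (HTML, outgoing links).
PAGES = {
    "seed": ("<title>Camera phones</title><p>phone with good camera</p>", ["p1", "p2"]),
    "p1": ("<title>Laptop guide</title><p>light laptop with long battery</p>", ["seed"]),
    "p2": ("<title>Phone battery</title><p>phone battery tips</p>", []),
}

def crawl(seeds, max_pages=100):
    """Breadth-first crawl starting from the seed URLs (no network access here)."""
    queue, seen, fetched = deque(seeds), set(seeds), {}
    while queue and len(fetched) < max_pages:
        url = queue.popleft()
        if url not in PAGES:
            continue  # unknown URL: nothing to fetch in this toy setup
        html, links = PAGES[url]
        fetched[url] = html
        for link in links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return fetched

def extract(html):
    """Turn raw HTML into a small structured record (title and body text)."""
    match = re.search(r"<title>(.*?)</title>", html, re.S)
    title = match.group(1).strip() if match else ""
    body = re.sub(r"<title>.*?</title>", " ", html, flags=re.S)
    body = " ".join(re.sub(r"<[^>]+>", " ", body).split())
    return {"title": title, "body": body}

def segment(text):
    """Assumed stand-in tokenizer; a Chinese word segmenter would go here."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(records):
    """Build an inverted index: term -> set of URLs containing it."""
    index = defaultdict(set)
    for url, record in records.items():
        for term in segment(record["title"] + " " + record["body"]):
            index[term].add(url)
    return index

def search(index, query):
    """Return the URLs that contain every term of the query."""
    terms = segment(query)
    if not terms:
        return []
    hits = set.intersection(*(index.get(term, set()) for term in terms))
    return sorted(hits)

records = {url: extract(html) for url, html in crawl(["seed"]).items()}
index = build_index(records)
print(search(index, "phone battery"))
```

    The stages are kept as separate functions so that any one of them, such as the tokenizer or the storage behind the index, could be swapped out without touching the others, which mirrors the modular layout the abstract describes.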