• DocumentCode
    2718240
  • Title

    Mining the blogosphere to generate local cuisine hotspots for mobile map service

  • Author

    Shih, Chia-Chun ; Peng, Ting-Chun ; Lai, Wei-Shen

  • Author_Institution
    Inst. for Inf. Ind., Taipei, Taiwan
  • fYear
    2009
  • fDate
    1-4 Nov. 2009
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    On-the-go consumers require dynamic information, particularly "word of mouth, " to make better purchase decisions. A popular genre of mobile map services is travel/cuisine, which is a popular topic for bloggers as well. This study attempts to generate local cuisine hotspot maps through blog content mining. The main obstacle in doing this involves recognizing and extracting restaurants and essential restaurant information (i.e., restaurant dishes) in unstructured content. In contrast to traditional Named Entity Recognition (NER) targets, dish name is a promising target that received little attention in previous studies. This study develops methods for recognizing and extracting restaurant names and dish names from Chinese blog posts and achieves satisfactory performance. The extraction results are arranged into hotspots and presented in map views. The extracted information can be fed back to POI (Point of Interest) databases, and thus a brand-new POI database comprising information extracted from User Generated Content (UGC) can be realized.
  • Keywords
    Web sites; catering industry; data mining; pattern classification; blog content mining; blogosphere mining; information extraction; local cuisine hotspot generation; mobile map service; named entity recognition; point of interest database; user generated content; Cities and towns; Data mining; Databases; Information services; Internet; Mining industry; Mouth; Target recognition; User-generated content; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Information Management, 2009. ICDIM 2009. Fourth International Conference on
  • Conference_Location
    Ann Arbor, MI
  • Print_ISBN
    978-1-4244-4253-9
  • Electronic_ISBN
    978-1-4244-4254-6
  • Type

    conf

  • DOI
    10.1109/ICDIM.2009.5356785
  • Filename
    5356785