• DocumentCode
    2005430
  • Title

    Discovering pattern of onomatopoeia used in food reviews

  • Author

    Kato, Akira ; Fukazawa, Yoshiaki ; Mori, Takayoshi

  • Author_Institution
    Grad. Sch. of Interdiscipl., Univ. of Tokyo, Tokyo, Japan
  • fYear
    2012
  • fDate
    20-24 Nov. 2012
  • Firstpage
    502
  • Lastpage
    507
  • Abstract
    In this paper, we investigated onomatopoeia usage pattern in food reviews by proposing LDA (Latent Dirichlet Allocation) based onomatopoeia usage pattern analysis model. We collected total 685 numbers of onomatopoeias which are distributed to 208 food categories from 3,581,808 food reviews of Japanese food review site Tabelog. From the experimental result, we found several patterns how the onomatopoeias are chosen. The onomatopoeia is chosen based on user´s interest on the combination of {location of food, material of the food, cooking method} and {the texture of food, sound when eating, and looks of food people´s status when eating the food}. In addition, we investigate how the precision of the clustering result changes depending on the N (number of onomatopoeia of each food categories). We found that the results of N=30 is better than one of N=100 as large number of onomatopoeia for each food categories like 100 is likely to include onomatopoeias that are irrelevant to food.
  • Keywords
    Web sites; data mining; food products; pattern clustering; reviews; Japanese food review site; LDA based onomatopoeia usage pattern analysis model; Tabelog site; cooking method; food categories; food material; food reviews; food texture; latent Dirichlet allocation; onomatopoeia pattern discovery; people status; user interest; LDA; food review; new word discovery; onomatopoeia; term weighting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
  • Conference_Location
    Kobe
  • Print_ISBN
    978-1-4673-2742-8
  • Type

    conf

  • DOI
    10.1109/SCIS-ISIS.2012.6505214
  • Filename
    6505214