DocumentCode
35296
Title
Fuzzy and Crisp Recursive Profiling of Online Reviewers and Businesses
Author
Lingras, Pawan ; Triff, Matt
Author_Institution
Dept. of Math. & Comput. Sci., St. Mary's Univ., Halifax, NS, Canada
Volume
23
Issue
4
fYear
2015
fDate
Aug. 2015
Firstpage
1242
Lastpage
1258
Abstract
Users of online review sites can benefit from knowing the profiles of the businesses, as well as the profiles of reviewers who reviewed the businesses. This paper describes crisp and fuzzy metaclustering techniques to evolve two recursively defined clustering schemes of both businesses and reviewers in parallel using a real-world dataset supplied by yelp.com. The objective is to profile the businesses and reviewers by grouping them based on similar characteristics. The novelty of the proposed approach is in the fact that the representations of both businesses and reviewers change dynamically throughout the metaclustering process. A business is represented by static information obtained from the database and dynamic information obtained from the clustering of reviewers who reviewed the business. Similarly, the reviewer representation augments the static representation from the database with profiles of businesses that are reviewed by these reviewers. The resulting web-based service provides a facility for users to find similar businesses/reviewers based on the category of the business, rating, number of reviews, and number of check-ins. It also provides a succinct profile of a business or reviewer based on these factors so that the users can put the reviews in context. Since an object can belong to multiple clusters in fuzzy metaclustering, it is possible to absorb some of the extreme groups consisting of outliers in one of the mainstream clusters. As a result, the fuzzy metaclustering leads to more uniformly distributed and moderate profiles.
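The recursive representation described in the abstract can be illustrated with a minimal crisp sketch. This is not the authors' implementation: it assumes plain k-means with a deterministic start, and it assumes the dynamic part of each vector is the share of an object's neighbours falling in each cluster on the other side. The names kmeans, cluster_shares, and metacluster, the toy feature values, and the fixed number of rounds are all hypothetical.

```python
import math

def kmeans(points, k, iters=20):
    """Plain crisp k-means; the first k points seed the centroids (an assumption)."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest centroid.
        labels = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                  for p in points]
        # Recompute each centroid as the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def cluster_shares(neighbour_labels, k):
    """Fraction of an object's neighbours that fall in each of the k clusters."""
    if not neighbour_labels:
        return [0.0] * k
    return [neighbour_labels.count(c) / len(neighbour_labels) for c in range(k)]

def metacluster(biz_static, rev_static, reviews, k=2, rounds=5):
    """reviews is a list of (reviewer_index, business_index) pairs."""
    biz_labels = [0] * len(biz_static)
    rev_labels = [0] * len(rev_static)
    for _ in range(rounds):
        # Reviewer vector: static part plus shares of reviewed business clusters.
        rev_vecs = [
            list(s) + cluster_shares([biz_labels[b] for r, b in reviews if r == i], k)
            for i, s in enumerate(rev_static)
        ]
        rev_labels = kmeans(rev_vecs, k)
        # Business vector: static part plus shares of its reviewers' clusters.
        biz_vecs = [
            list(s) + cluster_shares([rev_labels[r] for r, b in reviews if b == j], k)
            for j, s in enumerate(biz_static)
        ]
        biz_labels = kmeans(biz_vecs, k)
    return biz_labels, rev_labels

# Toy data: static features scaled to [0, 1] (hypothetical values).
biz_static = [[0.9, 0.8], [0.2, 0.1], [0.85, 0.9], [0.1, 0.2]]
rev_static = [[0.9], [0.1], [0.8]]
reviews = [(0, 0), (0, 2), (2, 2), (1, 1), (1, 3)]
print(metacluster(biz_static, rev_static, reviews))
# → ([0, 1, 0, 1], [0, 1, 0])
```

A fuzzy variant would replace the hard labels with membership degrees (for example from fuzzy c-means) and the shares with averaged memberships; that is omitted here to keep the sketch short.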
Keywords
Internet; Web services; business data processing; data mining; fuzzy set theory; pattern clustering; social networking (online); Web-based service; crisp metaclustering techniques; crisp recursive profiling; dynamic information; fuzzy metaclustering techniques; fuzzy recursive profiling; online businesses; online review site users; online reviewers; reviewer representation; static information; Business; Clustering algorithms; Context; Databases; Heuristic algorithms; Rough sets; Vectors; Fuzzy c-means; k-means; metaclustering; profiling; web mining;
fLanguage
English
Journal_Title
Fuzzy Systems, IEEE Transactions on
Publisher
IEEE
ISSN
1063-6706
Type
jour
DOI
10.1109/TFUZZ.2014.2349532
Filename
6880357