Title :
A fuzzy based approach to stylometric analysis of blogger's age and gender
Author :
Goswami, Suparna ; Shishodia, Mayank Singh
Author_Institution :
Indian Inst. of Technol., Kharagpur, Kharagpur, India
Abstract :
Fuzzy logic deals with partial truth. A fuzzy approach to blog analysis, based on selected feature words, lets us determine the degree to which a blogger's style belongs to a particular age or gender group. Each blog is represented by the set of normalized frequencies of the selected feature words that occur in it. Using the membership values obtained by applying the Fuzzy C-Means (FCM) algorithm to these blog representations, we can describe a blogger's style as belonging weakly, fairly, strongly, or very strongly to a particular class. The advantage of using fuzzy logic for this problem is that a weak membership in one class implies a substantial membership in the other class(es). Hence, when a search or query is carried out, no useful blog is left out of the results for those other class(es).
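The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the whitespace tokenizer, the helper names (normalized_frequencies, fuzzy_c_means, describe_membership), the fuzzifier m = 2, and above all the membership cut-offs for "weakly", "fairly", "strongly", and "very strongly" are assumptions, since the abstract does not specify them.

```python
import numpy as np


def normalized_frequencies(text, feature_words):
    """Represent a blog as the frequency of each feature word divided by the
    total token count. Whitespace tokenization is an assumption."""
    tokens = text.lower().split()
    total = max(len(tokens), 1)  # avoid division by zero on empty text
    return np.array([tokens.count(w) / total for w in feature_words])


def fuzzy_c_means(X, c=2, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Standard Fuzzy C-Means. Returns (centers, U), where U[i, j] is the
    membership of sample i in cluster j and each row of U sums to 1."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], c))
    U /= U.sum(axis=1, keepdims=True)  # random valid initial memberships
    for _ in range(n_iter):
        W = U ** m
        # Cluster centers: membership-weighted means of the samples.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # Euclidean distance from every sample to every center.
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)  # guard against zero distances
        # Membership update: u_ij proportional to d_ij^(-2 / (m - 1)).
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U


def describe_membership(u):
    """Map a membership value to a linguistic label.
    The thresholds are hypothetical; the abstract does not give them."""
    if u < 0.4:
        return "weakly"
    if u < 0.6:
        return "fairly"
    if u < 0.8:
        return "strongly"
    return "very strongly"


if __name__ == "__main__":
    # Toy feature words and blog texts, invented purely for illustration.
    feature_words = ["lol", "haha", "work", "meeting"]
    blogs = [
        "lol that was so funny haha lol",
        "haha lol we laughed all day",
        "long meeting at work today",
        "work was busy another meeting tomorrow",
    ]
    X = np.vstack([normalized_frequencies(b, feature_words) for b in blogs])
    _, U = fuzzy_c_means(X, c=2)
    for blog, row in zip(blogs, U):
        labels = [describe_membership(u) for u in row]
        print(blog[:30], np.round(row, 2), labels)
```

Because each row of the membership matrix sums to 1, a blog that belongs only weakly to one class necessarily carries a substantial membership in the other class(es), which is the property the abstract relies on for retrieval. Note that FCM clusters are unlabeled, so associating a cluster with a specific age or gender group would need a separate step that this sketch does not cover.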
Keywords :
Web sites; fuzzy logic; fuzzy set theory; pattern clustering; text analysis; FCM algorithm; age group; blog analysis; blog representation; blogger age; blogger style; blogger gender; feature words; fuzzy C-means algorithm; fuzzy based approach; fuzzy logic; gender group; membership value; normalized word frequency; stylometric analysis; Decision support systems; Helium; Hybrid intelligent systems; Mercury (metals); Rail to rail outputs; age; blog; clustering; fuzzy c-means; fuzzy logic; gender; stylometrics;
Conference_Title :
Hybrid Intelligent Systems (HIS), 2012 12th International Conference on
Conference_Location :
Pune
Print_ISBN :
978-1-4673-5114-0
DOI :
10.1109/HIS.2012.6421307