DocumentCode
3425899
Title
Gender identification from E-mails
Author
Cheng, Na ; Chen, Xiaoling ; Chandramouli, R. ; Subbalakshmi, K.P.
Author_Institution
Dept. of ECE, Stevens Inst. of Technol., Hoboken, NJ
fYear
2009
fDate
March 30 2009-April 2 2009
Firstpage
154
Lastpage
158
Abstract
In this paper, we investigate the topic of gender identification for short length, multi-genre, content-free e-mails. We introduce for the first time (to our knowledge), psycholinguistic and gender-linked cues for this problem, along with traditional stylometric features. Decision tree and support vector machines learning algorithms are used to identify the gender of the author of a given e-mail. The experiment results show that our approach is promising with an average accuracy of 82.2%.
Keywords
decision trees; electronic mail; learning (artificial intelligence); support vector machines; content-free e-mail; decision tree; gender identification; gender-linked cues; multi-genre e-mail; short length e-mail; stylometric features; support vector machines learning algorithms; Computer mediated communication; Decision trees; Electronic mail; Helium; Internet; Machine learning; Natural languages; Psychology; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Data Mining, 2009. CIDM '09. IEEE Symposium on
Conference_Location
Nashville, TN
Print_ISBN
978-1-4244-2765-9
Type
conf
DOI
10.1109/CIDM.2009.4938643
Filename
4938643
Link To Document