• DocumentCode
    1824322
  • Title

    Language independent gender classification on Twitter

  • Author

    Alowibdi, Jalal S. ; Buy, Ugo A. ; Yu, Paul

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois at Chicago, Chicago, IL, USA
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    739
  • Lastpage
    743
  • Abstract
    Online Social Networks (OSNs) generate a huge volume of user-originated texts. Gender classification can serve multiple purposes. For example, commercial organizations can use gender classification for advertising. Law enforcement may use gender classification as part of legal investigations. Others may use gender information for social reasons. Here we explore language independent gender classification. Our approach predicts gender using five color-based features extracted from Twitter profiles (e.g., the background color in a user´s profile page). Most other methods for gender prediction are typically language dependent. Those methods use high-dimensional spaces consisting of unique words extracted from such text fields as postings, user names, and profile descriptions. Our approach is independent of the user´s language, efficient, and scalable, while attaining a good level of accuracy. We prove the validity of our approach by examining different classifiers over a large dataset of Twitter profiles.
  • Keywords
    feature extraction; pattern classification; social networking (online); text analysis; OSN; Twitter profiles; advertising; background color; color-based feature extraction; high-dimensional spaces; language dependent gender prediction; language independent gender classification; law enforcement; legal investigations; online social networks; social reasons; text fields; user language; user names; user postings; user profile descriptions; user profile page; user-originated texts; word extraction; Accuracy; Color; Feature extraction; Image color analysis; Quantization (signal); Sorting; Twitter; Application for Social Network; Color-based Features; Language Independent; Social Network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Social Networks Analysis and Mining (ASONAM), 2013 IEEE/ACM International Conference on
  • Conference_Location
    Niagara Falls, ON
  • Type

    conf

  • Filename
    6785785