• DocumentCode
    2372287
  • Title

    Using linear regression residual of document vectors in text categorization

  • Author

    Altincay, H.

  • Author_Institution
    Bilgisayar Muhendisligi Bolumu, Dogu Akdeniz Univ., Gazimağusa, Turkey
  • fYear
    2013
  • fDate
    24-26 April 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    The use of linear regression residual for binary text categorization is studied. The main idea is to predict the given test vector using its k nearest neighbors in both positive and negative classes. The predicted vectors are the projections of the test vector onto the subspaces of different classes. The differences between the test vector and the projections are known as the residual vectors. The magnitudes of these vectors show the effectiveness of the neighbors in different classes to represent the test vector. The residuals obtained from both positive and negative classes are cancatenated with the document vectors computed using bag of words approach. Experimental results on three widely used datasets have shown that residual vectors provide improved document representation.
  • Keywords
    data structures; pattern classification; regression analysis; text analysis; vectors; binary text categorization; document representation; document vectors; k nearest neighbors; linear regression residual; residual vectors; Face recognition; Linear regression; Marine vehicles; Radio frequency; Support vector machines; Text categorization; Vectors; document representation; linear regression; residual vector; text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference (SIU), 2013 21st
  • Conference_Location
    Haspolat
  • Print_ISBN
    978-1-4673-5562-9
  • Electronic_ISBN
    978-1-4673-5561-2
  • Type

    conf

  • DOI
    10.1109/SIU.2013.6531161
  • Filename
    6531161