• DocumentCode
    2106457
  • Title

    Feature Extraction in Text Clustering Based on Theme

  • Author

    Shi, Nianyun ; Jing, Kong ; Xu, Jiuyun ; Duan, Yongxiang ; Li, Chunhua

  • Author_Institution
    Coll. of Comput. Sci. & Commun. Eng., China Univ. of Pet., Dongying
  • fYear
    2008
  • fDate
    21-22 Dec. 2008
  • Firstpage
    632
  • Lastpage
    635
  • Abstract
    A new method is proposed, which refers to feature extraction based on oil theme of concept hierarchy to improve the weights between the high-frequency words and low-frequency words in the documents, and we use hash technology to improve the limitations of the theme of concept hierarchy. The method can identify the theme of texts accurately, and enhance the characteristic expression of texts. To a certain extent, it has resolved the semantic problem in specific areas.
  • Keywords
    feature extraction; pattern clustering; petroleum industry; text analysis; concept hierarchy; feature extraction; oil theme; petrochemical industry; text clustering; Application software; Computer science; Degradation; Educational institutions; Feature extraction; Frequency; Information technology; Ontologies; Petrochemicals; Petroleum; Feature Extraction; Text Clustering; Theme of Concept Hierarchy; Weight;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Technology Application Workshops, 2008. IITAW '08. International Symposium on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3505-0
  • Type

    conf

  • DOI
    10.1109/IITA.Workshops.2008.180
  • Filename
    4732018