• DocumentCode
    130876
  • Title

    Adopting Text Clustering in web-based application to facilitate searching of education information

  • Author

    Nilsson, Nicklas ; Yan Liu

  • Author_Institution
    Sch. of Software Eng., Tongji Univ., Shanghai, China
  • fYear
    2014
  • fDate
    27-29 June 2014
  • Firstpage
    393
  • Lastpage
    396
  • Abstract
    Clustering, as a part of the Data Mining field, has been in the center of the research attention for the last decade. It is the task of finding subsets of data that are sharing the same type of attributes. Text Clustering becomes one of the most critical and important solutions in data mining to discover knowledge from fast grow up web data and log files. There are many challenges, algorithms needs to be tailored specific for each domain and scale well with growing data sets. Another interesting aspect is the design of the system. A complex set components need to interact well together. This article proposes an elegant way of clustering university educations based on their text attributes. The solution is integrated directly into a Spring Web Application. A comprehensive architecture is proposed, providing the frameworks needed. Clustering techniques such as Canopy Generation [4] and k-Means are demonstrated.
  • Keywords
    Web sites; data mining; educational administrative data processing; educational institutions; pattern clustering; text analysis; Web-based application; canopy generation; data mining; k-means clustering; knowledge discovery; text clustering; university education information searching; Clustering algorithms; Educational institutions; Indexes; Preforms; Servers; Apache Hadoop; Apache Ma-hout; Canopy Generation; Hibernate; Lucene Index; Spring; k-Means;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering and Service Science (ICSESS), 2014 5th IEEE International Conference on
  • Conference_Location
    Beijing
  • ISSN
    2327-0586
  • Print_ISBN
    978-1-4799-3278-8
  • Type

    conf

  • DOI
    10.1109/ICSESS.2014.6933590
  • Filename
    6933590