مرکز منطقه ای اطلاع رساني علوم و فناوري - Agglomerative algorithm to discover semantics from unstructured big data

DocumentCode :

3717301

Title :

Agglomerative algorithm to discover semantics from unstructured big data

Author :

I-Jen Chiang

Author_Institution :

College of Business Administration, Taipei Medical University, Taipei, Taiwan 110

fYear :

2015

Firstpage :

1556

Lastpage :

1563

Abstract :

The paper presents a graph model and an agglomerative algorithm for text document clustering. Given a set of documents, the associations among frequently co-occurring terms in any of the documents naturally form a graph, which can be decomposed into connected components at various levels. Each connected component represents a concept in the collection. These concepts can categorize documents into different semantic classes. The experiments on three different data sets from news, Web, and medical literatures have shown our algorithm is significantly better than traditional clustering algorithms, such as k-means, principal direction division partitioning, AutoClass and hierarchical clustering.

Keywords :

"Itemsets","Association rules","Clustering algorithms","Semantics","Feature extraction","Partitioning algorithms","Big data"

Publisher :

ieee

Conference_Titel :

Big Data (Big Data), 2015 IEEE International Conference on

Type :

conf

DOI :

10.1109/BigData.2015.7363920

Filename :

7363920

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3717301