Title :
Unsupervised feature selection for linked data
Author :
Nemade, Rachana T. ; Makhijani, R.
Author_Institution :
Dept. of Comput. Sci. & Eng., S.S.G.B. Coll. of Eng. & Tech., Bhusawal, India
Abstract :
The widespread use of social media web sites produces high-dimensional linked data. Feature subset selection, which selects the features that correlate well with the target class, is an effective way to limit the amount and dimensionality of such data. However, high-dimensional linked data from social media web sites typically lacks label information, so feature selection for linked data remains a challenging task. Feature relevance is therefore assessed using the link information. In this paper, we propose UFSLD, an algorithm for unsupervised feature selection from linked data. The UFSLD algorithm works in three steps. In the first step, the interdependency among the linked data is exploited and the relevant features are selected. In the second step, the features from the first step are grouped into clusters using a graph-theoretic clustering method. In the third step, the most representative feature from each cluster is selected to form the final feature subset. A minimum spanning tree (MST) clustering method is used to ensure the efficiency of the algorithm. Experiments compare UFSLD with one unsupervised and one supervised feature selection algorithm to evaluate its effectiveness.
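The three steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how relevance or feature similarity is measured, so the sketch assumes a Laplacian-score-style link smoothness as the relevance measure and 1 - |Pearson correlation| as the distance between features. The names `link_smoothness`, `abs_corr`, `mst_edges`, `ufsld_sketch`, and the parameters `keep` and `cut` are all hypothetical.

```python
from statistics import mean, pvariance


def link_smoothness(X, links, f):
    """Assumed relevance measure: small when linked instances agree on feature f."""
    col = [row[f] for row in X]
    var = pvariance(col)
    if var == 0:
        return float("inf")  # a constant feature carries no information
    diff = sum((col[i] - col[j]) ** 2 for i, j in links)
    return diff / (len(links) * var)


def abs_corr(X, f, g):
    """Absolute Pearson correlation between feature columns f and g."""
    a = [row[f] for row in X]
    b = [row[g] for row in X]
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    na = sum((x - ma) ** 2 for x in a) ** 0.5
    nb = sum((y - mb) ** 2 for y in b) ** 0.5
    return abs(cov / (na * nb)) if na and nb else 0.0


def mst_edges(nodes, weight):
    """Prim's algorithm on the complete graph over `nodes`."""
    nodes = list(nodes)
    in_tree = {nodes[0]}
    edges = []
    while len(in_tree) < len(nodes):
        u, v = min(
            ((a, b) for a in in_tree for b in nodes if b not in in_tree),
            key=lambda e: weight(*e),
        )
        edges.append((u, v))
        in_tree.add(v)
    return edges


def ufsld_sketch(X, links, keep=4, cut=0.5):
    n_features = len(X[0])
    # Step 1 (assumed form): keep the `keep` features that vary least across links.
    score = {f: link_smoothness(X, links, f) for f in range(n_features)}
    relevant = sorted(score, key=score.get)[:keep]

    # Step 2: build an MST over the relevant features and drop long edges.
    def dist(f, g):
        return 1.0 - abs_corr(X, f, g)

    kept = [(u, v) for u, v in mst_edges(relevant, dist) if dist(u, v) <= cut]
    # Connected components of the pruned tree are the feature clusters.
    cluster = {f: {f} for f in relevant}
    for u, v in kept:
        merged = cluster[u] | cluster[v]
        for f in merged:
            cluster[f] = merged
    clusters = {frozenset(c) for c in cluster.values()}
    # Step 3: one representative per cluster, the feature with the best score.
    return sorted(min(c, key=score.get) for c in clusters)


# Toy data: 6 instances, 4 features; features 0 and 1 are nearly redundant,
# feature 3 is constant. `links` lists pairs of linked instance indices.
X = [
    [1, 2, 5, 7],
    [1, 2, 1, 7],
    [2, 4, 4, 7],
    [5, 10, 2, 7],
    [5, 11, 5, 7],
    [6, 12, 1, 7],
]
links = [(0, 1), (1, 2), (3, 4), (4, 5)]
selected = ufsld_sketch(X, links, keep=3, cut=0.5)
print(selected)
```

On this toy input the constant feature is discarded in the first step, the two nearly redundant features fall into one cluster, and only one of them survives as the cluster representative. Pruning the MST rather than the full similarity graph keeps the clustering step to a number of edges linear in the number of retained features, which is one plausible reading of the efficiency claim in the abstract.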
Keywords :
feature selection; graph theory; pattern clustering; social networking (online); MST clustering method; UFSLD algorithm; data dimensionality; feature subset selection; graph-theoretic clustering method; link information feature relevance assessment; linked data; social media Web sites; unsupervised feature selection; Accuracy; Blogs; Clustering algorithms; Feature Selection; Graph-based clustering; Linked data; clustering
Conference_Title :
Recent Advances and Innovations in Engineering (ICRAIE), 2014
Conference_Location :
Jaipur
Print_ISBN :
978-1-4799-4041-7
DOI :
10.1109/ICRAIE.2014.6909131