Title :
Clustering of biomedical scientific papers
Author :
Bravo-Alcobendas, D. ; Sorzano, C.O.S.
Author_Institution :
Bioeng. Lab., Univ. San Pablo, Boadilla del Monte, Spain
Abstract :
In this paper we present a methodology for document clustering based on non-negative matrix factorization (NMF) and ensemble clustering. Thanks to the ensemble clustering the algorithm is less prone to get into a local minimum caused by the initialization of the NMF. Despite the ensemble clustering, the algorithm keeps the semantic interpretability of the NMF and constructs a coocurrence matrix that allows the projection of the documents onto a two-dimensional space suitable for visualization. The algorithm is freely available for the information retrieval community from the bioengineering laboratory Web page.
Keywords :
Web sites; data visualisation; information retrieval; matrix decomposition; medical information systems; pattern clustering; Web page; bioengineering laboratory; biomedical scientific papers; coocurrence matrix; document clustering; ensemble clustering; information retrieval; nonnegative matrix factorization; visualization; Biomedical engineering; Biomedical signal processing; Clustering algorithms; Information retrieval; Iterative algorithms; Matrix decomposition; Publishing; Signal processing algorithms; Visualization; Web pages;
Conference_Titel :
Intelligent Signal Processing, 2009. WISP 2009. IEEE International Symposium on
Conference_Location :
Budapest
Print_ISBN :
978-1-4244-5057-2
Electronic_ISBN :
978-1-4244-5059-6
DOI :
10.1109/WISP.2009.5286530