• DocumentCode
    1748815
  • Title

    A SOM mapping technique for visualizing documents in a database

  • Author

    Morris, S.A. ; Wu, Z. ; Yen, G.

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Oklahoma State Univ., Stillwater, OK, USA
  • Volume
    3
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    1914
  • Abstract
    A method is introduced for mapping documents, based on document citations, on a two dimensional map for clustering and visualization for the application of technology forecasting. The citation data is used to build an adjacency matrix which describes the document set as an undirected graph. The dimensionality of the adjacency matrix is reduced using principal components analysis. The reduced dimension data is used to train a small rectangular self organizing map (SOM). After training, each document´s input vector is premultiplied by the SOM weight matrix to find a spatial response across the SOM and the centroid of this response is used to map the document. The ordination method is demonstrated on a synthetic data set with good results. Further encouraging results using an actual 118 polymer document dataset are also shown
  • Keywords
    information retrieval; learning (artificial intelligence); principal component analysis; self-organising feature maps; SOM mapping technique; adjacency matrix; clustering; document citations; document database; document set; documents visualisation; ordination method; polymer document dataset; principal components analysis; rectangular self organizing map; spatial response; technology forecasting; two dimensional map; undirected graph; Application software; Data engineering; Data visualization; Frequency; Histograms; Organizing; Principal component analysis; Space technology; Technology forecasting; Visual databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-7044-9
  • Type

    conf

  • DOI
    10.1109/IJCNN.2001.938456
  • Filename
    938456