• DocumentCode
    725743
  • Title

    Distributed LSI: Parallel preprocessing and vector sharing

  • Author

    Bradford, Roger B.

  • Author_Institution
    Agilex Technol., Chantilly, VA, USA
  • fYear
    2015
  • fDate
    27-29 May 2015
  • Firstpage
    191
  • Lastpage
    191
  • Abstract
    The technique of latent semantic indexing (LSI) has a wide variety of uses in intelligence and security informatics applications. LSI processing generates high-dimensional vectors that are used to represent individual items of interest and the features of which those items are composed. Historically, LSI representation vectors have been generated in a single computing environment (workstation, server, or VM instance). However, this is not a requirement. This paper describes two approaches to distributing elements of LSI processing. The first, parallelization of the preprocessing stage, can significantly decrease the time required for creation of LSI indexes. The second, vector sharing, can dramatically improve security in distributed LSI environments.
  • Keywords
    indexing; parallel processing; security of data; LSI processing; LSI representation vectors; distributed LSI environment security; distributed LSI technique; intelligence application; latent semantic indexing; parallel preprocessing; security informatics application; vector sharing; Conferences; Indexing; Large scale integration; Patents; Security; Semantics; LSI; folding-in; information sharing; latent semantic indexing; vector export; vector sharing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligence and Security Informatics (ISI), 2015 IEEE International Conference on
  • Conference_Location
    Baltimore, MD
  • Print_ISBN
    978-1-4799-9888-3
  • Type

    conf

  • DOI
    10.1109/ISI.2015.7165973
  • Filename
    7165973