Title :
Crowdsourced query augmentation through semantic discovery of domain-specific jargon
Author :
AlJadda, Khalifeh ; Korayem, Mohammed ; Grainger, Trey ; Russell, Craig
Author_Institution :
Dept. of Comput. Sci., Univ. of Georgia, Athens, GA, USA
Abstract :
Most work in semantic search has thus far focused upon either manually building language-specific taxonomies/ontologies or upon automatic techniques such as clustering or dimensionality reduction to discover latent semantic links within the content that is being searched. The former is very labor intensive and is hard to maintain, while the latter is prone to noise and may be hard for a human to understand or to interact with directly. We believe that the links between similar user´s queries represent a largely untapped source for discovering latent semantic relationships between search terms. The proposed system is capable of mining user search logs to discover semantic relationships between key phrases in a manner that is language agnostic, human understandable, and virtually noise-free.
Keywords :
data mining; ontologies (artificial intelligence); pattern clustering; query processing; clustering technique; crowdsourced query augmentation; dimensionality reduction technique; domain-specific jargon; language-specific ontology; language-specific taxonomy; latent semantic relationship discovery; semantic discovery; user query; user search log mining; Java; Natural language processing; Noise; Search engines; Search problems; Semantics; Software;
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/BigData.2014.7004310