Title :
Expanded Semantic Graph Representation for Matching Related Information of Interest across Free Text Documents
Author :
Johnson, James R. ; Miller, Alice ; Khan, Latifur ; Thuraisingham, Bhavani
Author_Institution :
ADB Consulting, LLC, Reno, NV, USA
Abstract :
This research proposes an expanded semantic graph definition that serves as a basis for an expanded semantic graph representation and graph matching approach. This representation separates the content and context and adds a number of semantic structures that encapsulate inferred information. The expanded semantic graph approach facilitates finding additional matches, identifying and eliminating poor matches, and prioritizing matches based on how much new knowledge is provided. By focusing on information of interest, doing pre-processing, and reducing processing requirements, the approach is applicable to problems where related information of interest is sought across a massive body of free text documents. Key aspects of the approach include (1) expanding the nodes and edges through inference using DL-Safe rules, abductive hypotheses, and syntactic patterns, (2) separating semantic content into nodes and semantic context into edges, and (3) applying relatedness measures on a node, edge, and sub graph basis. Results from tests using a ground-truthed subset of a large dataset of law enforcement investigator emails illustrate the benefits of these approaches.
Keywords :
content management; graph theory; inference mechanisms; information retrieval; text analysis; DL-Safe rules; abductive hypotheses; expanded semantic graph representation; free text documents; graph edges; graph matching; graph nodes; ground-truthed subset; inference; information of interest; law enforcement investigator email dataset; poor match elimination; poor match identification; processing requirement reduction; semantic content separation; semantic context; semantic structures; subgraph; syntactic patterns; Context; Electronic mail; Law enforcement; Ontologies; Semantics; Syntactics; Vectors; adjacency matrix; directed attributed graph; graph matching; information of interest; node attribute list; relatedness measures; semantic graph; semantic information structure;
Conference_Titel :
Semantic Computing (ICSC), 2012 IEEE Sixth International Conference on
Conference_Location :
Palermo
Print_ISBN :
978-1-4673-4433-3
DOI :
10.1109/ICSC.2012.45