Title :
Linguistic text mining for problem reports
Author :
Malin, Jane T. ; Millward, Christopher ; Schwarz, Hansen A. ; Gomez, Fernando ; Throop, David R. ; Thronesbery, Carroll
Author_Institution :
Robot. & Simulation Div., NASA Johnson Space Center, Houston, TX, USA
Abstract :
This paper describes a linguistic text mining tool for analyzing problem reports in aerospace engineering and safety organizations. The semantic trend analysis tool (STAT) helps analysts find and review recurrences, similarities and trends in problem reports. The tool is being used to analyze engineering discrepancy reports at NASA Johnson Space Center. The tool has been augmented with a statistical natural language parser that also resolves parsing gaps and identifies verb arguments and adjuncts. The tool uses an aerospace ontology augmented with features of taxonomies and thesauruses. The ontology defines hierarchies of problem types, equipment types and function types. STAT uses the output of the parser and the aerospace ontology to identify words and phrases in problem report descriptions that refer to types of hazards, equipment damage, performance deviations or functional impairments. Tool performance has been evaluated on 120 problem descriptions from problem reports, with encouraging results.
Keywords :
aerospace engineering; aerospace safety; computational linguistics; data mining; grammars; natural languages; ontologies (artificial intelligence); statistical analysis; text analysis; thesauri; NASA Johnson Space Center; STAT; aerospace engineering discrepancy report; aerospace ontology; aerospace safety organization; equipment damage; functional impairment; linguistic text mining tool; problem report analyzing; semantic trend analysis tool; statistical natural language parser; taxonomy; thesaurus; verb adjunct; verb argument; Aerospace engineering; Aerospace safety; Data mining; Hazards; NASA; Natural languages; Ontologies; Software safety; Text mining; USA Councils; knowledge discovery; natural language understanding; ontology; text mining;
Conference_Titel :
Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
Conference_Location :
San Antonio, TX
Print_ISBN :
978-1-4244-2793-2
Electronic_ISBN :
1062-922X
DOI :
10.1109/ICSMC.2009.5346056