DocumentCode
1662944
Title
Intelligent Extraction of Knowledge Structures from Natural Language Texts
Author
Kuznetsov, Igor P. ; Kozerenko, Elena B. ; Matskevich, Andrew G.
Author_Institution
Inst. for Inf. Problems, Russian Acad. of Sci., Moscow, Russia
Volume
3
fYear
2011
Firstpage
269
Lastpage
272
Abstract
A semantic linguistic processor which extracts the objects and their links from natural language texts is considered. It is intended for the areas where the automatic formalization of the flows of texts in natural language is required. Peculiarities of the texts are taken into account by linguistic knowledge of the processor: the system can be tuned to various subject areas. We describe the use of this processor for text formalization in different subject areas, such as criminology (summary of incidents, accusatory conclusions, etc.), mass media (documents about terrorist activities), personnel management (autobiographies, resume). Special features of each problem area are examined: the collections of extracted objects, the means for their identification, their connections, occurring contractions, punctuation and special signs, specific character of language constructions, etc. - all these special features were taken into account in the linguistic knowledge development.
Keywords
knowledge acquisition; linguistics; natural language processing; text analysis; accusatory conclusions; autobiographies; automatic formalization; criminology; knowledge structure intelligent extraction; language constructions; linguistic knowledge development; mass media; natural language texts; personnel management; resume; semantic linguistic processor; summary of incidents; text formalization; Conferences; Intelligent agents; data extraction; knowledge engineering; linguistic processor; natural language; semantics;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location
Lyon
Print_ISBN
978-1-4577-1373-6
Electronic_ISBN
978-0-7695-4513-4
Type
conf
DOI
10.1109/WI-IAT.2011.235
Filename
6040857
Link To Document