DocumentCode :
2544136
Title :
2009 CCPR Keynotes
Author :
Uszkoreit, H.
Author_Institution :
Saarland Univ., Saarbrucken, Germany
fYear :
2009
fDate :
4-6 Nov. 2009
Abstract :
Minimally supervised machine learning methods based on bootstrapping are an attractive approach to advanced information extraction. Complex patterns signalling relevant semantic relations in free texts can be detected in this way. However, the potential and limitations of such methods are not yet sufficiently understood. We have systematically analyzed a bootstrapping approach. The starting point of the analysis is a pattern-learning graph, which is a subgraph of the bipartite graph representing all connections between linguistic patterns and relation instances exhibited by the data. It is shown that the performance of such general learning framework for actual tasks is dependent on certain properties of the data and on the seed construction. However, the greatest improvements can be obtained through the systematic learning of negative patterns.
Keywords :
computational linguistics; graph theory; information retrieval; learning (artificial intelligence); text analysis; bipartite subgraph; bootstrapping approach; free text detection; information extraction; minimally supervised machine learning method; positive-negative pattern learning graph; relation extraction; seed construction; semantic relation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-4199-0
Type :
conf
DOI :
10.1109/CCPR.2009.5344160
Filename :
5344160
Link To Document :
بازگشت