Title :
Finding Answer Passages with Rank Optimizing Decision Trees
Author_Institution :
Intell. Inf. & Commun. Syst. Group (IICS), Univ. of Hagen, Hagen, Germany
Abstract :
The paper discusses the use of decision trees for probability-based ranking. Emphasis is placed on ranking problems in question answering, where the frequency of correct candidates is very low but a single correct answer at one of the top ranks is often sufficient. Since existing tree learners handle this task poorly, decision tree induction is reformulated in such a way that it directly optimizes a given measure of ranking quality (such as mean reciprocal rank or mean average precision). This change also makes it possible to incorporate a priori knowledge about the positive or negative effect of an attribute on ranking quality. Results are further improved by applying a stratified form of bagging. In a passage reranking task using factoid questions from the QA.CLEF evaluations, the new method outperforms existing tree induction techniques by a large margin.
Keywords :
decision trees; information retrieval; probability; answer passages; decision tree induction; mean average precision; mean reciprocal rank; passage reranking task; probability-based ranking; question answering system; Bagging; Current measurement; Decision trees; Frequency estimation; Induction generators; Information retrieval; Learning systems; Machine learning; Positron emission tomography; Training data; focused retrieval; learning to rank; mean reciprocal rank (MRR); probability estimation tree (PET); question answering (QA);
Conference_Titel :
Machine Learning and Applications, 2009. ICMLA '09. International Conference on
Conference_Location :
Miami Beach, FL
Print_ISBN :
978-0-7695-3926-3
DOI :
10.1109/ICMLA.2009.30