DocumentCode :
1878187
Title :
To stop or not to stop — Experiments on stopword elimination for information retrieval of Gujarati text documents
Author :
Joshi, Harshita ; Pareek, Jyoti ; Patel, Rahul ; Chauhan, K.
fYear :
2012
fDate :
6-8 Dec. 2012
Firstpage :
1
Lastpage :
4
Abstract :
Words that occur frequently in a document but carry little meaning are called stopwords. Identifying and removing stopwords can make document indexing more effective. Mean average precision (MAP) is the metric used to measure the effectiveness of information retrieval (IR) tasks. In this paper, we experiment with eliminating Gujarati stopwords to measure the improvement in ad hoc monolingual information retrieval of Gujarati text documents. Results show that eliminating stopwords improves the MAP values of Gujarati IR.
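As an illustration of the two ideas the abstract names, the following is a minimal sketch of stopword filtering and of the standard MAP computation. It is not the authors' implementation: the function names, the placeholder tokens, and the document identifiers are all hypothetical, and the stopword set shown is not the Gujarati list used in the paper.

```python
def remove_stopwords(tokens, stopwords):
    """Drop every token that appears in the stopword set (order preserved)."""
    return [t for t in tokens if t not in stopwords]


def average_precision(ranked_ids, relevant_ids):
    """Average precision for one query.

    ranked_ids: document ids in the order the system returned them.
    relevant_ids: ids judged relevant for the query.
    """
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            # Precision at this rank, counted only where a relevant doc appears.
            precision_sum += hits / rank
    # Normalise by the total number of relevant documents for the query.
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """Mean of per-query average precision.

    runs: list of (ranked_ids, relevant_ids) pairs, one pair per query.
    """
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


if __name__ == "__main__":
    # Hypothetical tokens and stopword set, for illustration only.
    print(remove_stopwords(["w1", "stop1", "w2"], {"stop1"}))
    # Hypothetical rankings and relevance judgments for two queries.
    runs = [
        (["d1", "d2", "d3"], {"d1", "d3"}),
        (["d4", "d5"], {"d5"}),
    ]
    print(mean_average_precision(runs))
```

Comparing the MAP of a run indexed with stopwords against one indexed without them is the kind of measurement the abstract describes; the sketch only shows how each quantity could be computed.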
Keywords :
indexing; information retrieval; natural language processing; text analysis; Gujarati IR; Gujarati stopwords; Gujarati text documents; MAP; adhoc monolingual information retrieval; document indexing; information retrieval tasks; mean average precision; stopword elimination; Automatic Indexing; Corpus; Gujarati Information Retrieval; Mean Average Precision; Stopwords;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Engineering (NUiCONE), 2012 Nirma University International Conference on
Conference_Location :
Ahmedabad
Print_ISBN :
978-1-4673-1720-7
Type :
conf
DOI :
10.1109/NUICONE.2012.6493219
Filename :
6493219