Title of article :
Preprocessing and Morphological Analysis in Text Mining
Author/Authors :
Mohbey، Krishna Kumar نويسنده Samrat Ashok Technological Institute, Vidisha , , Tiwari، Sachin نويسنده Samrat Ashok Technological Institute, Vidisha ,
Issue Information :
روزنامه با شماره پیاپی سال 2011
Abstract :
This paper is based on the preprocessing
activities which is performed by the software or
language translators before applying mining
algorithms on the huge data. Text mining is an
important area of Data mining and it plays a vital role
for extracting useful information from the huge
database or data ware house. But before applying the
text mining or information extraction process,
preprocessing is must because the given data or dataset
have the noisy, incomplete, inconsistent, dirty and
unformatted data. In this paper we try to collect the
necessary requirements for preprocessing. When we
complete the preprocess task then we can easily extract
the knowledgful information using mining strategy.
This paper also provides the information about the
analysis of data like tokenization, stemming and
semantic analysis like phrase recognition and parsing.
This paper also collect the procedures for
preprocessing data i.e. it describe that how the
stemming, tokenization or parsing are applied
Journal title :
International Journal of Electronics Communication and Computer Engineering
Journal title :
International Journal of Electronics Communication and Computer Engineering