Title :
Twitter part-of-speech tagging using pre-classification Hidden Markov model
Author :
Sun, Shichang ; Liu, Hongbo ; Lin, Hongfei ; Abraham, Ajith
Abstract :
Hidden Markov models (HMM) have been widely used in natural language processing (NLP), especially in syntactic level applications, which appears naturally as short-range-dependent sequence recognition problems. But the structure of HMM limits the usage of global knowledge including the sentiment analysis of the text, which has become an increasingly popular research topic in NLP now. In this paper, we propose a novel treatment of HMM model to use the result of sentimental subjectivity analysis in syntactic level task, i.e. part-of-speech (POS) tagging. The subjectivity information is introduced as a pre-classification procedure into the interval-type HMM. The subjectivity degree of the testing sentence is used as a combination factor to choose an appropriate value from the interval. Experiments results on public tagging data sets shows that the proposed approach enhanced the performance of POS tagging.
Keywords :
hidden Markov models; natural language processing; social networking (online); text analysis; HMM; NLP; POS tagging; Twitter part-of-speech tagging; global knowledge; interval-type HMM; natural language processing; preclassification hidden Markov model; public tagging data sets; sentimental subjectivity analysis; short-range-dependent sequence recognition problems; syntactic level applications; testing sentence subjectivity degree; text sentiment analysis; Analytical models; Data models; Hidden Markov models; Natural language processing; Niobium; Tagging; Training; Hidden Markov models; Naive Bayes model; Part-of-speech tagging; Subjectivity analysis;
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-1713-9
Electronic_ISBN :
978-1-4673-1712-2
DOI :
10.1109/ICSMC.2012.6377881