DocumentCode :
3387573
Title :
Research and implementation of part-of-speech tagging based on Hidden Markov Model
Author :
Youzhi, Zhang
Author_Institution :
Sch. of Comput. & Inf., Anqing Teachers´´ Coll., Anqing, China
Volume :
2
fYear :
2009
fDate :
28-29 Nov. 2009
Firstpage :
26
Lastpage :
29
Abstract :
As separate problems of English, MA, POS, PDR can be considered independent with each other. In a practical research system, they are dependent, solution of the prior one forms the base for processing the next one. We Consider different features of these problems, after a comprehensive study, a divide-and-conqueror strategy is proposed and resolves them separately. First, a knowledge-based method is put forward for the solution of MA. The whole MA processing is completed by many subordinate functions dealing with different particular marks of English words. A strategy of combining the word length with statistic enumeration is developed to distinguish between the periods and abbreviations. Then, an approach combining Rule-based method with Hidden Markov Model (HMM) is put forward for POS tagging. Rule is introduced prior to the HMM approach not only to lower the time cost, but also to resolve the problems that cannot be solved with HMM. Solution to the POS tagging with this approach reports an accuracy of 99.83%.
Keywords :
hidden Markov models; speech processing; english words; hidden Markov model; knowledge-based method; morphological analysis; part-of-speech tagging; statistic enumeration; word length; Computational intelligence; Dictionaries; Hidden Markov models; Information analysis; Probability density function; Random processes; Signal processing; Speech analysis; Speech recognition; Tagging; Hidden Markov Model; morphological analysis; part-of-speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Industrial Applications, 2009. PACIIA 2009. Asia-Pacific Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4606-3
Type :
conf
DOI :
10.1109/PACIIA.2009.5406648
Filename :
5406648
Link To Document :
بازگشت