DocumentCode :
703744
Title :
TnT tagger with fuzzy rule based learning
Author :
Jacob, Alen ; Babu, Amal ; Reghu Raj, P.C.
Author_Institution :
M.Tech Comput. Linguistics, Gov. Eng. Coll., Palakkad, India
fYear :
2015
fDate :
19-21 Feb. 2015
Firstpage :
1
Lastpage :
5
Abstract :
TnT is an efficient statistical Parts-of-speech (POS) Tagger based on Hidden Markov Model. TnT stands for Trigrams`n´Tags. Viterbi algorithm is used for finding the best tag sequence for a given observation sequence of words. TnT performs well on known word sequences. But, the performance degrades with increase in the number of unknown words. In this paper, we propose a method to overcome this performance degradation using fuzzy rules. Fuzzy rule based model is designed to provide TnT with sufficient information about the tag of unknown words without degrading the performance of TnT. When TnT with fuzzy rule based learning encounters an unknown word, the TnT generates a set of possible tags for the given word, based on the fuzzy rules matched by the word. If the word does not match any fuzzy rule, then the model depends upon the probability distribution of the suffix. This approach guarantees that the performance of TnT will only be improved from its normal performance.
Keywords :
fuzzy set theory; hidden Markov models; learning (artificial intelligence); natural language processing; TnT tagger; Viterbi algorithm; fuzzy rule based learning; hidden Markov model; statistical parts-of-speech tagger; Computational linguistics; Context; Hidden Markov models; Natural language processing; Predictive models; Tagging; Training; EM Algorithm; Fuzzy Membership; Fuzzy Sets; Hidden Markov Model; TnT Tagger; Viterbi Algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, Informatics, Communication and Energy Systems (SPICES), 2015 IEEE International Conference on
Conference_Location :
Kozhikode
Type :
conf
DOI :
10.1109/SPICES.2015.7091511
Filename :
7091511
Link To Document :
بازگشت