DocumentCode :
397574
Title :
Extended multi-word trigger pair language model using data mining technique
Author :
Chen, Yong ; Chan, Kwok-Ping
Author_Institution :
Dept. of Comput. Sci. & Inf. Syst., Hong Kong Univ., China
Volume :
1
fYear :
2003
fDate :
5-8 Oct. 2003
Firstpage :
262
Abstract :
A good language model is essential to a postprocessing algorithm for recognition systems. Trigger pair model has been used to investigate long distance dependent relationship. However, previous trigger pair model has only one word for its trigger. It is desirable that more words can be observed in the trigger for a better prediction of the triggered word. In this work, we view establishing trigger pair model as mining association rules in a large database and create a multiple words trigger pair model by using an adapted A priori algorithm. The new trigger pair model can be used in the stage of finding best path from a word lattice as traditional trigger pair model can. Specially, it can be used to correct mistakes remaining in the final result as well. Those mistakes would be unavoidable for other language models.
Keywords :
character recognition; data mining; natural languages; probability; adapted A priori algorithm; data mining; database; extended multiword trigger pair language model; mining association rules; postprocessing algorithm; probability; recognition systems; trigger pair model; triggered word; Association rules; Computer science; Data mining; Databases; History; Information systems; Lattices; Natural languages; Pattern recognition; Predictive models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2003. IEEE International Conference on
ISSN :
1062-922X
Print_ISBN :
0-7803-7952-7
Type :
conf
DOI :
10.1109/ICSMC.2003.1243826
Filename :
1243826
Link To Document :
بازگشت