مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

1937649

Title :

Tagging text with a probabilistic model

Author :

Merialdo, Bernard

Author_Institution :

IBM France Sci. Center, Paris, France

fYear :

1991

fDate :

14-17 Apr 1991

Firstpage :

809

Abstract :

Experiments on the use of a probabilistic model to tag English text, that is, to assign to each word the correct tag (part of speech) in the context of the sentence, are presented. A simple triclass Markov model is used, and the best way to estimate the parameters of this model, depending on the kind and amount of training data that is provided, is found. Two approaches are compared: the use of text that has been tagged by hand and comparing relative frequency counts; and use text without tags and training the model as a hidden Markov process, according to a maximum likelihood principle. Experiments show that the best training is obtained by using as much tagged text as is available, a maximum likelihood training may improve the accuracy of the tagging

Keywords :

Markov processes; probability; speech analysis and processing; English text tagging; hidden Markov process; maximum likelihood training; parameter estimation; probabilistic model; sentence; speech; tagging accuracy; training data; triclass Markov model; Context modeling; Frequency; Hidden Markov models; Maximum likelihood estimation; Parameter estimation; Performance evaluation; Speech; Tagging; Training data; Viterbi algorithm;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on

Conference_Location :

Toronto, Ont.

ISSN :

1520-6149

Print_ISBN :

0-7803-0003-3

Type :

conf

DOI :

10.1109/ICASSP.1991.150460

Filename :

150460

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1937649