DocumentCode :
323519
Title :
Maximum likelihood and discriminative training of direct translation models
Author :
Papineni, K.A. ; Roukos, S. ; Ward, R.T.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
1
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
189
Abstract :
We consider translating natural language sentences into a formal language using direct translation models built automatically from training data. Direct translation models have three components: an arbitrary prior conditional probability distribution, features that capture correlations between automatically determined key phrases or sets of words in both languages, and weights associated with these features. The features and the weights are selected using a training corpus of matched pairs of source and target language sentences to maximize the entropy or a new discrimination measure of the resulting conditional probability model. We report results in the air travel information system domain and compare the two methods of training
Keywords :
correlation methods; feature extraction; formal languages; language translation; learning systems; maximum entropy methods; natural languages; pattern matching; probability; traffic information systems; air travel information system; correlations; direct translation models; discriminative training; feature selection; formal language; key phrases; maximum entropy; natural language; pattern matching; probability distribution; word sets; Context modeling; Databases; Electronic mail; Entropy; Formal languages; Hidden Markov models; Information systems; Natural languages; Probability; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.674399
Filename :
674399
Link To Document :
بازگشت