DocumentCode
1954538
Title
English-Hindi Automatic Word Alignment with Scarce Resources
Author
Venkataramani, Eknath ; Gupta, Deepa
Author_Institution
Dept. of Inf. Technol., Amrita Vishwa Vidyapeetham, Bangalore, India
fYear
2010
fDate
28-30 Dec. 2010
Firstpage
253
Lastpage
256
Abstract
Many automatic word alignment techniques have been so far developed in Natural Language Processing (NLP). However, word alignment between English and Hindi has not progressed much due to two main reasons viz. complex structure of the participating languages and the scarcity of Hindi-language resources. This paper provides a corpus-augmented method of word alignment in which these limitations have been overcome. We see this work as an improved approach in establishing a word alignment algorithm with scarce resources for Indian languages in general and for English-Hindi in particular.
Keywords
natural language processing; word processing; English-Hindi automatic word alignment; Hindi language resource scarcity; Indian languages; corpus augmented method; natural language processing; Computational linguistics; Conferences; Data models; Dictionaries; Hidden Markov models; Training; Training data; Giza++; NATools; Scarce resources; Word alignment; corpus-augmented approach;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2010 International Conference on
Conference_Location
Harbin
Print_ISBN
978-1-4244-9063-9
Type
conf
DOI
10.1109/IALP.2010.54
Filename
5681567
Link To Document