English-Hindi Automatic Word Alignment with Scarce Resources

Author

Venkataramani, Eknath ; Gupta, Deepa

Author_Institution

Dept. of Inf. Technol., Amrita Vishwa Vidyapeetham, Bangalore, India

fYear

2010

fDate

28-30 Dec. 2010

Firstpage

253

Lastpage

256

Abstract

Many automatic word alignment techniques have been so far developed in Natural Language Processing (NLP). However, word alignment between English and Hindi has not progressed much due to two main reasons viz. complex structure of the participating languages and the scarcity of Hindi-language resources. This paper provides a corpus-augmented method of word alignment in which these limitations have been overcome. We see this work as an improved approach in establishing a word alignment algorithm with scarce resources for Indian languages in general and for English-Hindi in particular.

Keywords

natural language processing; word processing; English-Hindi automatic word alignment; Hindi language resource scarcity; Indian languages; corpus augmented method; natural language processing; Computational linguistics; Conferences; Data models; Dictionaries; Hidden Markov models; Training; Training data; Giza++; NATools; Scarce resources; Word alignment; corpus-augmented approach;

fLanguage

English

Publisher

ieee

Conference_Titel

Asian Language Processing (IALP), 2010 International Conference on

Conference_Location

Harbin

Print_ISBN

978-1-4244-9063-9

Type

conf

DOI

10.1109/IALP.2010.54

Filename

5681567

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=1954538