• DocumentCode
    1954538
  • Title

    English-Hindi Automatic Word Alignment with Scarce Resources

  • Author

    Venkataramani, Eknath ; Gupta, Deepa

  • Author_Institution
    Dept. of Inf. Technol., Amrita Vishwa Vidyapeetham, Bangalore, India
  • fYear
    2010
  • fDate
    28-30 Dec. 2010
  • Firstpage
    253
  • Lastpage
    256
  • Abstract
    Many automatic word alignment techniques have been developed so far in Natural Language Processing (NLP). However, word alignment between English and Hindi has not progressed much, for two main reasons: the complex structure of the participating languages and the scarcity of Hindi-language resources. This paper presents a corpus-augmented method of word alignment that overcomes these limitations. We see this work as an improved approach to establishing a word alignment algorithm with scarce resources for Indian languages in general and for English-Hindi in particular.
  • Keywords
    natural language processing; word processing; English-Hindi automatic word alignment; Hindi language resource scarcity; Indian languages; corpus augmented method; Computational linguistics; Conferences; Data models; Dictionaries; Hidden Markov models; Training; Training data; Giza++; NATools; Scarce resources; Word alignment; corpus-augmented approach
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Asian Language Processing (IALP), 2010 International Conference on
  • Conference_Location
    Harbin
  • Print_ISBN
    978-1-4244-9063-9
  • Type

    conf

  • DOI
    10.1109/IALP.2010.54
  • Filename
    5681567