• DocumentCode
    3222451
  • Title

    Application of multilayer perceptron network for tagging parts-of-speech

  • Author

    Ahmed ; Raju, S. Bapi ; Chandrasekhar, Pammi V S ; Prasad, M. Krishna

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Muffakham Jah Coll. of Eng. & Technol., Hyderabad, India
  • fYear
    2002
  • fDate
    13-15 Dec. 2002
  • Firstpage
    57
  • Lastpage
    63
  • Abstract
    This paper presents a neural network based part-of-speech tagger that learns to assign correct part-of-speech tags to the words in a sentence. A multilayer perceptron (MLP) network with three-layers is used. The MLP-tagger is trained with error back-propagation learning algorithm. The representation scheme for the input and output of the network is adapted from Ma et al. (1966). The tagger is trained on SUSANNE English tagged-corpus consisting of 156,622 words. The MLP-tagger is trained using 85% of the corpus. Based on the tag mappings learned, the MLP-tagger demonstrated an accuracy of 90.04% on test data that also included words unseen during the training. Results from our experiments suggest that the MLP-tagger combined with the representation scheme adopted here could be a better substitute for traditional tagging approaches. This method shows promise for addressing parts-of-speech tagging problem for Indian language text considering the fact that most of the Indian language corpora, especially tagged ones, are still considerably small in size.
  • Keywords
    backpropagation; feedforward neural nets; multilayer perceptrons; speech recognition; Indian language corpora; Indian language text; SUSANNE English tagged-corpus; error back-propagation learning algorithm; multilayer perceptron network; neural network based part-of-speech tagger; representation scheme; Application software; Computer errors; Computer networks; Computer science; Educational institutions; Hidden Markov models; Multilayer perceptrons; Neural networks; Tagging; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Language Engineering Conference, 2002. Proceedings
  • Print_ISBN
    0-7695-1885-0
  • Type

    conf

  • DOI
    10.1109/LEC.2002.1182291
  • Filename
    1182291