Title :
Developing a tagset for Pashto part of speech tagging
Author :
Rabbi, Ihsan ; Khan, Mohammad Abid ; Ali, Rahman
Author_Institution :
Dept. of Comput. Sci., Univ. of Peshawar, Peshawar
Abstract :
While building a machine translation system, the embedded part-of-speech (POS) tagger deserves special attention. The ever first tagset discussed here is created in accordance with the EAGLES guidelines. These guidelines were written for the languages of European Union. They can also be applied to Pashto language. This paper presents the creation process of Pashto tagset, which helps in the development of a POS tagger.
Keywords :
language translation; natural languages; speech recognition; EAGLES guideline; Pashto language; machine translation system; speech tagging; Computational linguistics; Computer science; Educational institutions; Encyclopedias; Guidelines; Natural languages; Speech processing; Tagging;
Conference_Titel :
Electrical Engineering, 2008. ICEE 2008. Second International Conference on
Conference_Location :
Lahore
Print_ISBN :
978-1-4244-2292-0
Electronic_ISBN :
978-1-4244-2293-7
DOI :
10.1109/ICEE.2008.4553909