DocumentCode :
2584210
Title :
Collecting positive instances of “instance-of” relationship in the Persian language
Author :
Rastegari, Yousef ; Abolhassani, Hassan ; Zibanezhad, Bahareh ; Sayadiharikandeh, Mohsen
Author_Institution :
South Tehran Branch, Islamic Azad Univ., Tehran, Iran
fYear :
2010
fDate :
7-10 May 2010
Firstpage :
46
Lastpage :
49
Abstract :
Fetching Lexico-Syntactic patterns from text rely on pairs of words (positive instances) that represent the target relation, and finding their simultaneous occurrence in text corpus. Due to existence of WordNet thesaurus (which contains the semantic relationship between words), collecting positive instances is easy. In non-english languages, it´s hard to collect large number of positive instances in various contexts. We investigated some new ideas for collecting them in Persian language and finally run the best one and collected approximately 6,000 positive instances.
Keywords :
natural language processing; ontologies (artificial intelligence); Persian language; WordNet thesaurus; instance-of relationship; lexico-syntactic patterns; nonEnglish language; positive instance collection; text corpus; Dictionaries; Information analysis; Internet; Ontologies; Pattern analysis; Search engines; Semantic Web; Thesauri; Web pages; XML; instance-of relationship; lexico-syntactic patterns; ontology learning and population; persian language; positive instances; semantic web; text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronic Computer Technology (ICECT), 2010 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-7404-2
Electronic_ISBN :
978-1-4244-7406-6
Type :
conf
DOI :
10.1109/ICECTECH.2010.5479991
Filename :
5479991
Link To Document :
بازگشت