Title :
Challenges in Information Retrieval from Unstructured Arabic Data
Author :
Khalil, Hussein ; Osman, Taha
Author_Institution :
Sch. of Sci. & Technol., Nottingham Trent Univ., Nottingham, UK
Abstract :
The main issue that currently faces research in the information society is the flood of information; a problem exacerbated by the massive diversity of information on the World Wide Web. It has given researchers access to millions of references, articles, news and services. Regardless of geographic location and language used, much of this information is unstructured data. There is a large body of research on mining unstructured Web data, but little effort for Web pages authored in Arabic. This paper investigates the Semantic Web (SW) support for handling documents that are authored and/or annotated in Arabic, and how to bridge the gap between the SW and Natural Language Processing (NLP). Moreover, to improve the intelligent exploration of unstructured documents in the Arabic domain.
Keywords :
data mining; document handling; information retrieval; natural language processing; semantic Web; NLP; SW; Web pages; World Wide Web; document handling; information retrieval; natural language processing; semantic Web; unstructured Arabic data; unstructured Web data mining; Data mining; Educational institutions; Information retrieval; Natural language processing; Ontologies; Ports (Computers); Semantics; NLP; Semantic Web; Text Mining; Information Retrieval; Ontology Engineering; Knowledgebase;
Conference_Titel :
Computer Modelling and Simulation (UKSim), 2014 UKSim-AMSS 16th International Conference on
Print_ISBN :
978-1-4799-4923-6
DOI :
10.1109/UKSim.2014.115