Title :
Content extraction of Quran Interpretation (Tafseer) books for digital content creation and distribution
Author :
Menacer, Mohamed ; Arbaoui, A.
Author_Institution :
NOOR Res. Centre, Taibah Univ., Al-Madinah, Saudi Arabia
Abstract :
This paper presents concepts for extracting relevant content from well-known and prominent Quran interpretation (Tafseer) books. The extracted content can be used efficiently in building, creating and distributing digital and multimedia content on the Quran, Tafseer and Islamic issues. Given the uniqueness of the Quran and the most renowned Tafseer books, extracting relevant information accurately and in a structured manner is a delicate matter because of the important and sensitive issues being dealt with. Natural language processing techniques for automatic information retrieval and extraction are neither a reliable nor a desirable approach in this case, because of the level of inaccuracy and subjectivity involved, which cannot be tolerated for books so highly referenced by Muslims. The aim of this paper is to propose a systematic approach to extracting and collecting, in a structured manner, the most relevant information from Tafseer books that is useful for academic purposes as well as for general use. Al Asfahani's Tafseer book, 'Mufradat fi Gharib al-Quran', has been chosen as the case study. Building more detailed digital content for the book would allow for better search as well as further development in related authoring and indexing. The overall concepts of the content extraction approach are presented in this paper, along with the different phases involved.
Keywords :
electronic publishing; information retrieval; multimedia computing; natural language processing; Al Asfahani Tafseer book; Islamic issues; Mufradat fi Gharib al-Quran; Muslims; Quran interpretation Tafseer books; Quran issues; Tafseer issues; automatic information extraction; automatic information retrieval; content extraction; content extraction approach; digital content creation; digital content details; digital content distribution; distributing digital content; multimedia content; natural language processing techniques; Authentication; Buildings; Data mining; Databases; Educational institutions; Internet; Multimedia communication; Content distribution; Content extraction; Digital content creation; Quran; Quran Sciences; Tafseer books; multimedia;
Conference_Title :
Computer and Information Technology (WCCIT), 2013 World Congress on
Conference_Location :
Sousse
Print_ISBN :
978-1-4799-0460-0
DOI :
10.1109/WCCIT.2013.6618750