DocumentCode :
566561
Title :
An arabic information management messaging system: Using information extraction for the proper flow of information within organizations
Author :
Al-Anesi, Bushra Abdullah ; Thabit, Khalid Omar
Author_Institution :
Fac. of Comput. & Inf. Technol., King Abdulaziz Univ., Jeddah, Saudi Arabia
Volume :
1
fYear :
2012
fDate :
24-26 April 2012
Firstpage :
321
Lastpage :
326
Abstract :
The current method for passing information at King Abdulaziz University (KAU) is by using general announcements; whereby the general announcement is written on a certain topic and then forwarded on to all employees regardless of whether or not each employee has an interest or need for the general announcement. If the university were to decide to only send the general announcement out to only the interested parties, the job for the person assigned this task would be long and arduous. The proposed solution to this problem is to automate this process. This can be done through the use of the Information Management Messaging System (IMMS). The automation of this process will result in a savings in time and money for the organization. KAU´s general announcements are written in the Arabic language. Therefore, the IMMS will make use of the information extraction technology and apply it to Arabic text in the form of organizational general announcements. Little work on Arabic information extraction has been done. There were many obstacles to overcome when applying the first task of information extraction, which is the named-entity recognition task, to the Arabic language. The IMMS, however, performed fairly well when it was applied on Arabic general announcements. The IMMS received an f-measure score of 84% when applied to unstructured text and a score of 97% when applied to semi-structured text. The general announcements were made up of two types of text: unstructured text and semi-structured text. The IMMS recognizes a total of seven named-entities in unstructured text; they are: names of people, organizations, locations, Hijjri dates, Gregorian dates, web addresses and email addresses. In the semi-structured text, the IMMS recognizes a total of nineteen named-entities; they are: organizations, locations, job-titles, faculties, centers, departments, deanships, majors, administrations, job Hijjri dates (two dates), contact addresses (two types), job application Hijjri date- (two dates), contact number, interview Hijjri date, interview Gregorian date, and email addresses. Eight of the named-entities from the semi-structured text received f-measures of 100%.
Keywords :
educational administrative data processing; educational institutions; electronic messaging; information management; natural language processing; text analysis; Arabic general announcement; Arabic information extraction; Arabic information management messaging system; Arabic language; Arabic text; Gregorian dates recognition; King Abdulaziz University; Web addresses recognition; administration recognition; center recognition; contact address recognition; contact number recognition; deanship recognition; department recognition; email address recognition; email addresses recognition; f-measure score; faculties recognition; information passing; interview Gregorian date recognition; interview Hijjri date recognition; job Hijjri date recognition; job application Hijjri dates recognition; job-titles recognition; location recognition; named-entity recognition task; organization recognition; organizational general announcement; people name recognition; semistructured text; unstructured text; Electronic mail; Interviews; Organizations; Postal services; arabic named-entity recognition; information extraction; information management; information messaging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing Technology and Information Management (ICCM), 2012 8th International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-0893-9
Type :
conf
Filename :
6268516
Link To Document :
بازگشت