Title :
Segmentation of Legislative Documents Using a Domain-Specific Lexicon
Author :
Hasan, Ismael ; Parapar, Javier ; Blanco, Roi
Author_Institution :
Dept. of Comput. Sci., A Coruna Univ., A Coruna
Abstract :
The amount of legal information is continuously growing. New legislative documents appear everyday in the Web. Legal documents are produced on a daily basis in briefing-format, containing changes in the current legislation, notifications, decisions, resolutions, etc. The scope of these documents includes countries, states, provinces and even city councils. This legal information is produced in a semi-structured format and distributed daily on official Web-sites; however, the huge amount of published information makes difficult for an user to find a specific issue, being lawyers probably the most representative example, who need to access to these sources regularly. This motivates the need of legislative information search engines. Standard general Web search engines return to the user full documents (Web pages typically), within hundreds of pages. As users expect only the relevant part of the document, techniques that recognise and extract these relevant bits of documents are needed to offer quick and effective results. In this paper we present a method to perform segmentation based on domain-specific lexicon information. Our method was tested with a manually tagged data-set coming from different sources of Spanish legislative documents. Results show that this technique is suitable for the task achieving values of 97´85% recall and 95´99% precision.
Keywords :
Internet; document handling; information retrieval; law administration; search engines; domain-specific lexicon information; information retrieval; legal information processing; legislative document segmentation; legislative information search engine; official Web site; Cities and towns; Councils; Data mining; Law; Legal factors; Legislation; Search engines; Testing; Web pages; Web search; Legislative documents; domain lexicon; segmentation;
Conference_Titel :
Database and Expert Systems Application, 2008. DEXA '08. 19th International Workshop on
Conference_Location :
Turin
Print_ISBN :
978-0-7695-3299-8
DOI :
10.1109/DEXA.2008.45