DocumentCode :
2570681
Title :
Linking corpus characteristics to performance of semantic annotation systems for biosystematic descriptions
Author :
Cui, Hong
Author_Institution :
Sch. of Inf. Resources & Libr. Sci., Univ. of Arizona, Tucson, AZ, USA
fYear :
2010
fDate :
16-18 April 2010
Firstpage :
92
Lastpage :
96
Abstract :
Digitizing and repurposing taxonomic descriptions of living organisms is an urgent task facing biodiversity informatics researchers. Semantic annotation is the essential technology that makes the reuse and repurposing of taxonomic descriptions possible. However, the performance of annotation systems often varies by collection. Given the large content and structural variations inherent in different collections of taxonomic descriptions, this paper looks into corpus characteristic measures in an attempt to establish a performance prediction model which, when given a small set of samples, predicts a system's performance for a collection. The prediction model helps deepen our understanding of the strengths and weaknesses of an annotation system, but more importantly provides a valuable decision-making tool for end users. We started this research by using the MARTT (Markuper for Taxonomic Treatments) system as a base. Although a universal performance prediction model for all systems and all corpora may not be possible at this time, we hope more and more individual systems will offer such tools as a regular component in their delivery package.
Keywords :
bioinformatics; data mining; learning (artificial intelligence); text analysis; MARTT; Markuper for Taxonomic Treatments; biosystematic description; corpus characteristics; decision-making tool; performance evaluation; performance prediction; prediction model; semantic annotation; taxonomic description; Accelerometers; Biomedical monitoring; Cardiovascular diseases; Joining processes; Real time systems; Sensor systems; Sun; Thigh; Wearable sensors; Wireless sensor networks; semantic annotation systems
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Bioinformatics and Biomedical Technology (ICBBT), 2010 International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-6775-4
Type :
conf
DOI :
10.1109/ICBBT.2010.5479002
Filename :
5479002