DocumentCode
1641854
Title
BioSumm: A novel summarizer oriented to biological information
Author
Baralis, Elena ; Fiori, Alessandro ; Montrucchio, Lorenzo
Author_Institution
Politec. di Torino, Torino
fYear
2008
Firstpage
1
Lastpage
6
Abstract
The availability of increasingly large repositories of biomedical and biological texts requires effective techniques to manage the huge mass of unstructured information they contain. Ad-hoc document summaries, targeted to specific topics, may assist researchers in inferring previously undisclosed knowledge and in biologically validating the results of data mining analyses. This paper presents BioSumm, a flexible framework that analyzes large collections of unclassified biomedical texts and produces ad-hoc summaries oriented to inferring knowledge of gene/protein relationships. Summary generation is driven by a novel grading function, which biases sentence selection by means of an appropriate domain dictionary.
Keywords
bioinformatics; data mining; database management systems; dictionaries; document handling; BioSumm; biological information summarizer; biological text repository; biomedical text repository; data mining analysis; document summaries; domain dictionary; gene-protein relationships; grading function; unstructured information management; Availability; Data analysis; Data mining; Dictionaries; Indexing; Information retrieval; Navigation; Performance analysis; Petroleum; Proteins
fLanguage
English
Publisher
IEEE
Conference_Titel
BioInformatics and BioEngineering, 2008. BIBE 2008. 8th IEEE International Conference on
Conference_Location
Athens
Print_ISBN
978-1-4244-2844-1
Electronic_ISBN
978-1-4244-2845-8
Type
conf
DOI
10.1109/BIBE.2008.4696750
Filename
4696750