Regional Information Center for Science and Technology - A New Alignment Algorithm to Identify Definitions Corresponding to Abbreviations in Biomedical Text

DocumentCode :

472429

Title :

A New Alignment Algorithm to Identify Definitions Corresponding to Abbreviations in Biomedical Text

Author :

Xu, Yun ; Wang, ZhiHao ; Zhao, Yuzhong ; Xue, Yu

Author_Institution :

Univ. of Sci. & Technol. of China, Hefei

fYear :

2008

fDate :

23-24 Jan. 2008

Firstpage :

118

Lastpage :

124

Abstract :

The explosive growth of the biomedical literature presents many challenges for biological researchers. One such challenge is the heavy use of abbreviations. Accurately extracting abbreviations and their definitions is very helpful to biologists and also facilitates biomedical text analysis. Among existing approaches, text alignment algorithms are simple and effective, and they require no training data. However, state-of-the-art alignment algorithms cannot identify the definitions of irregular abbreviations (e.g., <CNS1, cyclophilin seven suppressor>). We propose an algorithm analogous to pairwise sequence alignment that assigns a penalty score when two unmatched characters, one from the abbreviation and one from the definition, are aligned; in this way, some irregular abbreviations are found.
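
The penalty idea described in the abstract can be illustrated with a small dynamic-programming sketch. This is not the authors' algorithm: the record gives no scoring details, so the function name `alignment_score`, the score values (`match`, `word_start_bonus`, `mismatch`), and the rule that definition characters may be skipped at no cost are all assumptions made for illustration.

```python
def alignment_score(abbr, definition, match=2, word_start_bonus=1, mismatch=-1):
    """Best score for aligning every abbreviation character, in order,
    against the definition (hypothetical scoring, see lead-in).

    Each abbreviation character is paired with one definition character:
    equal characters earn `match` (plus `word_start_bonus` at the start of
    a word), unequal characters cost `mismatch`. Definition characters may
    be skipped for free.
    """
    a, d = abbr.lower(), definition.lower()
    n, m = len(a), len(d)
    neg_inf = float("-inf")

    # score[i][j]: best score aligning a[:i] against d[:j]
    score = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    for j in range(m + 1):
        score[0][j] = 0  # nothing consumed from the abbreviation yet

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if a[i - 1] == d[j - 1]:
                pair = match
                # Reward matches on the first letter of a word.
                if j == 1 or not d[j - 2].isalnum():
                    pair += word_start_bonus
            else:
                pair = mismatch  # penalised instead of forbidden
            score[i][j] = max(
                score[i][j - 1],             # skip a definition character
                score[i - 1][j - 1] + pair,  # pair the two characters
            )
    return score[n][m]


irregular = alignment_score("CNS1", "cyclophilin seven suppressor")
regular = alignment_score("HMM", "hidden Markov model")
# Forbidding mismatches (an infinite penalty) rejects the irregular pair.
strict = alignment_score("CNS1", "cyclophilin seven suppressor",
                         mismatch=float("-inf"))
```

Under these assumed scores, the "1" in CNS1 has no counterpart in the definition, so a matcher that forbids mismatches finds no alignment, while the penalised version still returns a finite score that could be compared against a threshold.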

Keywords :

information retrieval; medical information systems; text analysis; abbreviation extraction; biomedical literature retrieval; biomedical text analysis; pairwise sequence alignment algorithm; text alignment algorithm; Biology; Data mining; Dynamic programming; Heuristic algorithms; High performance computing; Laboratories; Machine learning; Text analysis; Text mining; Training data;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Knowledge Discovery and Data Mining, 2008. WKDD 2008. First International Workshop on

Conference_Location :

Adelaide, SA

Print_ISBN :

978-0-7695-3090-1

Type :

conf

DOI :

10.1109/WKDD.2008.53

Filename :

4470361

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=472429