DocumentCode
680222
Title
Co-training and ensemble based duplicate detection in adverse drug event reporting systems
Author
Wen-Yang Lin ; Chiao-Feng Lo
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Univ. of Kaohsiung, Kaohsiung, Taiwan
fYear
2013
fDate
18-21 Dec. 2013
Firstpage
7
Lastpage
8
Abstract
Nowadays, many countries have established spontaneous reporting systems (SRSs) to facilitate postmarketing surveillance of listed drugs and collect enough data for detecting unknown adverse drug reactions. Due to data in SRSs coming from different sources of reporters, there heralds the problem of duplicate reporting; even a small amount of duplicate records would bias the detection results. Although lots of works have been conducted on duplicate record detection, very few of them have been devoted to dataset about adverse drug reactions, and none of them have considered the existence of follow-up reports. In this study, we investigated the problem of identifying duplicate ADR reports in SRSs with the presence of follow-ups. We proposed an ensemble and co-training based detection method that is capable of detecting for a given report not only its duplicates but also its initial or earlier linkage cases.
Keywords
drugs; electronic health records; pattern classification; records management; FAERS database; adverse drug event reporting systems; classification methods; cotraining based detection method; duplicate record detection; ensemble based detection method; spontaneous reporting systems; Accuracy; Bayes methods; Classification algorithms; Couplings; Databases; Drugs; Training; Adverse drug events; co-training learning; duplicate records detection; ensemble learning; follow-up;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location
Shanghai
Type
conf
DOI
10.1109/BIBM.2013.6732591
Filename
6732591
Link To Document