DocumentCode
3436582
Title
Integration of Patent and Company Databases
Author
Magnani, Matteo ; Montesi, Danilo
Author_Institution
Univ. of Bologna, Bologna
fYear
2007
fDate
6-8 Sept. 2007
Firstpage
163
Lastpage
171
Abstract
In this paper we describe an information integration activity performed on databases containing patent data and company indicators. In particular, we present a detailed case study on company name matching. We show how to choose and tune existing methods for this domain, and describe an efficient implementation for processing large volumes of data. The integration activity involves the application of approximate string matching techniques. We then present the experimental results obtained on real data sets, highlighting the pros and cons of approximate string matching in this specific domain, and analyze the impact of domain knowledge on the results of the matching activity.
Keywords
database management systems; patents; string matching; approximate string matching techniques; company databases; company name matching; information integration activity; patent integration; tune existing methods; Bioinformatics; Companies; Computer science; Java; Merging; Middleware; Relational databases; Spatial databases; Standardization; TV
fLanguage
English
Publisher
IEEE
Conference_Titel
Database Engineering and Applications Symposium, 2007. IDEAS 2007. 11th International
Conference_Location
Banff, Alta.
ISSN
1098-8068
Print_ISBN
978-0-7695-2947-9
Type
conf
DOI
10.1109/IDEAS.2007.4318101
Filename
4318101