DocumentCode
1172971
Title
The Babylon project: toward an extensible text-mining platform
Author
Merrill, Gary H.
Volume
5
Issue
2
fYear
2003
Firstpage
23
Lastpage
30
Abstract
GlaxoSmithKline committed to adopting Verity, an indexing and search engine, to retrieve knowledge within the company intranet. Verity was attractive because it supported taxonomy-oriented browsing and concept-enhanced search. To tap the power in Verity´s features, GlaxoSmithKline´s Data Exploration Sciences began the Babylon project in late 2002 with two primary goals: develop a text-mining platform that would be the foundation for a variety of text-mining applications and would be extensible to a variety of domains; and develop one or more significant prototype applications on that platform. The first prototype application considered was to mine reports on adverse drug events to identify trends, drug and reaction events, or drug-drug interactions that might not be apparent in nontextual structured databases. The paper discusses the Babylon project.
Keywords
biology computing; data mining; hypermedia markup languages; information retrieval; intranets; search engines; very large databases; Babylon project; Data Exploration Sciences; GlaxoSmithKline; Verity; XML topic maps; adverse drug events; concept-enhanced search; drug interactions; extensible text-mining platform; indexing; information retrieval; intranet; prototype application; search engine; taxonomy-oriented browsing; Genetics; Information retrieval; Knowledge management; Laboratories; Ontologies; Prototypes; Safety; Search engines; Taxonomy; Text mining;
fLanguage
English
Journal_Title
IT Professional
Publisher
ieee
ISSN
1520-9202
Type
jour
DOI
10.1109/MITP.2003.1191789
Filename
1191789
Link To Document