DocumentCode :
2369981
Title :
NLP-NG — A new NLP system for biomedical text analysis
Author :
Futrelle, Robert P. ; Satterley, Jeff ; McCormack, Tim
Author_Institution :
Biol. Knowledge Lab., Northeastern Univ., Boston, MA, USA
fYear :
2009
fDate :
1-4 Nov. 2009
Firstpage :
296
Lastpage :
301
Abstract :
NLP-NG is a new NLP system consisting of three components: NG-CORE (language processing), NG-DB (database management), and NG-SEE (interactive visualization and entry). The ultimate goal of NLP-NG is to produce information retrieval systems in which users can choose full-text schemas and add specific items to focus their queries. Schemas are created by a normalization process that elides adjunctive constructions and replaces items with prototypes. Biomedical text contains domain-specific constructions, which normalization reveals. NLP-NG is based on Construction Grammar. Computationally, all representations are integer-based, allowing efficient storage, indexing, and retrieval. NG-SEE, an Ajax web browser client, allows developers, linguists, and users to view a corpus and modify its properties. NLP-NG uses a 300-million-word BioMed Central corpus. NLP-NG does not focus on specific strategies for extracting limited classes of information from papers. Instead, it is a universal approach that can codify a wide variety of text in papers.
Keywords :
full-text databases; grammars; medical computing; natural language processing; text analysis; Ajax; NG-CORE; NG-DB; NG-SEE; NLP system; NLP-NG; Web browser client; biomedical text analysis; construction grammar; corpus; database management; information retrieval systems; interactive visualization and entry; linguists; natural language processing; normalization process; Biology; Biomedical computing; Data mining; Frequency; Laboratories; Proteins; Prototypes; Statistical analysis; Text analysis; Visualization;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-5121-0
Type :
conf
DOI :
10.1109/BIBMW.2009.5332110
Filename :
5332110