DocumentCode
2895150
Title
Application of Information Extraction in Oil Search Engine
Author
Ye Fei-Yue ; Bian Li-ya ; Li Hang
Author_Institution
Sch. of Comput. Eng. & Sci., Shanghai Univ., Shanghai, China
fYear
2009
fDate
7-8 Nov. 2009
Firstpage
98
Lastpage
102
Abstract
With the advancement of internet technology, data on internet have become increasingly huge. Therefore, how to find out the valuable pages from a magnitude of information has been an urgent problem, which should be followed upright. Tradition search engines, based on the query of keywords, have no ability of understanding user demand well. To make up with its deficiency, the concept in which it is combined with technology of information exaction, which adopts simple technology of nature language including technologies of Named Entity Recognition and Hidden Markov Model, syntax analysis, tempo analysis and reasoning technology, is implemented in our search engine for the Oil field is introduced in the paper. Thereby, the vertical search engine is just in possession of the ability of discovering user demand so that it can give back the exact result to users. Besides, it makes use of Regular Expression technology to deal with those which are simple and have no need to make more reasoning to reduce the complicacy of algorithm.
Keywords
Internet; hidden Markov models; inference mechanisms; information retrieval; oil technology; search engines; Internet; hidden Markov model; information extraction; named entity recognition; nature language; oil search engine; reasoning technology; syntax analysis; tempo analysis; Data engineering; Data mining; Hidden Markov models; Information analysis; Internet; Paper technology; Petroleum; Search engines; Telephony; Web pages; Hidden Markov Model; Information Extraction; Named Entity Recognition; Regular Expression;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Information Systems and Mining, 2009. WISM 2009. International Conference on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3817-4
Type
conf
DOI
10.1109/WISM.2009.28
Filename
5368166
Link To Document