DocumentCode :
1659685
Title :
Extracting Named Entities at Web Scale for Competitive Intelligence
Author :
Pouilloux, François
Author_Institution :
IXXO, France
Volume :
1
fYear :
2011
Firstpage :
501
Lastpage :
501
Abstract :
Summary form only given. Businesses of all sizes have now realized that the Web is an invaluable resource for competitive intelligence, and consequently for business decision making. But many have trouble collecting targeted and useful information, and are often further overwhelmed by the time required for analysis and monitoring. On the other hand, text mining techniques have become widely used for information analysis in the scientific community in general, and are now ubiquitous in most Web Intelligence fields. With the availability of services such as the Google Prediction API, or mature open source software such as GATE, RapidMiner or NLTK, one can expect a much wider adoption of text mining and associated machine learning techniques by expert developers. But how can these techniques benefit the daily life of a wider business audience? As competitive intelligence is often focused on products, people, customers and competitors, there is added value in systems providing analytics on these entities, whose recognition is fundamental to text mining and semantic analysis and is consequently still under active scientific investigation. In this talk we will tour some of the specific requirements and options for building an efficient Web-based competitive intelligence system with named entity analytics. We will see how some savvy simplifications can help to overcome common issues such as Web scale and Web content noise, and ultimately deliver acceptable usability and value for non-specialist business users.
Keywords :
competitive intelligence; data mining; learning (artificial intelligence); public domain software; text analysis; GATE; Web Intelligence; Web based competitive intelligence system; Web content noise; Web scale; business decision making; business users; information analysis; machine learning technique; open source software; semantic analysis; text mining; Communities; Competitive intelligence; Industries; Logic gates; Software; Text mining
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location :
Lyon
Print_ISBN :
978-1-4577-1373-6
Electronic_ISBN :
978-0-7695-4513-4
Type :
conf
DOI :
10.1109/WI-IAT.2011.284
Filename :
6040721