DocumentCode :
2313676
Title :
A Survey on Web Content Mining and Extraction of Structured and Semistructured Data
Author :
Pol, Kshitija ; Patil, Nita ; Patankar, Shreya ; Das, Chhaya
Author_Institution :
Datta Meghe Coll. of Eng., Airoli, Mumbai
fYear :
2008
fDate :
16-18 July 2008
Firstpage :
543
Lastpage :
546
Abstract :
With the research in information retrieval and phenomenal growth of the Web, todaypsilas Websites have become a key communication and information medium for various organizations. It also offers an unprecedented opportunity and challenges to data mining. Various techniques are available to extract useful data from the web. It is very important for the users to utilize this information effectively which helps them to understand the structure of information on the Web more deeply and precisely. This paper conducts a survey of how Web content mining plays an efficient tool in extracting structured and semi structured data and mining them into useful knowledge.
Keywords :
Internet; data mining; data structures; feature extraction; information retrieval; Web content mining; Web extraction; data mining; information Retrieval; semistructured data; Content based retrieval; Crawlers; Data engineering; Data mining; Educational institutions; Explosives; Information retrieval; Search engines; Web mining; Web pages; Semi Structured data; Web Content mining; structured data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Trends in Engineering and Technology, 2008. ICETET '08. First International Conference on
Conference_Location :
Nagpur, Maharashtra
Print_ISBN :
978-0-7695-3267-7
Electronic_ISBN :
978-0-7695-3267-7
Type :
conf
DOI :
10.1109/ICETET.2008.251
Filename :
4579960
Link To Document :
بازگشت