DocumentCode :
1577715
Title :
Web data extraction and alignment using tag and value similarity
Author :
Pakojwar, K.A. ; Mangrulkar, R.S. ; Bhujade, V.G.
Author_Institution :
Dept. of Comput. Eng., BDCE, Wardha, India
fYear :
2015
Firstpage :
1
Lastpage :
4
Abstract :
Web databases generate query result pages based on the query entered by the user. Automatically extracting data from these generated result pages is very useful for applications such as data integration, which need to work with multiple web databases. Many techniques are available to extract and align data, but they do not consider unstructured data. This paper presents a novel technique for data extraction and alignment. It collects data from the Internet related to the user's query; this data can be very large and may also be in unstructured form. The aim is to identify only the important data and align it in tabular form so that different data can be compared easily.
Keywords :
Internet; query processing; Web data alignment; Web data extraction; Web databases; tag similarity; value similarity; Conferences; Crawlers; Data mining; Databases; HTML; Technological innovation; Web pages; Crawling; alignment; data extraction
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Innovations in Information, Embedded and Communication Systems (ICIIECS), 2015 International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4799-6817-6
Type :
conf
DOI :
10.1109/ICIIECS.2015.7193044
Filename :
7193044