Title :
QFL for the Web Data Extraction from Multiple Data Sources
Author :
Borle, Shivani W. ; Potgantwar, Amol D.
Author_Institution :
Sandip Inst. of Technol. & Res. Centre, Savitribai Phule Pune Univ., Mahiravani, India
Abstract :
In order to easily query and blend structured data on the web a query formulation language is presented. The core novelty of this is that it permits people with restricted IT skills to explore and query one (or multiple) data sources without prior knowledge about the schema, structure, terminology, or any technological details of these sources. Data source considered may be either an offline or inline schema. This may need several language-design and performance obstacle that I basically need to deal with. I select querying RDF, as it is the most primitive data model, N-gram models of Natural Language, used for predicting the class of the words given in the input query. The words of the query may be classified into noun phrase, verbs and adjectives for understanding the context of the query. Using this I can group the query syntactically as well as semantically. The former demonstrates how MashQL can be used to query and mash up the Data web.
Keywords :
Internet; natural language processing; query processing; MashQL; N-gram model; QFL; Web data extraction; natural language model; query formulation language; structured data blend; structured data query; Context; Data models; Databases; Natural languages; Resource description framework; Search engines; NLP; Query formulation; RDF; Semanticdata web;
Conference_Titel :
Computing Communication Control and Automation (ICCUBEA), 2015 International Conference on
Conference_Location :
Pune
DOI :
10.1109/ICCUBEA.2015.90