DocumentCode :
2349833
Title :
Coupling data understanding with software reuse
Author :
Novak, Gordon S., Jr.
Author_Institution :
Department of Computer Sciences, University of Texas at Austin 78712, USA
fYear :
2008
fDate :
13-15 July 2008
Firstpage :
110
Lastpage :
115
Abstract :
Reuse of information requires an ability to understand data gathered from the web and to integrate that data with knowledge and reusable programs. We describe systems that allow a user to capture and understand data from the web and rapidly and easily write programs to analyze the data and combine it with other data. A data grokker parses data, inferring the data types of its fields both from field names and from values of the data itself; this produces both a local set of usable data and a set of data type descriptions that link the data to known types. The known types have knowledge and reusable procedures that can be inherited and used with the data. Web pages that perform calculations or data lookup can be treated as remote procedure calls, allowing calculations, proprietary data and real-time data to be used. We have developed a graphical programming system that can specialize reusable programs for use with data from the web, allowing rapid and easy construction of programs for custom analysis of web data. These systems are illustrated with examples.
Keywords :
Concrete; Data analysis; Data structures; Databases; HTML; Performance analysis; Software reusability; Uniform resource locators; Web pages; XML;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Information Reuse and Integration, 2008. IRI 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV, USA
Print_ISBN :
978-1-4244-2659-1
Electronic_ISBN :
978-1-4244-2660-7
Type :
conf
DOI :
10.1109/IRI.2008.4583014
Filename :
4583014