Title :
Representative objects: concise representations of semistructured, hierarchical data
Author :
Nestorov, Svetlozar ; Ullman, Jeffrey ; Wiener, Janet ; Chawathe, Sudarshan
Author_Institution :
Dept. of Comput. Sci., Stanford Univ., CA, USA
Abstract :
Introduces the concept of representative objects, which uncover the inherent schema(s) in semi-structured, hierarchical data sources and provide a concise description of the structure of the data. Semi-structured data, unlike data stored in typical relational or object-oriented databases, does not have a fixed schema that is known in advance and stored separately from the data. With the rapid growth of the World Wide Web, semi-structured hierarchical data sources are becoming widely available to the casual user. The lack of external schema information currently makes browsing and querying these data sources inefficient at best, and impossible at worst. We show how representative objects make schema discovery efficient and facilitate the generation of meaningful queries over the data
Keywords :
Internet; data structures; database theory; object-oriented databases; query processing; World Wide Web; browsing; concise data representations; data structure; efficiency; inherent schema discovery; querying; representative objects; semistructured hierarchical data sources; Computer science; Data models; Object oriented databases; Pattern matching; Query processing; Relational databases; Web sites;
Conference_Titel :
Data Engineering, 1997. Proceedings. 13th International Conference on
Conference_Location :
Birmingham
Print_ISBN :
0-8186-7807-0
DOI :
10.1109/ICDE.1997.581741