DocumentCode :
2037027
Title :
An approximation method for extracting typical classes from semistructured data
Author :
Suzuki, Nobutaka ; Sato, Yoichirou ; Hayase, Michiyoshi
Author_Institution :
Fac. of Comput. Sci. & Syst. Eng., Okayama Prefectural Univ., Japan
fYear :
1999
fDate :
1999
Firstpage :
197
Lastpage :
200
Abstract :
We consider a class extraction problem over semistructured data. A class C is extracted by grouping objects having similar (not necessarily identical) sets of properties into C, where the set of properties of C is the union of those of the objects in C. Let C be an extracted class and o be an object in C. If C has property P but o has no property P value, then P is null within o. An extracted class c is called typical if the number of nulls in C is small against the number of object in C and the number of properties of C. We present the following results. First, we prove that the problem of deciding if a typical class can be extracted from given semistructured data is NP-complete. Second, we present an approximation algorithm for extracting typical classes from given semistructured data. Finally, we briefly discuss a sufficient condition for the approximation algorithm to run efficiently
Keywords :
data structures; database theory; approximation algorithm; class extraction; semistructured data; similar object grouping; Approximation methods; Computational Intelligence Society; Computer science; Data engineering; Data mining; Systems engineering and theory; US Department of Transportation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Applications in Non-Traditional Environments, 1999. (DANTE '99) Proceedings. 1999 International Symposium on
Conference_Location :
Kyoto
Print_ISBN :
0-7695-0496-5
Type :
conf
DOI :
10.1109/DANTE.1999.844960
Filename :
844960
Link To Document :
بازگشت