DocumentCode :
1621730
Title :
Hierarchical document signature: A specialized application of fuzzy signature for document computing
Author :
Manna, Sukanya ; Mendis, B. Sumudu Udaya ; Gedeon, Tom
Author_Institution :
Sch. of Comput. Sci., Australian Nat. Univ., Canberra, ACT, Australia
fYear :
2009
Firstpage :
1083
Lastpage :
1088
Abstract :
We develop document computing procedures for analyzing discourse structures within a document, represented by hierarchical document signatures. A signature is a string of data characterizing a certain case (e.g., the characteristics of a sentence in a document). The place of each datum is fixed within the string, so it carries local value semantics. Fuzzy granulation is a semantic background technique for all kinds of information that originates from human estimation or is recorded by human valuation of numerical data. For the analysis of such data, we suggest developing special procedures that differ from the usual statistical methods. We used a form of fuzzy signature, called a hierarchical document signature, to modularize an unstructured document hierarchically: from document level to sentence level, from sentence level to attribute level, and then to word level. We used word occurrence as the information of the lowest module and aggregated the signature values to find the similarity among modules at the next higher level, giving sentence-pair coherence.
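The hierarchy described in the abstract (document, sentence, attribute, word) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract names neither the attributes nor the aggregation operator, so everything specific here is an assumption: the `FUNCTION_WORDS` split into two attributes, the normalisation of word counts to memberships in [0, 1], the min/max (fuzzy Jaccard) comparison, and the arithmetic mean used to aggregate attribute scores into a sentence-pair score. All function names are hypothetical.

```python
from collections import Counter

# Hypothetical attribute split; the abstract does not name the attributes.
FUNCTION_WORDS = {"a", "an", "the", "of", "in", "on", "is", "and", "to"}


def sentence_signature(sentence):
    """Build {attribute: {word: membership}} for one sentence."""
    groups = {"function": Counter(), "content": Counter()}
    for word in sentence.lower().split():
        key = "function" if word in FUNCTION_WORDS else "content"
        groups[key][word] += 1
    # Assumed normalisation: divide by the largest count in the sentence.
    peak = max((c for g in groups.values() for c in g.values()), default=1)
    return {attr: {w: c / peak for w, c in g.items()}
            for attr, g in groups.items()}


def attribute_similarity(a, b):
    """Fuzzy Jaccard: sum of min memberships over sum of max memberships."""
    vocab = set(a) | set(b)
    union = sum(max(a.get(w, 0.0), b.get(w, 0.0)) for w in vocab)
    if union == 0:
        return 0.0
    inter = sum(min(a.get(w, 0.0), b.get(w, 0.0)) for w in vocab)
    return inter / union


def sentence_pair_coherence(s1, s2):
    """Aggregate attribute-level similarities into one sentence-pair score."""
    sig1, sig2 = sentence_signature(s1), sentence_signature(s2)
    # Skip attributes that are empty in both sentences.
    scores = [attribute_similarity(sig1[k], sig2[k])
              for k in sig1 if sig1[k] or sig2[k]]
    # Assumed aggregation: arithmetic mean over the remaining attributes.
    return sum(scores) / len(scores) if scores else 0.0


def document_signature(text):
    """Top level: one signature per sentence (naive split on full stops)."""
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    return [sentence_signature(s) for s in sentences]
```

Calling `sentence_pair_coherence` on two adjacent sentences gives a score in [0, 1]. Replacing the mean with a weighted or fuzzy-measure-based aggregation would change only that one function.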
Keywords :
data analysis; document handling; fuzzy set theory; statistical analysis; discourse structure; document computing; fuzzy signature; hierarchical document signature; human estimation; semantic background technique; statistical method; Application software; Computational linguistics; Computer science; Cost accounting; Data analysis; Filtering; Fuzzy sets; Humans; Statistical analysis; Text analysis; aggregation; document signature; fuzzy measure; fuzzy signatures; sentence similarity; vector valued fuzzy set;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Fuzzy Systems, 2009. FUZZ-IEEE 2009. IEEE International Conference on
Conference_Location :
Jeju Island
ISSN :
1098-7584
Print_ISBN :
978-1-4244-3596-8
Electronic_ISBN :
1098-7584
Type :
conf
DOI :
10.1109/FUZZY.2009.5277054
Filename :
5277054