Title :
Linear independence in a contingency table
Author :
Tsumoto, Shusaku ; Hirano, Shoji
Author_Institution :
Sch. of Medicine, Shimane Univ., Izumo, Japan
Abstract :
A contingency table summarizes the conditional frequencies of two attributes and shows how these two attributes are dependent on each other with the information on a partition of universe generated by these attributes. Thus, this table can be viewed as a relation between two attributes with respect to information granularity. This paper focuses on several characteristics of linear and statistical independence in a contingency table from the viewpoint of granular computing, which shows that statistical independence in a contingency table is a special form of linear dependence. The discussions also show that when a contingency table is viewed as a matrix, called a contingency matrix, its rank is equal to 1.0. Thus, the degree of independence, rank plays a very important role in extracting a probabilistic model from a given contingency table. Furthermore, it is found that in some cases, partial rows or columns satisfy the condition of statistical independence, which can be viewed as a solving process of Diophatine equations.
Keywords :
data mining; matrix algebra; statistical analysis; Diophatine equations; contingency matrix; contingency table; granular computing; information granularity; linear independence; probabilistic model; statistical independence; Biomedical informatics; Data mining; Equations; Frequency; Information systems; Matrices; Probability; Rough sets; Statistics;
Conference_Titel :
Granular Computing, 2005 IEEE International Conference on
Print_ISBN :
0-7803-9017-2
DOI :
10.1109/GRC.2005.1547371