[Back]


Publications in Scientific Journals:

R. Piché, M. Järvenpää, E. Turunen, M. Simunek:
"Bayesian analysis of GUHA hypotheses";
Journal of Intelligent Information Systems, 42 (2014), 1; 47 - 73.



English abstract:
The LISp-Miner system for data mining and knowledge discovery uses the GUHA method to comb through a large data base and finds 2 × 2 contingency tables that satisfy a certain condition given by generalised quantifiers and thereby suggest the existence of possible relations between attributes. In this paper, we show how a more detailed interpretation of the data in the tables that were found by GUHA can be obtained using Bayesian statistical methods. Using a multinomial sampling model and Dirichlet prior, we derive posterior distributions for parameters that correspond to GUHA generalised quantifiers. Examples are presented illustrating the new Bayesian post-processing tools implemented in LISp-Miner. A statistical model for the analysis of contingency tables for data from two subpopulations is also presented.

Keywords:
Data mining, GUHA, Contingency table, Bayesian statistics, 62F15, 62H17, 62-07


"Official" electronic version of the publication (accessed through its Digital Object Identifier - DOI)
http://dx.doi.org/10.1007/s10844-013-0255-6


Created from the Publication Database of the Vienna University of Technology.