Share Email Print
cover

Proceedings Paper

Mathematical Foundation of Association Rules: mining generalized associations by linear inequalities
Author(s): Tsau Young Lin; Hugo Shi
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Associations (not necessarily in rule forms) as patterns in data are critically analyzed. We build theory based only on what data says, and no other implicit assumptions. Data mining is regarded as a deductive science: First, we observe that isomorphic relations have isomorphic associations. Somewhat a surprise, such a simple observation turns out to have far reaching consequences. It implies that associations are properties of an isomorphic class, not an individual relation. A similar conclusion can be made for probability theory based on item counting, hence it is not adequate to characterize the "interesting-ness," since the latter one is a property of an individual relation. As a by-product of this analysis, we find that all generalized associations can be found by simply solving a set of integral linear inequalities - this is a very striking result. Finally, we observe that from the structure of the relation lattice, we may conclude that random sampling may loose substantial information about patterns.

Paper Details

Date Published: 21 March 2003
PDF: 10 pages
Proc. SPIE 5098, Data Mining and Knowledge Discovery: Theory, Tools, and Technology V, (21 March 2003); doi: 10.1117/12.498863
Show Author Affiliations
Tsau Young Lin, San Jose State Univ. (United States)
Hugo Shi, Univ. of California/Berkeley (United States)


Published in SPIE Proceedings Vol. 5098:
Data Mining and Knowledge Discovery: Theory, Tools, and Technology V
Belur V. Dasarathy, Editor(s)

© SPIE. Terms of Use
Back to Top