Share Email Print
cover

Proceedings Paper

Integration inconsistency removal in data mining
Author(s): Julius Stuller
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

The technological progress in the areas of the hardware, specially in the field of the (secondary) memories where the ever increasing capacities are paradoxically in the last several years available at ever decreasing prices and smaller physical sizes, and the software, continuously more and more user friendly, efficient and cheaper, together with the general expansion of the computers to almost all human activities, make it easier to realize the integration of many already existing databases. Unfortunately the process of databases integration can be accompanied by many various difficulties and problems. One of them is surely the possible occurrence of the inconsistencies appearing in this process of the integration. As we will see these inconsistencies can occur at various levels and they can be of different types. At the next stage some users go even further and try to get more from the accumulated data through data mining techniques. A data warehouse can be considered as a suitable technology for this purpose. Having in mind the data mining view of a data warehouse, one needs to know the sources of possible inconsistencies when building such a data warehouse in order to eliminate them as much as possible. In the paper we will define several existence conditions under which can occur different types of the inconsistencies in a warehouse and we will propose a classification of these inconsistencies based on the their sources. We will also propose a methodology and a procedure both of which aim at the elimination of these inconsistencies.

Paper Details

Date Published: 6 April 2000
PDF: 11 pages
Proc. SPIE 4057, Data Mining and Knowledge Discovery: Theory, Tools, and Technology II, (6 April 2000); doi: 10.1117/12.381743
Show Author Affiliations
Julius Stuller, Institute of Computer Science (Czech Republic)


Published in SPIE Proceedings Vol. 4057:
Data Mining and Knowledge Discovery: Theory, Tools, and Technology II
Belur V. Dasarathy, Editor(s)

© SPIE. Terms of Use
Back to Top