Share Email Print

Proceedings Paper

Toward integrating text and images for multimedia retrieval in heterogeneous data mining
Author(s): Sumeet Dua; Vinay Mannava
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

The problem of heterogeneous data mining deals with the computational challenges of searching multimedia data in a unified computational framework that can answer similarity queries of data mining by accurate and efficient means. The advances in data collection methodologies have generated large data-warehouses, in assortment of application domains, including but not limited to, Internet applications for multimedia retrieval and exchange. Heterogeneous data indexing has proven to be a valuable tool for complex data mining in large data domains inherently semi-structured in nature. We propose a solution to integrate the feature vectors of image and text by cooperatively representing them in a multidimensional spatial data structure, which has previously exhibited superior search performance in image database domains. We have evaluated results of content-based similarity queries on the indexing schema independently in images and textual domains. We have then studied and represented the effect of the choice of similarity metric on the similarity queries. We then propose an indexing schema that integrates the feature vectors of text and images to answer integrated queries on the unified heterogeneous data space. An added advantage of the proposed methodology is embodied by the fact that a textual feature vector can query a heterogeneous database to retrieve both text as well as images as query results. This solves the problem of individually querying each data-domain separately and sequentially scanning the integrated database for similarity results. The proposed methodology is time and space efficient, and is capable of answering complex heterogeneous data mining queries in multimedia domains.

Paper Details

Date Published: 24 October 2005
PDF: 12 pages
Proc. SPIE 6015, Multimedia Systems and Applications VIII, 601513 (24 October 2005); doi: 10.1117/12.630193
Show Author Affiliations
Sumeet Dua, Louisiana Tech Univ. (United States)
Vinay Mannava, Louisiana Tech Univ. (United States)

Published in SPIE Proceedings Vol. 6015:
Multimedia Systems and Applications VIII
Anthony Vetro; Chang Wen Chen; C.-C. J. Kuo; Tong Zhang; Qi Tian; John R. Smith, Editor(s)

© SPIE. Terms of Use
Back to Top