Share Email Print
cover

Proceedings Paper

Schema extraction and levelization for XML data
Author(s): Jong P. Yoon; Sung-Rim Kim
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

XML is a new standard for representing and exchanging information on the Internet. An XML data is a data that is tagged by XML elements. Such an XML data can be retrieved not only by a Boolean connection with keywords on the Internet. Keyword-based information retrieval does not precisely result in user requests partly because user requests cannot be properly conveyed. Either too many or too few matches are produced. It is not trivial to formulate what to retrieve for a good-sized query-result. In conventional approaches, a database schema is useful for users to formulate queries and for query processing. Likewise, this paper proposes a method of schema extraction for XML data collection. Obtaining one single schema is not sufficient to serve for the good size of information retrieval and adaptively for the various requests from Internet users. To support this, schemas are then levelized with respect to the frequency of topological data structures in a database. The topological structural information of these schemas is used to formulate queries and further to rewrite queries for relaxation and restriction. Without modification, the method proposed in this paper is used not only for multimedia XML data collections but also for general XML databases.

Paper Details

Date Published: 27 March 2001
PDF: 10 pages
Proc. SPIE 4384, Data Mining and Knowledge Discovery: Theory, Tools, and Technology III, (27 March 2001); doi: 10.1117/12.421065
Show Author Affiliations
Jong P. Yoon, Univ. of Louisiana/Lafayette (United States)
Sung-Rim Kim, Sookmyung Womens' Univ. (South Korea)


Published in SPIE Proceedings Vol. 4384:
Data Mining and Knowledge Discovery: Theory, Tools, and Technology III
Belur V. Dasarathy, Editor(s)

© SPIE. Terms of Use
Back to Top