Share Email Print

Proceedings Paper

Presentation of structured documents without a style sheet
Author(s): eroxSteven J. Harrington; Elizabeth Wayman
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

In order to present most XML documents for human consumption, formatting information must be introduced and applied. Formatting is typically done through a style sheet, however, it is conceivable that one could wish to view the document without having a style sheet (either because a style sheet does not exist, or is unavailable, or is inappropriate for the display device). This paper describes a method for formatting structured documents without a provided style sheet. The idea is to first analyze the document to determine structures and features that might be relevant to style decisions. A transformation can be constructed to convert the original document to a generic form that captures the semantics that will be expressed through formatting and style. In the second stage styling is applied to the structures and features that have been discovered by applying a pre-defined style sheet for the generic form. The document instance, and if available, the corresponding schema or DTD can be analyzed in order to construct the transformation. This paper will describe the generic form used for formatting and techniques for generating transformations to it.

Paper Details

Date Published: 29 January 2007
PDF: 9 pages
Proc. SPIE 6500, Document Recognition and Retrieval XIV, 65000J (29 January 2007); doi: 10.1117/12.697178
Show Author Affiliations
eroxSteven J. Harrington, Xerox Corp. (United States)
Elizabeth Wayman, Xerox Corp. (United States)

Published in SPIE Proceedings Vol. 6500:
Document Recognition and Retrieval XIV
Xiaofan Lin; Berrin A. Yanikoglu, Editor(s)

© SPIE. Terms of Use
Back to Top