Proceedings Paper

Content-based text mapping using multi-dimensional projections for exploration of document collections
Author(s): Rosane Minghim; Fernando Vieira Paulovich; Alneu de Andrade Lopes

Paper Abstract

This paper presents a technique for generating document maps that place similar documents in the same neighborhood. Besides grouping (and separating) documents by their content, the technique runs at a very manageable computational cost. Based on multi-dimensional projection techniques and an algorithm for projection improvement, it produces a surface map that lets the user identify a number of important relationships between documents and sub-groups of documents through visualization and interaction. Visual attributes such as height, color, isolines, and glyphs, as well as aural attributes such as pitch, add dimensions for integrated visual analysis. Exploration and narrowing of focus can be performed with a set of provided tools. This novel text mapping technique, named IDMAP (Interactive Document Map), is fully described in this paper. Results are compared with dimensionality reduction and clustering techniques used for the same purposes. The maps are expected to support a large number of applications that rely on retrieval and examination of document collections, and to complement the type of information offered by current knowledge domain visualizations.
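The general idea of a content-based projection can be illustrated with a minimal Python sketch. It is not the IDMAP algorithm, whose details are given only in the full paper. The sketch assumes bag-of-words term counts, cosine distance, and a generic spring-style iteration standing in for the projection-improvement step; all function names, parameters, and sample documents are hypothetical.

```python
import math
from collections import Counter


def term_vector(text):
    """Bag-of-words term counts for one document (assumed representation)."""
    return Counter(text.lower().split())


def cosine_distance(a, b):
    """1 - cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return 1.0 - dot / norm if norm else 1.0


def distance_matrix(docs):
    """Pairwise content distances between all documents."""
    vecs = [term_vector(d) for d in docs]
    return [[cosine_distance(u, v) for v in vecs] for u in vecs]


def project(dist, iterations=300, step=0.05):
    """Place points in 2D so that 2D distances approximate `dist`.

    Starts from a deterministic layout on the unit circle, then repeatedly
    nudges every point toward neighbors that are too far away and away from
    neighbors that are too close (a generic stand-in, not the paper's method).
    """
    n = len(dist)
    pts = [[math.cos(2 * math.pi * i / n), math.sin(2 * math.pi * i / n)]
           for i in range(n)]
    for _ in range(iterations):
        moves = [[0.0, 0.0] for _ in range(n)]
        for i in range(n):
            for j in range(n):
                if i == j:
                    continue
                dx = pts[j][0] - pts[i][0]
                dy = pts[j][1] - pts[i][1]
                d2 = math.hypot(dx, dy) or 1e-9  # avoid division by zero
                err = d2 - dist[i][j]            # > 0 means too far apart
                moves[i][0] += step * err * dx / d2
                moves[i][1] += step * err * dy / d2
        for i in range(n):
            pts[i][0] += moves[i][0]
            pts[i][1] += moves[i][1]
    return pts


def placed_distance(pts, i, j):
    """Euclidean distance between two placed points."""
    return math.hypot(pts[i][0] - pts[j][0], pts[i][1] - pts[j][1])


# Hypothetical four-document collection: two topics, two documents each.
docs = [
    "projection of document collections for visualization",
    "visualization of document collections using projection",
    "cooking pasta with tomato sauce",
    "tomato sauce recipe for pasta",
]
dist = distance_matrix(docs)
layout = project(dist)
```

In the resulting `layout`, documents that share most of their terms should sit close together, while documents on different topics end up farther apart. A real collection would need term weighting, stop-word removal, and a projection that scales beyond the quadratic pairwise loop used here.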

Paper Details

Date Published: 16 January 2006
PDF: 12 pages
Proc. SPIE 6060, Visualization and Data Analysis 2006, 60600S (16 January 2006); doi: 10.1117/12.650880
Author Affiliations
Rosane Minghim, Instituto de Ciências Matemáticas e de Computação (Brazil)
Fernando Vieira Paulovich, Instituto de Ciências Matemáticas e de Computação (Brazil)
Alneu de Andrade Lopes, Instituto de Ciências Matemáticas e de Computação (Brazil)


Published in SPIE Proceedings Vol. 6060:
Visualization and Data Analysis 2006
Robert F. Erbacher; Jonathan C. Roberts; Matti T. Gröhn; Katy Börner, Editor(s)

© SPIE.