Share Email Print

Proceedings Paper

Automatic segmentation of subfigure image panels for multimodal biomedical document retrieval
Author(s): Beibei Cheng; Sameer Antani; R. Joe Stanley; George R. Thoma
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Biomedical images are often referenced for clinical decision support (CDS), educational purposes, and research. The task of automatically finding the images in a scientific article that are most useful for the purpose of determining relevance to a clinical situation is traditionally done using text and is quite challenging. We propose to improve this by associating image features from the entire image and from relevant regions of interest with biomedical concepts described in the figure caption or discussion in the article. However, images used in scientific article figures are often composed of multiple panels where each sub-figure (panel) is referenced in the caption using alphanumeric labels, e.g. Figure 1(a), 2(c), etc. It is necessary to separate individual panels from a multi-panel figure as a first step toward automatic annotation of images. In this work we present methods that add make robust our previous efforts reported here. Specifically, we address the limitation in segmenting figures that do not exhibit explicit inter-panel boundaries, e.g. illustrations, graphs, and charts. We present a novel hybrid clustering algorithm based on particle swarm optimization (PSO) with fuzzy logic controller (FLC) to locate related figure components in such images. Results from our evaluation are very promising with 93.64% panel detection accuracy for regular (non-illustration) figure images and 92.1% accuracy for illustration images. A computational complexity analysis also shows that PSO is an optimal approach with relatively low computation time. The accuracy of separating these two type images is 98.11% and is achieved using decision tree.

Paper Details

Date Published: 24 January 2011
PDF: 11 pages
Proc. SPIE 7874, Document Recognition and Retrieval XVIII, 78740Z (24 January 2011); doi: 10.1117/12.873685
Show Author Affiliations
Beibei Cheng, Missouri Univ. of Science and Technology (United States)
Sameer Antani, National Library of Medicine, National Institutes of Health (United States)
R. Joe Stanley, Missouri Univ. of Science and Technology (United States)
George R. Thoma, National Library of Medicine, National Institutes of Health (United States)

Published in SPIE Proceedings Vol. 7874:
Document Recognition and Retrieval XVIII
Gady Agam; Christian Viard-Gaudin, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?