
Proceedings Paper

The impact of the data archiving file format on the sharing of scientific data for use in popular computational environments
Author(s): Kelly Bennett; James Robertson

Paper Abstract

The U.S. Army Research Laboratory (ARL) conducted an initial study of the performance of XML and HDF5 in three popular computational software environments: MATLAB, Octave, and Python, all of which provide high-level scripting languages and software tools designed for computational processing. Although XML is usable for sharing and exchanging data, the initial results of the study indicated that it has clear limitations in a computational environment. Popular computational tools are unable to handle very large XML-formatted files, which limits the processing of large XML-archived data files. We show the breakdown points of XML-formatted files for various popular computational tools and explore how the performance of XML- and HDF5-formatted files in these environments depends on the hardware, operating system, and mathematical function. This study also explores the inverse file size relationship between HDF5 and XML data files. Several organizations, including ARL, use both XML and HDF5 for archiving and exchanging data. XML is best suited for storing "light" data (such as metadata), and HDF5 is best suited for storing "heavy" scientific data. Integrating and using both XML and HDF5 for data archiving offers the best solution for data providers and consumers to share information for computational and scientific purposes.
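
As a rough illustration of why text markup suits "light" data while a binary container suits "heavy" data, the following minimal Python sketch compares the size of the same numeric samples serialized as XML and as a packed binary buffer. It is not the benchmark used in the study: the packed buffer is only a stand-in for HDF5's binary storage (an assumption made so the sketch needs nothing beyond the standard library), and the element names `dataset` and `sample` and the helper names are hypothetical.

```python
import array
import random
import xml.etree.ElementTree as ET


def xml_size(values):
    """Return the byte size of the values serialized as one XML element per sample."""
    root = ET.Element("dataset")  # hypothetical element name
    for v in values:
        # repr() keeps full double precision in the text form
        ET.SubElement(root, "sample").text = repr(v)
    return len(ET.tostring(root, encoding="utf-8"))


def binary_size(values):
    """Return the byte size of the values packed as native C doubles.

    Stand-in for a binary container such as HDF5; real HDF5 files add
    header and metadata overhead that is not modeled here.
    """
    return len(array.array("d", values).tobytes())


random.seed(0)  # fixed seed so repeated runs use the same samples
values = [random.random() for _ in range(10_000)]

xml_bytes = xml_size(values)
bin_bytes = binary_size(values)

print(f"XML:    {xml_bytes} bytes")
print(f"binary: {bin_bytes} bytes")
print(f"ratio:  {xml_bytes / bin_bytes:.1f}x")
```

The XML form pays for an opening and closing tag plus a decimal text rendering of every sample, whereas the binary form stores a fixed number of bytes per value. Parsing cost and memory use, which the study also considers, are not measured by this sketch.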

Paper Details

Date Published: 4 May 2010
PDF: 12 pages
Proc. SPIE 7687, Active and Passive Signatures, 76870F (4 May 2010);
Kelly Bennett, Army Research Lab. (United States)
James Robertson, Clearhaven Technologies LLC (United States)

Published in SPIE Proceedings Vol. 7687:
Active and Passive Signatures
G. Charmaine Gilbreath; Chadwick T. Hawley, Editor(s)

© SPIE. Terms of Use