
Proceedings Paper

Quantity and unit extraction for scientific and technical intelligence analysis
Author(s): Peter David; Timothy Hawes

Paper Abstract

Scientific and Technical (S&T) intelligence analysts consume huge amounts of data to understand how scientific progress and engineering efforts affect current and future military capabilities. One of the most important types of information S&T analysts exploit is the quantities discussed in their source material. Frequencies, ranges, size, weight, power, and numerous other properties and measurements describing the performance characteristics of systems and the engineering constraints that define them must be culled from source documents before quantified analysis can begin. Automating the process of finding and extracting the relevant quantities from a wide range of S&T documents is difficult because information about quantities and their units is often contained in unstructured text with ad hoc conventions used to convey their meaning. Currently, even a simple task, such as searching for documents discussing RF frequencies in a band of interest, is a labor-intensive and error-prone process. This research addresses the challenges facing development of a document processing capability that extracts quantities and units from S&T data, and shows how Natural Language Processing algorithms can be used to overcome these challenges.
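
To make the band-search task mentioned above concrete, the following is a minimal Python sketch of a naive baseline. It is not the method described in the paper, whose details are not given in this abstract. The function names, the unit table, and the sample sentences are all hypothetical and chosen only for illustration. The sketch finds numbers followed by a hertz unit, converts them to Hz, and keeps documents that mention a frequency inside a band of interest.

```python
import re

# Multipliers to convert a prefixed hertz unit to Hz (illustrative subset,
# an assumption of this sketch rather than anything taken from the paper).
UNIT_TO_HZ = {"hz": 1.0, "khz": 1e3, "mhz": 1e6, "ghz": 1e9, "thz": 1e12}

# A number with an optional decimal part, optional whitespace, then a hertz unit.
# Matching is case-insensitive, so "mhz" is read as megahertz, not millihertz.
FREQ_PATTERN = re.compile(r"(\d+(?:\.\d+)?)\s*([kMGT]?Hz)\b", re.IGNORECASE)


def extract_frequencies_hz(text):
    """Return every frequency mentioned in text, converted to Hz."""
    return [
        float(value) * UNIT_TO_HZ[unit.lower()]
        for value, unit in FREQ_PATTERN.findall(text)
    ]


def mentions_band(text, low_hz, high_hz):
    """Return True if any extracted frequency lies within [low_hz, high_hz]."""
    return any(low_hz <= f <= high_hz for f in extract_frequencies_hz(text))


# Hypothetical sample sentences, invented for this example.
docs = [
    "The radar operates at 9.4 GHz with a peak power of 25 kW.",
    "The datalink uses a 433 MHz carrier.",
    "The antenna weighs 12 kg.",
]

# Keep only documents that mention a frequency between 8 GHz and 12 GHz.
x_band_docs = [d for d in docs if mentions_band(d, 8e9, 12e9)]
```

A pattern like this covers only the simplest convention. Ranges written as "2-4 GHz", spelled-out units, values split across table cells, and other ad hoc notations would be missed or misread, which is the kind of difficulty the abstract attributes to unstructured S&T text.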

Paper Details

Date Published: 3 May 2017
PDF: 7 pages
Proc. SPIE 10207, Next-Generation Analyst V, 102070F (3 May 2017); doi: 10.1117/12.2266039
Peter David, Decisive Analytics Corp. (United States)
Timothy Hawes, Decisive Analytics Corp. (United States)

Published in SPIE Proceedings Vol. 10207:
Next-Generation Analyst V
Timothy P. Hanratty; James Llinas, Editor(s)
