Share Email Print
cover

Proceedings Paper

Captioning of full motion video from unmanned aerial platforms
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

In this work, we aim to address the needs of human analysts to consume and exploit data given the proliferation of overhead imaging sensors. We have investigated automatic captioning methods capable of describing and summarizing scenes and activities by providing textual descriptions using natural language for overhead full motion video (FMV). We have integrated methods to provide three types of outputs: (1) summaries of short video clips; (2) semantic maps, where each pixel is labeled with a semantic category; and (3) dense object description to capture object attributes and activities. We show results obtained from VIRAT and Aeroscapes publicly available datasets.

Paper Details

Date Published: 13 May 2019
PDF: 12 pages
Proc. SPIE 10992, Geospatial Informatics IX, 1099202 (13 May 2019); doi: 10.1117/12.2518163
Show Author Affiliations
Marc Bosch , Johns Hopkins Univ. Applied Physics Lab., LLC (United States)
Accenture Federal Services (United States)
Christopher Gifford, Johns Hopkins Univ. Applied Physics Lab., LLC (United States)
Agata Ciesielski, Johns Hopkins Univ. Applied Physics Lab., LLC (United States)
Scott Almes, Johns Hopkins Univ. Applied Physics Lab., LLC (United States)
Rachel Ellison, Johns Hopkins Univ. Applied Physics Lab., LLC (United States)
Gordon Christie, Johns Hopkins Univ. Applied Physics Lab., LLC (United States)


Published in SPIE Proceedings Vol. 10992:
Geospatial Informatics IX
Kannappan Palaniappan; Peter J. Doucette; Gunasekaran Seetharaman, Editor(s)

© SPIE. Terms of Use
Back to Top
PREMIUM CONTENT
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?
close_icon_gray