Share Email Print

Proceedings Paper

Software for automatic analysis of image and sound data simultaneously acquired from high-speed videoendocopy
Author(s): Tao Jiang; Shouhua Luo; Yuling Yan
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

High-speed digital videoendoscopy system is emerging as a new clinical tool for voice assessment. The system can acquire images of the vibrating vocal folds with simultaneous recording of voice data from the patient. The laryngeal image-based analysis has been proven valuable for objective and quantitative assessment of voice kinematics in health and disease, and meanwhile, acoustic analysis of voice data could assist in the study of phonatory characteristics and reveal useful information related to laryngeal pathophysiology. Contrast to the hardware acquisition systems, the development of effective software for handling such massive visual/sound data has lagged behind. In this paper, a software system is designed to process the laryngeal image sequences and perform image-based analyses as well as acoustic analyses. Our software contains following modules: (1) Import and view Module - to read AVI video data and sound data (wave file), edit/compile and save selected data, make image montages using DirectShow technology and display the acoustic waveform using DirectSound technology; (2) Image Process Module – to perform frame-by-frame image segmentation to delineate the glottis, to extract the GAW and bilateral vocal fold displacements; (3) Image Analysis Module – to adopt Nyquist plot displays that involves the Hilbert transform based analysis of GAW, and to provide instantaneous frequency and amplitude distributions; (4) Acoustic Analysis Module – to perform Fast Fourier Transform (FFT) and Spectrogram analyses of the imported sound data, to display the plot of the sound data and provide instantaneous frequency and amplitude distributions and Nyqiust plot and (5) Dual GAW and sound wave display module. Upon rigorous testing of this software using clinical data samples we demonstrate the applications of the software to the study of dynamic characteristics of the glottis, which may correlate with voice quality and health condition.

Paper Details

Date Published: 8 March 2013
PDF: 8 pages
Proc. SPIE 8565, Photonic Therapeutics and Diagnostics IX, 856520 (8 March 2013); doi: 10.1117/12.2014251
Show Author Affiliations
Tao Jiang, Santa Clara Univ. (United States)
Shouhua Luo, Southeast Univ. (China)
Yuling Yan, Santa Clara Univ. (United States)

Published in SPIE Proceedings Vol. 8565:
Photonic Therapeutics and Diagnostics IX
Andreas Mandelis; Brian Jet-Fei Wong; Anita Mahadevan-Jansen; Henry Hirschberg M.D.; Hyun Wook Kang; Nikiforos Kollias; Melissa J. Suter; Kenton W. Gregory M.D.; Guillermo J. Tearney M.D.; Stephen Lam; Bernard Choi; Steen J. Madsen; Bodo E. Knudsen M.D.; E. Duco Jansen; Justus F. Ilgner M.D.; Haishan Zeng; Matthew Brenner; Laura Marcu, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?