Share Email Print

Proceedings Paper

Video-assisted segmentation of speech and audio track
Author(s): Medha Pandit; Yusseri Yusoff; Josef Kittler; William J. Christmas; E. H. S. Chilton
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Video database research is commonly concerned with the storage and retrieval of visual information invovling sequence segmentation, shot representation and video clip retrieval. In multimedia applications, video sequences are usually accompanied by a sound track. The sound track contains potential cues to aid shot segmentation such as different speakers, background music, singing and distinctive sounds. These different acoustic categories can be modeled to allow for an effective database retrieval. In this paper, we address the problem of automatic segmentation of audio track of multimedia material. This audio based segmentation can be combined with video scene shot detection in order to achieve partitioning of the multimedia material into semantically significant segments.

Paper Details

Date Published: 24 August 1999
PDF: 10 pages
Proc. SPIE 3846, Multimedia Storage and Archiving Systems IV, (24 August 1999);
Show Author Affiliations
Medha Pandit, Univ. of Surrey (United Kingdom)
Yusseri Yusoff, Univ. of Surrey (United Kingdom)
Josef Kittler, Univ. of Surrey (United Kingdom)
William J. Christmas, Univ. of Surrey (United Kingdom)
E. H. S. Chilton, Univ. of Surrey (United Kingdom)

Published in SPIE Proceedings Vol. 3846:
Multimedia Storage and Archiving Systems IV
Sethuraman Panchanathan; Shih-Fu Chang; C.-C. Jay Kuo, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?