
Proceedings Paper

Segmentation of frames in a video sequence using motion and other attributes
Author(s): Edmond Chalom; V. Michael Bove Jr.

Paper Abstract

Motion-compensated video coders typically segment a scene into arbitrary tiles, resulting in a compressed bitstream that is not physically or semantically related to the scene structure. This paper presents a method for segmenting video frames and coding the motion of regions, where the regions are defined in terms of a number of different properties. The goal is a video coder that gives good compression while identifying coherent regions in a manner useful for both human users and automated scene-understanding processes. Both a supervised and an unsupervised clustering algorithm are used to segment an image sequence; both algorithms use multiple features, including motion, texture, position, and color. By utilizing both structure and motion information, we preserve the semantic and structural content of the different regions and simultaneously remove the redundancy between successive frames by describing the motion in each region with a six-parameter affine model. In the supervised clustering algorithm, the first frame is manually segmented and used as training data. Subsequent frames are then classified automatically using a maximum a posteriori (MAP) estimate, with the n-dimensional feature space modeled as jointly Gaussian. The unsupervised algorithm is an iterative process that reassigns each point to the region whose mean, computed from the previous iteration's segmentation, is nearest. In both algorithms, the distance and/or the mean is an n-dimensional measurement, where n is the number of features used.
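
The three ingredients named in the abstract (a six-parameter affine motion model, nearest-mean reassignment, and Gaussian MAP classification) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the ordering of the affine parameters, the toy two-dimensional feature vectors, and the use of a diagonal covariance (a simplification of the jointly Gaussian model in the abstract) are all assumptions made for illustration.

```python
import math

def affine_motion(params, x, y):
    """Displacement (dx, dy) at pixel (x, y) under a six-parameter affine model.
    The parameter ordering (a0..a5) is an assumption, not taken from the paper."""
    a0, a1, a2, a3, a4, a5 = params
    return (a0 + a1 * x + a2 * y, a3 + a4 * x + a5 * y)

def mean_vector(points):
    """Component-wise mean of a non-empty list of n-dimensional feature vectors."""
    n = len(points[0])
    return [sum(p[i] for p in points) / len(points) for i in range(n)]

def sq_dist(p, q):
    """Squared Euclidean distance in the n-dimensional feature space."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def nearest_mean_step(points, labels, k):
    """One unsupervised iteration: compute each region's mean from the previous
    labels, then reassign every point to the region with the nearest mean."""
    means = []
    for r in range(k):
        members = [p for p, lab in zip(points, labels) if lab == r]
        means.append(mean_vector(members) if members else None)  # skip empty regions
    new_labels = []
    for p in points:
        candidates = [(sq_dist(p, m), r) for r, m in enumerate(means) if m is not None]
        new_labels.append(min(candidates)[1])
    return new_labels

def fit_gaussians(points, labels, k):
    """Supervised training step: per-region prior, mean, and per-feature variance.
    Assumes every region has at least one training point. The diagonal covariance
    is a simplification; a jointly Gaussian model would use a full covariance."""
    models = []
    for r in range(k):
        members = [p for p, lab in zip(points, labels) if lab == r]
        mu = mean_vector(members)
        var = [max(sum((p[i] - mu[i]) ** 2 for p in members) / len(members), 1e-6)
               for i in range(len(mu))]  # variance floor avoids division by zero
        models.append((len(members) / len(points), mu, var))
    return models

def map_classify(p, models):
    """Index of the region maximizing log prior + Gaussian log likelihood."""
    def log_posterior(model):
        prior, mu, var = model
        log_lik = sum(-0.5 * math.log(2 * math.pi * v) - (x - m) ** 2 / (2 * v)
                      for x, m, v in zip(p, mu, var))
        return math.log(prior) + log_lik
    return max(range(len(models)), key=lambda r: log_posterior(models[r]))

# Toy feature vectors (hypothetical data): two well-separated groups.
points = [(0.0, 0.1), (0.2, 0.0), (5.0, 5.1), (5.2, 4.9)]

# Unsupervised: start from a poor labeling and apply one reassignment step.
relabeled = nearest_mean_step(points, [0, 1, 0, 1], 2)

# Supervised: treat a hand-labeled "first frame" as training data, then classify.
models = fit_gaussians(points, [0, 0, 1, 1], 2)
region = map_classify((4.8, 5.0), models)
```

In a real coder the feature vectors would combine motion, texture, position, and color measurements per pixel or block, and the affine parameters would be estimated separately for each segmented region. Neither step is shown here.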

Paper Details

Date Published: 17 April 1995
PDF: 12 pages
Proc. SPIE 2419, Digital Video Compression: Algorithms and Technologies 1995, (17 April 1995); doi: 10.1117/12.206362
Edmond Chalom, MIT Media Lab. (United States)
V. Michael Bove Jr., MIT Media Lab. (United States)

Published in SPIE Proceedings Vol. 2419:
Digital Video Compression: Algorithms and Technologies 1995
Arturo A. Rodriguez; Robert J. Safranek; Edward J. Delp, Editor(s)
