Share Email Print

Proceedings Paper

Semantic video segmentation using both appearance and geometric information
Author(s): Jihwan Woo; Kris Kitani; Sehoon Kim; Hantak Kwak; Woosung Shim
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

The segmentation is the first step and core technology for semantic understanding of the video. Many tasks in the computer vision such as tracking, recognition and 3D reconstruction, etc. rely on the segmentation result as preprocessing. However, the video segmentation has been known to be a very complicated and hard problem. The objects in the video change their colors and shapes according to the surrounding illumination, the camera position, or the object motion. The color, motion, or depth has been utilized individually as a key clue for the segmentation in many researches. However, every object in the image is composed of several features such as color, texture, depth and motion. That is why single-feature based segmentation method often fails. Humans can segment the objects in video with ease because the human visual system enables to consider color, texture, depth and motion at the same time. In this paper, we propose the video segmentation algorithm which is motivated by the human visual system. The algorithm performs the video segmentation task by simultaneously utilizing the color histogram of the color, the optical flow of the motion, and the homography of the structure. Our results show that the proposed algorithm outperforms other appearance based segmentation method in terms of semantic quality of the segmentation [15]. The proposed segmentation method will serve as a basis for better high-level tasks such as recognition, tracking [3],[4] and video understanding [1].

Paper Details

Date Published: 8 February 2015
PDF: 9 pages
Proc. SPIE 9406, Intelligent Robots and Computer Vision XXXII: Algorithms and Techniques, 94060B (8 February 2015); doi: 10.1117/12.2083165
Show Author Affiliations
Jihwan Woo, Samsung Electronics Co., Ltd. (Korea, Republic of)
Kris Kitani, Carnegie Mellon Univ. (United States)
Sehoon Kim, Samsung Electronics Co., Ltd. (Korea, Republic of)
Hantak Kwak, Samsung Electronics Co., Ltd. (Korea, Republic of)
Woosung Shim, Samsung Electronics Co., Ltd. (Korea, Republic of)

Published in SPIE Proceedings Vol. 9406:
Intelligent Robots and Computer Vision XXXII: Algorithms and Techniques
Juha Röning; David Casasent, Editor(s)

© SPIE. Terms of Use
Back to Top