Share Email Print

Proceedings Paper

Real-time pedestrian video segmentation using memory network
Author(s): Fan Zhao; Jin Liu
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

We propose a fast and efficient method for pedestrian video segmentation. Previous methods can only use the first frame or the previous frame or a combination of the two, but in our framework, all past frames can be used by using memory network. The past frames with corresponding masks form the memory, and the current frame as the target will be segmented using the information from the memory instead of itself for only. The solution can better handle the problems such as movement and appearance changes in the video. ResUnet is used as the segmentation network to improve time efficiency. Since no dataset is publicly available yet for pedestrian video segmentation, we have internally labeled a large dataset which contains 216 sequences in the training set and 24 sequences in the test set and it will be made public in the future. We validate our method on the test set and achieved the mean IU of 92.6 which is better than using previous methods while keeping real-time(90FPS for input of 160*96 on a TITAN V).

Paper Details

Date Published: 14 February 2020
PDF: 7 pages
Proc. SPIE 11432, MIPPR 2019: Remote Sensing Image Processing, Geographic Information Systems, and Other Applications, 114320M (14 February 2020); doi: 10.1117/12.2541903
Show Author Affiliations
Fan Zhao, Wuhan Univ. (China)
Jin Liu, Wuhan Univ. (China)

Published in SPIE Proceedings Vol. 11432:
MIPPR 2019: Remote Sensing Image Processing, Geographic Information Systems, and Other Applications
Zhiguo Cao; Jie Ma; Zhong Chen; Yu Shi, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?