Proceedings Paper

No-reference video quality assessment based on spatiotemporal slice images and deep convolutional neural networks
Author(s): Peng Yan; Xuanqin Mou

Paper Abstract

Most learning-based no-reference (NR) video quality assessment (VQA) methods must be trained on a large number of subjective quality scores. However, large volumes of subjective scores for videos are currently difficult to obtain. Inspired by the success of full-reference VQA methods that use spatiotemporal slice (STS) images to extract perceptual features and evaluate video quality, this paper adopts multi-directional video STS images, which are images composed of multi-directional sections of the video data, to deal with the lack of subjective quality scores. By sampling the STS images of a video into image patches and adding noise to the quality labels of the patches, we propose an NR VQA model based on multi-directional STS images and neural network training. Specifically, first, we select the subjective database that currently contains the largest number of videos with real distortions as the test set. Second, we extract multi-directional STS images from the videos and sample local patches from them to augment the training sample set; we also add noise to the quality labels of the local patches. Third, a deep neural network is constructed and trained to obtain a local quality prediction model for each patch in an STS image, and the quality of an entire video is then obtained by averaging the model's predictions over the multi-directional STS images. Finally, the experimental results indicate that the proposed method addresses the shortage of training samples in small subjective VQA datasets and achieves a high correlation with subjective evaluations.
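The pipeline described in the abstract (slice extraction, patch sampling, label noise, score averaging) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify the slice directions, patch size, noise distribution, or network, so the sketch assumes horizontal and vertical slices only, 32x32 non-overlapping patches, and zero-mean Gaussian label noise. All function names and parameters (`extract_sts`, `sample_patches`, `noisy_patch_labels`, `predict_video_quality`, `n_slices`, `sigma`) are hypothetical, and `patch_model` stands in for any trained patch-level predictor.

```python
import numpy as np

def extract_sts(video, n_slices=4):
    """Cut spatiotemporal slices from a grayscale video of shape (T, H, W).

    Assumption: only horizontal and vertical directions are used here;
    the paper's "multi-directional" slices may include other directions.
    """
    t, h, w = video.shape
    rows = np.linspace(0, h - 1, n_slices).astype(int)
    cols = np.linspace(0, w - 1, n_slices).astype(int)
    horizontal = [video[:, y, :] for y in rows]  # each slice is (T, W)
    vertical = [video[:, :, x] for x in cols]    # each slice is (T, H)
    return horizontal + vertical

def sample_patches(image, size=32, stride=32):
    """Sample square patches from one STS image on a regular grid."""
    n_rows, n_cols = image.shape
    patches = [image[r:r + size, c:c + size]
               for r in range(0, n_rows - size + 1, stride)
               for c in range(0, n_cols - size + 1, stride)]
    if not patches:
        return np.empty((0, size, size), dtype=image.dtype)
    return np.stack(patches)

def noisy_patch_labels(video_score, n_patches, sigma=0.05, rng=None):
    """Give every patch the video-level score plus noise.

    Assumption: zero-mean Gaussian noise; the abstract only says "some noise".
    """
    rng = np.random.default_rng() if rng is None else rng
    return video_score + rng.normal(0.0, sigma, size=n_patches)

def predict_video_quality(video, patch_model, n_slices=4, size=32, stride=32):
    """Average patch predictions within each slice, then across slices."""
    per_slice = []
    for sts in extract_sts(video, n_slices):
        patches = sample_patches(sts, size, stride)
        if len(patches) > 0:
            per_slice.append(float(np.mean(patch_model(patches))))
    return float(np.mean(per_slice))
```

During training, each patch from `sample_patches` would be paired with a label from `noisy_patch_labels`. At test time, `predict_video_quality(video, patch_model)` returns one score per video, where `patch_model` is any callable that maps an array of patches to an array of scores.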

Paper Details

Date Published: 18 November 2019
PDF: 10 pages
Proc. SPIE 11187, Optoelectronic Imaging and Multimedia Technology VI, 111870A (18 November 2019); doi: 10.1117/12.2536866
Peng Yan, Xi'an Jiaotong Univ. (China)
Xuanqin Mou, Xi'an Jiaotong Univ. (China)

Published in SPIE Proceedings Vol. 11187:
Optoelectronic Imaging and Multimedia Technology VI
Qionghai Dai; Tsutomu Shimura; Zhenrong Zheng, Editor(s)

© SPIE.