arXiv Analytics

Sign in

arXiv:1910.13646 [eess.IV]AbstractReferencesReviewsResources

C3DVQA: Full-Reference Video Quality Assessment with 3D Convolutional Neural Network

Munan Xu, Junming Chen, Haiqiang Wang, Shan Liu, Ge Li, Zhiqiang Bai

Published 2019-10-30Version 1

Traditional video quality assessment (VQA) methods evaluate localized picture quality and video score is predicted by temporally aggregating frame scores. However, video quality exhibits different characteristics from static image quality due to the existence of temporal masking effects. In this paper, we present a novel architecture, namely C3DVQA, that uses Convolutional Neural Network with 3D kernels (C3D) for full-reference VQA task. C3DVQA combines feature learning and score pooling into one spatiotemporal feature learning process. We use 2D convolutional layers to extract spatial features and 3D convolutional layers to learn spatiotemporal features. We empirically found that 3D convolutional layers are capable to capture temporal masking effects of videos.We evaluated the proposed method on the LIVE and CSIQ datasets. The experimental results demonstrate that the proposed method achieves the state-of-the-art performance.

Related articles: Most relevant | Search more
arXiv:2012.03242 [eess.IV] (Published 2020-12-06)
Esophageal Tumor Segmentation in CT Images using a 3D Convolutional Neural Network
arXiv:2107.13820 [eess.IV] (Published 2021-07-29, updated 2021-08-02)
The interpretation of endobronchial ultrasound image using 3D convolutional neural network for differentiating malignant and benign mediastinal lesions
arXiv:2103.11695 [eess.IV] (Published 2021-03-22)
Predicting brain-age from raw T 1 -weighted Magnetic Resonance Imaging data using 3D Convolutional Neural Networks
Lukas Fisch et al.