HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment

Xinyi Wang; Angeliki Katsenou; David Bull

ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment

Abstract

With the rapid growth of User-Generated Content (UGC) exchanged between users and sharing platforms, the need for video quality assessment in the wild is increasingly evident. UGC is typically acquired using consumer devices and undergoes multiple rounds of compression (transcoding) before reaching the end user. Therefore, traditional quality metrics that employ the original content as a reference are not suitable. In this paper, we propose ReLaX-VQA, a novel No-Reference Video Quality Assessment (NR-VQA) model that aims to address the challenges of evaluating the quality of diverse video content without reference to the original uncompressed videos. ReLaX-VQA uses frame differences to select spatio-temporal fragments intelligently together with different expressions of spatial features associated with the sampled frames. These are then used to better capture spatial and temporal variabilities in the quality of neighbouring frames. Furthermore, the model enhances abstraction by employing layer-stacking techniques in deep neural network features from Residual Networks and Vision Transformers. Extensive testing across four UGC datasets demonstrates that ReLaX-VQA consistently outperforms existing NR-VQA methods, achieving an average SRCC of 0.8658 and PLCC of 0.8873. Open-source code and trained models that will facilitate further research and applications of NR-VQA can be found at https://github.com/xinyiW915/ReLaX-VQA.

Code Repositories

xinyiw915/relax-vqa
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
video-quality-assessment-on-konvid-1kReLaX-VQA (trained on LSVQ only)
PLCC: 0.8427
video-quality-assessment-on-konvid-1kReLaX-VQA
PLCC: 0.8473
video-quality-assessment-on-konvid-1kReLaX-VQA (finetuned on KoNViD-1k)
PLCC: 0.8668
video-quality-assessment-on-live-vqcReLaX-VQA (finetuned on LIVE-VQC)
PLCC: 0.8876
video-quality-assessment-on-live-vqcReLaX-VQA (trained on LSVQ only)
PLCC: 0.8242
video-quality-assessment-on-live-vqcReLaX-VQA
PLCC: 0.8079
video-quality-assessment-on-youtube-ugcReLaX-VQA (finetuned on YouTube-UGC)
PLCC: 0.8652
video-quality-assessment-on-youtube-ugcReLaX-VQA (trained on LSVQ only)
PLCC: 0.8354
video-quality-assessment-on-youtube-ugcReLaX-VQA
PLCC: 0.8204

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment | Papers | HyperAI