摘要翻译:
运动估计是所有视频编码器中的重要步骤之一。视频编码器的大部分复杂度取决于运动估计步骤的复杂度。原始的运动估计算法复杂度很高,因此人们提出了许多改进方案来增强原始的运动估计算法。其中许多工作的基本思想是对块匹配中的均方误差(MSE)或绝对差分和(SAD)的失真函数进行优化,但这些度量并不能反映块匹配的质量,而且也不符合人类视觉系统(HVS)的要求。本文探讨了图像质量度量在视频编码中的应用,特别是在运动估计中的应用。在基于块的运动估计中,我们使用了感知图像质量度量来代替MSE或SAD。三种不同的度量方法:结构相似度或SSIM、复小波结构相似度或CW-SSIM、视觉信息保真度或VIF。实验结果表明,使用质量准则可以在质量不变的情况下提高压缩比,从而在相同比特预算下提高编码视频的质量。
---
英文标题:
《A Perceptual Based Motion Compensation Technique for Video Coding》
---
作者:
Amin Banitalebi, Said Nader-Esfahani, and Alireza Nasiri Avanaki
---
最新提交年份:
2018
---
分类信息:
一级分类:Electrical Engineering and Systems Science 电气工程与系统科学
二级分类:Image and Video Processing 图像和视频处理
分类描述:Theory, algorithms, and architectures for the formation, capture, processing, communication, analysis, and display of images, video, and multidimensional signals in a wide variety of applications. Topics of interest include: mathematical, statistical, and perceptual image and video modeling and representation; linear and nonlinear filtering, de-blurring, enhancement, restoration, and reconstruction from degraded, low-resolution or tomographic data; lossless and lossy compression and coding; segmentation, alignment, and recognition; image rendering, visualization, and printing; computational imaging, including ultrasound, tomographic and magnetic resonance imaging; and image and video analysis, synthesis, storage, search and retrieval.
用于图像、视频和多维信号的形成、捕获、处理、通信、分析和显示的理论、算法和体系结构。感兴趣的主题包括:数学,统计,和感知图像和视频建模和表示;线性和非线性滤波、去模糊、增强、恢复和重建退化、低分辨率或层析数据;无损和有损压缩编码;分割、对齐和识别;图像渲染、可视化和打印;计算成像,包括超声、断层和磁共振成像;以及图像和视频的分析、合成、存储、搜索和检索。
--
---
英文摘要:
Motion estimation is one of the important procedures in the all video encoders. Most of the complexity of the video coder depends on the complexity of the motion estimation step. The original motion estimation algorithm has a remarkable complexity and therefore many improvements were proposed to enhance the crude version of the motion estimation. The basic idea of many of these works were to optimize some distortion function for mean squared error (MSE) or sum of absolute difference (SAD) in block matching But it is shown that these metrics do not conclude the quality as it is, on the other hand, they are not compatible with the human visual system (HVS). In this paper we explored the usage of the image quality metrics in the video coding and more specific in the motion estimation. We have utilized the perceptual image quality metrics instead of MSE or SAD in the block based motion estimation. Three different metrics have used: structural similarity or SSIM, complex wavelet structural similarity or CW-SSIM, visual information fidelity or VIF. Experimental results showed that usage of the quality criterions can improve the compression rate while the quality remains fix and thus better quality in coded video at the same bit budget.
---
PDF链接:
https://arxiv.org/pdf/1803.04607