摘要(英) |
The covid-19 virus has spread all over the world in recent years. It has infected 500 million people and killed more than 6 million people, and the number is still rising; Taiwan is facing the epidemic, which has also caused tens of thousands of people to be diagnosed and need to be isolated at home. In order to reduce the risk of infection, schools have changed to online teaching; most of all the companies have also changed to online meetings to avoid gatherings. Limited by the screen resolution, system processing capability, storage capacity, and network bandwidth viewed by the user, it becomes a very important issue in transmitting high-quality or ultra-high-quality image compression technology. Reduce latency or distortion of the image and complete transmission of the sound to the other party at the same time is a very hot research topic.
The new generation of HEVC (High Efficiency Video Coding) compression standard can achieve 50% the compression efficiency compared to the previous generation H.264 in the same image quality. But the encoding operations are very complicated and cause irreversible image distortion. Distortion compensation is one of the methods. This paper continues the post-processing research, using the spatial domain correlation between pixels to extract image features, including Gaussian Mask and Residual algorithms, and enrolls the information theory in Introduces multiple side information structures. In this paper, we propose two CNN (Convolutional Neural Networks) models, VDSR (Very Deep Super-Resolution) and EDSR (Enhanced Deep Super-Solution) as the main architecture, to discuss the effect of Inter Prediction on encoder post-process under different CNN architectures. In the evaluation of processing efficiency, the CNN model is trained with different algorithm features. The image test results of the CNN with VDSR as the main structure have significantly improved the image quality of dual-input and triple-input, Forthmore, the test results of the CNN with the EDSR main structure, The results of SVM (Super Vector Machine) classification are also significantly better than the former, BDBR is reduced by about 0.381%~0.701%, and BDPSNR is improved by about 0.037~0.171db. It is particularly noteworthy that when the two architectures are in the same bitrate, EDSR’s BD-PSNR can be nearly doubled. |
參考文獻 |
[1] J. W. &. Sons, “E. G. Richardson, H.264 and MPEG-4 Video Compression: Video Coding for Next-generation Multimedia. Aberdeen,” Aberdeen, U.K , 2003.
[2] 維基百科, “High efficiency video coding,” 6 2019. [線上]. Available: https://zh.wikipedia.org/wiki/高效率視頻編碼.
[3] “Coding of audio-visual objects - Part 2: Visual,” ISO/IEC 14496-2 (MPEG-4 Visual Version 1), Apr. 1999.
[4] C.C.Wang, “A Study of DSP implementation of high efficient H.265/HEVC decoder based on OpenHEVC,” Department of Electronic Engineering,I-Shou University, 2015.
[5] 朱秀昌,劉峰,胡棟, H.265/HEVC視頻編碼新標準及其擴展, 北京: 電子工業出版社, 2016.
[6] J.K.Lui, “Reduction of Computational Complexity HEVC Inter Prediction With Support Vector Machine,” Department of Communication Engineering National Central University, Taiwan 32054, R.O.C, Jan 2019.
[7] Vivienne Sze,Madhukar Budagavi,Gary J.Sullivan, High Efficiency Video Coding(HEVC) Algorithms and Architectures, Springer International Publishing Switzerland , 2014.
[8] 陳允傑, TensorFlow與Keras,Python 深度學習應用實務, 台北: 旗標科技, 2019.
[9] D. Frossard, “Linear Regression with NumPy,Using gradient descent to perform linear regression,” 28 5 2016. [線上]. Available: https://www.cs.toronto.edu/~frossard/post/linear_regression/.
[10] Y.Y.Chen, “Nearly QP-Optimized Post Processing for HEVC Intra Prediction,” Department of Communication Engineering National Central University, Taiwan 32054, R.O.C, 2021.
[11] Q. H. X. H. X. Z. C. Z. W. L. X. He, “Enhancing Hevc Compressed Videos With A Partition-Masked Convolutional Neural Network,” International Conference on Image Processing(ICIP), pp. 216-220, 2018.
[12] C.K.Hsieh, “CNN-based post-processing for HEVC inter prediction,” Department of Communication Engineering National Central University, Taiwan 32054, R.O.C, 2020.
[13] B. Lim, S. Son, H. Kim, S. Nah and K. M. Lee, “Enhanced Deep Residual Networks for Single Image Super-Resolution,” IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1132-1140, 2017.
[14] C.H.Chan, “CNN-based post-processing for HEVC intra prediction,” Department of Communication Engineering National Central University, Taiwan 32054, R.O.C, 2020.
[15] Y.C.Chang, “A Combined Support Vector Machine and Convolutional Neural Network Architecture for HEVC,” Department of Communication Engineering National Central University, Taiwan 32054, R.O.C, 2020.
[16] K.He, X.Zhang, S.Ren, J.Sun, “Deep Residual Learning for Image Recognition,” The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770-778, 2016.
[17] P.H.Tsui, “Post-Processing for HEVC Intra Prediction with ResNet algorithm,” Department of Communication Engineering National Central University, Taiwan 32054, R.O.C, 2022.
[18] J.Kim, J.K. Lee, K.M. Lee, “Accurate Image Super-Resolution Using Very Deep Convolutional Networks,” IEEE Conference on Computer Vision and Pattern Recognition(CVPR), pp. 1646-1654, 2016.
|