摘要(英) |
With the development of video and audio entertainment, not only TVs, movies, but also popular video streaming platforms Youtube and Twitch are pursuing higher
and higher image quality. Recently, live broadcast is becoming more popular. In terms of hardware, the screens of TV are getting larger and larger. In order to effectively compress the huge data volume of high-resolution images, HEVC (High Efficiency Video Coding) uses many methods to effectively reduce bit transmission. In this paper, we apply the SVM (Support Vector Machine) model to the encoding side of HEVC inter prediction classify the depth of the coding unit, and use the motion vector variation information of the inter prediction coding unit and the CBF information of the merge mode. And the depth information of the adjacent blocks is used as the features of the training SVM model to classify a CTU, which is divided into four categories: Subgroup0, 1, 2, and 3. Subgroup0 contains CTU depth 0, and Subgroup1
contains depth 0, 1. , Subgroup2 contains depth 0, 1, 2 and Subgroup3 contains depth 0, 1, 2, 3. Finally, the best depth of CTU will be selected after the RDO process. This algorithm can save 23.5% of the encoding time at the encoder, but it increases by 0.07 % BDBR, so we decided to use post-processing technology to compensate for
the coding performance caused by saving coding time at the decoder. We use the convolutional neural network CNN (Convolutional Neural Network) model in HEVC
post-processing to improve Image quality. In the experiment, the side information concept mentioned in the information theory is combined. The more side information
can reduce the more uncertainty, so in addition to the input of the distorted image after compression, the features used in the SVM model at the encoder will be added as the second input. It can help CNN model training more accurately. Finally, the error caused by quantization in the encoding will be added as the third input of the CNN model. So after we combined the overall architecture, compared with the reference program HM16.0, our algorithm achieves up to 6.59% BDBR reduction and 0.237dB BDPSNR increase. |
參考文獻 |
[1] JCT-VC, “High efficiency video coding (HEVC) test model 15(HM15) encoder description,” JCTVC-Q1002, JCT-VC Meeting, Valencia, ES, Apr. 2014.
[2] I. E. G. Richardson, H.264 and MPEG-4 Video Compression: Video Coding for Next-generation Multimedia. Aberdeen, U.K.: John Wiley & Sons, 2003.
[3] “Generic coding of moving pictures and associated audio information,” ISO/IEC 13818-2: Video (MPEG-2), May 1996.
[4] P. Helle, S. Oudin, B. Bross, D. Marpe, M. O. Bici, K. Ugur, J. Jung, G. Clare, and T. Wiegand, “Block merging for quadtree-based partitioning in HEVC,” in
Proc. IEEE Transactions on circuits and systems for video technology, vol. 22, no.12, pp. 1720-1731, Dec. 2012.
[5] L. Zhao, X. Guo, S. Lei, S. Ma and D. Zhao, “Simplified AMVP for high efficiency video coding,” in Proc. IEEE ICIP, pp. 1-4, 27-30 Nov. 2012.
[6] J. L. Lin, Y. W. Chen, Y. W. Huang, and S. M. Lei, “Motion vector coding in the HEVC standard,” in Proc. IEEE Journal of Selected Topics in Signal Processing,
vol. 7, no. 6, pp. 957-968, 3 July 2013.
[7] Y. Ismail and S. El-etriby, “Fast diamond search algorithm for real time video coding,” in Proc. IEEE ICNC, pp. 729-733, Feb. 2012.
[8] LIBSVM—A Library for Support Vector, Machines
http://www.csie.ntu.edu.tw/~cjlin/libsvm/index.html
[9] K. Alex, I. Sutskever, and G. E. Hinton, “ImageNet Classification with Deep Convolutional Neural Networks,” in Advances in Neural Information Processing Systems, pp.1097-1105, 2012.
[10] Y. Lecun, et al., “Gradient-based learning applied to document recognition”, Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
[11] I. Mrazova, M. Kukacka, “Hybrid convolutional neural networks”, Industrial Informatics INDIN 2008. 6th IEEE International Conference, 2008.
[12] S. Lawrence, et al., “Face recognition: A convolutional neural-network approach”, IEEE Transactions on Neural Networks, vol.8, no. 1, pp. 98-113, 1997.
[13] J.K. Liu, “Efficient HEVC inter prediction using SVM,” Department of Communication Engineering National Central University, Taiwan 32054, R.O.C., Jan 2019.
[14] S.J Cai, “Reduction of computation complexity for HEVC intra prediction with support vector machine,” National Central University, Master Thesis, Jun 2017.
[15] C. Li, L. Song, R. Xie, W. Zhang, "Cnn Based Post-Processing To Improve Hevc", International Conference on Image Processing(ICIP) 2017, pp.4577-4580
[16] J. Kim, J.K. Lee, K.M. Lee, “Accurate Image Super-Resolution Using Very Deep Convolutional Networks”, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 1646-1654
[17] X. He, Q. Hu, X. Han, X. Zhang, C. Zhang, W. Lin, "Enhancing Hevc Compressed Videos With A Partition-Masked Convolutional Neural Network", International Conference on Image Processing(ICIP) 2018, pp.216-220
[18] Daowen Li, Lu Yu, “An In-Loop Filter Based on Low-Complexity CNN using Residuals in Intra Video Coding”, 2019 IEEE International Symposium on Circuits and Systems (ISCAS)
[19] S.M. Fan, “Study of A Deep Learning Architecture For HEVC Decoder”, Department of Communication Engineering National Central University, Taiwan 32054, R.O.C., Jan 2020.
[20] Y.C. Chang, “A Combined Support Vector Machine and Convolutional Neural Network Architecture for HEVC”, Department of Communication Engineering National Central University, Taiwan 32054, R.O.C., Jan 2020.
[21] Kaiming He , Xiangyu Zhang , Shaoqing Ren , Jian Sun, “Deep Residual Learning for Image Recognition”, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
[22] C.H. Chan, “CNN-based post-processing for HEVC intra prediction”, Department of Communication Engineering National Central University, Taiwan 32054, R.O.C., JUL 2020.
[23] H. Zhang, L. Song, Z. Luo, X. Yang, “Learning a Convolutional Neural Network for Fractional Interpolation in HEVC Inter Coding”, 2017 IEEE Visual Communications and Image Processing (VCIP)
[24] C.H. Yeh, Z.T. Zhang, M.J. Chen, C.Y. Lin, “HEVC Intra Frame Coding Based on Convolutional Neural Network”, IEEE Access p.p. 50087 – 50095
[25] R. Yang, M. Xu, Z. Wang, “Decoder-side hevc quality enhancement with scalable convolutional neural network,” in Multimedia and Expo (ICME), 2017 IEEE International Conference on. IEEE, 2017, pp. 817–822.
[26] F. Li, W. Tan, B. Yan,“Deep Residual Network for Enhancing Quality of the Decoded Intra Frames of Hevc”, 2018 25th IEEE International Conference on Image Processing (ICIP)
[27] J. Xu, L. Song, R. Xie,"Shot boundary detection using convolutional neural networks", Visual Communications and Image Processing (VCIP), 2016. IEEE,
2016, pp. 1–4.
[28] Grand Challenge ICIP 2017, "Grand challenge on the use of image restoration for video coding efficiency improvement", Chttps://storage.googleapis.com/icip
2017/index.html.
[29] X. He, Q. Hu, X. Han, X. Zhang, C. Zhang, W. Lin, "Enhancing Hevc Compressed Videos With A Partition-Masked Convolutional Neural Network", International Conference on Image Processing(ICIP) 2018, pp.216-220
[30] Y.Dai, D. Liu, F.Wu, "A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding", MultiMedia Modeling(MMM) 2017, pp.2839 |