Low-complexity research for next-generation VVC standard and its neural network extension

Research Project

Project/Area Number	21K17770
Research Category	Grant-in-Aid for Early-Career Scientists
Allocation Type	Multi-year Fund
Review Section	Basic Section 61010:Perceptual information processing-related
Research Institution	Yokohama National University (2023) Waseda University (2021-2022)
Principal Investigator	孫鶴鳴横浜国立大学, 大学院工学研究院, 准教授 (90835886)
Project Period (FY)	2021-04-01 – 2025-03-31
Project Status	Granted (Fiscal Year 2023)
Budget Amount *help	¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000) Fiscal Year 2022: ¥1,170,000 (Direct Cost: ¥900,000、Indirect Cost: ¥270,000) Fiscal Year 2021: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Keywords	VVC / Compression / Video coding / Video coding for machine / Transform / Filter / Intra prediction / Neural network / Low complexity
Outline of Research at the Start	Emerging video compression standard Versatile Video Coding (VVC) can double the compression ratio than the previous standard at the cost of high coding complexity. This research will reduce the complexity of VVC, and further enhance its coding gain by exploiting light yet efficient neural networks.
Outline of Annual Research Achievements	This year, I mainly focused on two components in VVC. One is adaptive loop filter which is a new filter adopted in VVC. To reduce the complexity overhead of adaptive loop filter, a hardware-oriented algorithm is developed at first. In detail, filter coefficient is limited and two mapping methods are proposed. After that, the corresponding VLSI architecture is also designed, and it can realize 4K@30fps throughput. Compared with previous work, more than 41% normalized area can be saved. Besides, the proposed coefficient limitation will not influence the compression efficiency in terms of BD-rate. The other is intra prediction which is an essential component in various VVC coding configurations. To improve the compression efficiency, several new intra coding features have been adopted in VVC. For prediction partition, QTMT is adopted while only QT was used in previous standard HEVC. For prediction mode, the number is as large as 67, while it was only 35 in HEVC. To reduce the number of partitions and modes requiring a complete rate-distortion computation process, histogram of oriented gradient information is used. As a result, compared with original VVC test model, about 69% encoding time can be saved with only 2.96% BD-rate overhead. To illustrate the universality of the proposal, the method is also implemented on VVenC and VVdeC. Both topics are accepted by peer-reviewed international journals.
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason Within this year, I published two international journals on the topics of low-complexity VVC. One is about VVC intra prediction algorithm which is published in Journal of Visual Communication and Image Representation (Impact Factor: 2.6). The other is about VVC adaptive loop filter architecture which is published in IEEE Transactions on Circuits and Systems-II (Impact Factor: 4.4). In addition to journal papers, I have also published some international conferences on the related topics with VVC.
Strategy for Future Research Activity	I plan to explore several potential usages by VVC in the coming next year. One direction is to utilize LLM to help improving the coding efficiency of VVC. Another direction is to use VVC to compress neural network features rather than video.

Report

(3 results)

Research Products
(25 results)

All 2023 2022 2021 Other

All Int'l Joint Research (2 results) Journal Article (7 results) (of which Int'l Joint Research: 4 results, Peer Reviewed: 5 results, Open Access: 1 results) Presentation (16 results) (of which Int'l Joint Research: 16 results)

[Int'l Joint Research] Fudan University(中国)
- Related Report
  2023 Research-status Report
[Int'l Joint Research] Zhejiang Univerisity(中国)
- Related Report
  2022 Research-status Report
[Journal Article] Area-Efficient Processing Elements-Based Adaptive Loop Filter Architecture With Optimized Memory for VVC2023
- Author(s)
  Hao Zhijian、Sun Heming、Li Sirui、Zeng Xiaoyang、Fan Yibo
- Journal Title
  
  IEEE Transactions on Circuits and Systems II: Express Briefs
  
  Volume: 70 Issue: 11 Pages: 4231-4235
- DOI
  10.1109/tcsii.2023.3280167
- Related Report
  2023 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] A novel fast intra algorithm for VVC based on histogram of oriented gradient2023
- Author(s)
  Gou Aorui、Sun Heming、Liu Chao、Zeng Xiaoyang、Fan Yibo
- Journal Title
  
  Journal of Visual Communication and Image Representation
  
  Volume: 95 Pages: 103888-103888
- DOI
  10.1016/j.jvcir.2023.103888
- Related Report
  2023 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] A Highly Pipelined and Highly Parallel VLSI Architecture of CABAC Encoder for UHDTV Applications2023
- Author(s)
  Fu Chen、Sun Heming、Zhang Zhiqiang、Zhou Jinjia
- Journal Title
  
  Sensors
  
  Volume: 23 Issue: 9 Pages: 4293-4293
- DOI
  10.3390/s23094293
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] A Reconfigurable Multiple Transform Selection Architecture for VVC2023
- Author(s)
  Zhijian Hao, Heming Sun, Guoqing Xiang, Peng Zhang, Xiaoyang Zeng, Yibo Fan
- Journal Title
  
  IEEE Transactions on Very Large Scale Integration (VLSI) Systems
  
  Volume: 31 Issue: 5 Pages: 658-669
- DOI
  10.1109/tvlsi.2023.3245291
- Related Report
  2022 Research-status Report
[Journal Article] An Efficient Low-Complexity Convolutional Neural Network Filter2022
- Author(s)
  Chao Liu, Heming Sun, Jiro Katto, Xiaoyang Zeng, Yibo Fan
- Journal Title
  
  IEEE Multimedia
  
  Volume: 29 Issue: 2 Pages: 83-93
- DOI
  10.1109/mmul.2022.3159372
- Related Report
  2022 Research-status Report
[Journal Article] QA-Filter: A QP-Adaptive Convolutional Neural Network Filter for Video Coding2022
- Author(s)
  Chao Liu, Heming Sun, Jiro Katto, Xiaoyang Zeng, Yibo Fan
- Journal Title
  
  IEEE Transactions on Image Processing
  
  Volume: 31 Pages: 3032-3045
- DOI
  10.1109/tip.2022.3152627
- Related Report
  2021 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Learned Image Compression With Separate Hyperprior Decoders2021
- Author(s)
  Zhao Zan, Chao Liu, Heming Sun, Xiaoyang Zeng, Yibo Fan
- Journal Title
  
  IEEE Open Journal of Circuits and Systems
  
  Volume: 2 Pages: 627-632
- DOI
  10.1109/ojcas.2021.3125354
- Related Report
  2021 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] Fast VVC Intra Encoding for Video Coding for Machines2023
- Author(s)
  Aorui Gou, Heming Sun, Xiaoyang Zeng, Yibo Fan
- Organizer
  IEEE International Symposium on Circuits and Systems (ISCAS)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Improving Latent Quantization of Learned Image Compression with Gradient Scaling2022
- Author(s)
  Heming Sun, Lu Yu, Jiro Katto
- Organizer
  IEEE International Conference on Visual Communications and Image Processing (VCIP)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Semantic Segmentation In Learned Compressed Domain2022
- Author(s)
  Jinming Liu, Heming Sun, Jiro Katto
- Organizer
  Picture Coding Symposium (PCS)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] F-LIC: FPGA-based Learned Image Compression with a Fine-grained Pipeline2022
- Author(s)
  Heming Sun, Qingyang Yi, Fangzheng Lin, Lu Yu, Jiro Katto, Masahiro Fujita
- Organizer
  IEEE Asian Solid-State Circuits Conference (A-SSCC)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Streaming-Capable High-Performance Architecture of Learned Image Compression Codecs2022
- Author(s)
  Fangzheng Lin, Heming Sun, Jiro Katto
- Organizer
  IEEE International Conference on Image Processing (ICIP)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Memory-Efficient Learned Image Compression with Pruned Hyperprior Module2022
- Author(s)
  Ao Luo, Heming Sun, Jinming Liu, Jiro Katto
- Organizer
  IEEE International Conference on Image Processing (ICIP)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Improving Multiple Machine Vision Tasks in the Compressed Domain2022
- Author(s)
  Jinming Liu, Heming Sun, Jiro Katto
- Organizer
  International Conference on Pattern Recognition (ICPR)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Fast Intra Mode Decision for VVC Based on Histogram of Oriented Gradient2022
- Author(s)
  Aorui Gou, Heming Sun, Jiro Katto, Tingting Li, Xiaoyang Zeng, Yibo Fan
- Organizer
  IEEE International Symposium on Circuits and Systems (ISCAS)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] An Area-efficient Unified Transform Architecture for VVC2022
- Author(s)
  Zhijian Hao, Qi Zheng, Yibo Fan, Guoqing Xiang, Peng Zhang, Heming Sun
- Organizer
  IEEE International Symposium on Circuits and Systems (ISCAS)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] A QP-adaptive mechanism for CNN-based filter in video coding2022
- Author(s)
  Chao Liu, Heming Sun, Jiro Katto, Xiaoyang Zeng, Yibo Fan
- Organizer
  IEEE International Symposium on Circuits and Systems (ISCAS)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Learning in Compressed Domain for Faster Machine Vision Tasks2021
- Author(s)
  Jinming Liu, Heming Sun, Jiro Katto
- Organizer
  IEEE International Conference on Visual Communications and Image Processing
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] A Hardware Architecture for Adaptive Loop Filter in VVC Decoder2021
- Author(s)
  Xin Wang, Heming Sun, Jiro Katto, Yibo Fan
- Organizer
  IEEE International Conference on ASIC
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Fast Object Detection in HEVC Intra Compressed Domain2021
- Author(s)
  Liuhong Chen, Heming Sun, Jiro Katto, Xiaoyang Zeng, Yibo Fan
- Organizer
  European Signal Processing Conference
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Learned Image Compression with Fixed-point Arithmetic2021
- Author(s)
  Heming Sun, Lu Yu, Jiro Katto
- Organizer
  Picture Coding Symposium
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Accelerating Convolutional Neural Network Inference Based on a Reconfigurable Sliced Systolic Array2021
- Author(s)
  Yixuan Zeng, Heming Sun, Jiro Katto, Yibo Fan
- Organizer
  IEEE International Symposium on Circuits and Systems
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Approximated Reconfigurable Transform Architecture for VVC2021
- Author(s)
  Yixuan Zeng, Heming Sun, Jiro Katto, Yibo Fan
- Organizer
  IEEE International Symposium on Circuits and Systems
- Related Report
  2021 Research-status Report
- Int'l Joint Research

Low-complexity research for next-generation VVC standard and its neural network extension

Principal Investigator

孫 鶴鳴 横浜国立大学, 大学院工学研究院, 准教授 (90835886)

¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000)

Current Status of Research Progress

Reason

Report

Research Products

[Int'l Joint Research] Fudan University(中国)

Related Report

[Int'l Joint Research] Zhejiang Univerisity(中国)

Related Report

[Journal Article] Area-Efficient Processing Elements-Based Adaptive Loop Filter Architecture With Optimized Memory for VVC2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] A novel fast intra algorithm for VVC based on histogram of oriented gradient2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] A Highly Pipelined and Highly Parallel VLSI Architecture of CABAC Encoder for UHDTV Applications2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] A Reconfigurable Multiple Transform Selection Architecture for VVC2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] An Efficient Low-Complexity Convolutional Neural Network Filter2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] QA-Filter: A QP-Adaptive Convolutional Neural Network Filter for Video Coding2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Learned Image Compression With Separate Hyperprior Decoders2021

Author(s)

Journal Title

DOI

Related Report

[Presentation] Fast VVC Intra Encoding for Video Coding for Machines2023

Author(s)

Organizer

Related Report

[Presentation] Improving Latent Quantization of Learned Image Compression with Gradient Scaling2022

Author(s)

Organizer

Related Report

[Presentation] Semantic Segmentation In Learned Compressed Domain2022

Author(s)

Organizer

Related Report

[Presentation] F-LIC: FPGA-based Learned Image Compression with a Fine-grained Pipeline2022

Author(s)

Organizer

Related Report

[Presentation] Streaming-Capable High-Performance Architecture of Learned Image Compression Codecs2022

Author(s)

Organizer

Related Report

[Presentation] Memory-Efficient Learned Image Compression with Pruned Hyperprior Module2022

Author(s)

Organizer

Related Report

[Presentation] Improving Multiple Machine Vision Tasks in the Compressed Domain2022

Author(s)

Organizer

Related Report

[Presentation] Fast Intra Mode Decision for VVC Based on Histogram of Oriented Gradient2022

Author(s)

Organizer

Related Report

[Presentation] An Area-efficient Unified Transform Architecture for VVC2022

孫鶴鳴横浜国立大学, 大学院工学研究院, 准教授 (90835886)