深層学習による主観的高画質化を目指した新しい圧縮技術の研究

Research Project

Project/Area Number	19J14620
Research Category	Grant-in-Aid for JSPS Fellows
Allocation Type	Single-year Grants
Section	国内
Review Section	Basic Section 61010:Perceptual information processing-related
Research Institution	Waseda University
Principal Investigator	CHENG ZHENGXUE 早稲田大学, 基幹理工学研究科, 特別研究員(PD)
Project Period (FY)	2019-04-25 – 2021-03-31
Project Status	Declined (Fiscal Year 2020)
Budget Amount *help	¥1,700,000 (Direct Cost: ¥1,700,000) Fiscal Year 2020: ¥800,000 (Direct Cost: ¥800,000) Fiscal Year 2019: ¥900,000 (Direct Cost: ¥900,000)
Keywords	画像圧縮 / 深層学習 / 高画質化
Outline of Research at the Start	画像圧縮技術は、マルチメディアコンテンツの効率的な保存と伝送を可能とする基礎的な研究分野として知られている。従来の画像圧縮技術には、JPEG、JPEG2000、HEVCイントラがあるが、超高解像度、3D、360度やVRといったマルチメディアコンテンツのさらなる大容量化に伴い、より高効率な画像圧縮技術が求められている。本研究は、システム全体の設計して、深層学習を適用しエンドユーエンド学習を可能することに独創性があると言える。ユーザーの感じる主観画質の高画質化を目的とし、深層学習を活用した次世代の画像圧縮技術について研究開発を通して、画像圧縮技術のさらなる発展に貢献する。
Outline of Annual Research Achievements	最近超高解像度、360度といったマルチメディアコンテンツのさらなる大容量化に伴い、より高効率な圧縮技術が求められている。令和1年度は、前述した目的を達成するために、従来の映像符号化標準に従い、提案するユーザーの感じる主観画質の高画質化を目的とし、CVPRとICIP国際学会で筆頭著者として発表を行うと共に、査読付き論文誌と令和２年度の国際学会ICASSPとCVPRへの採択が確定していた。
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason So far, the progress is conducted smoothly according to my plan. I have finished my target in the first year. And I have seperate my goals and future plan into two parts as followed. Regarding deep learning based lossy image compression, I have reached a milestone, that is, reaching the performance of HEVC on both objective quality metrics and subjective quality, and the next plan for is to outperform VVC. Besides, some of our experiments have validate the possibility to achieve this target. Regarding deep learning based lossy video compression, I have reached a milestone to achieve comparable performance with H.264. The next step is to outperform HEVC, or even VVC. Specially, my plan is to develop better P-frame prediction and B-frame prediction algorithms and incorporate them into inter-encoded video compression algorithms.
Strategy for Future Research Activity	To promote the development of research, we are considering the following two strategies. The first is to promote the learned image compression. I will develop a better entropy model by extending current single Gaussian to Gaussian mixture likelihoods. Second, I will enhance the network structure by consider some novel modules such as attention. The CVPR, a flagship meeting in the field of computer vision will have the event "Workshop and Challenge on Learned Image Compression” (CLIC) and I will participate in the challenge. Not only I can compare my results with research teams from other universities and companies, but also I could compare with conventional compression standards, because some hybrid methods are also submitted to this challenge. The second is to enhance the performance of learned video compression. My plan is to enhance the overall performance module-by-module. Then what I can do include the better video prediction (P-frames) algorithms, better video interpolation algorithms (B-frames) and better network to encode residual and motion vectors etc. Based on the knowledge of learned image compression, I can extend the entropy models to videos and combine all these modules together. Finally, I can achieve better results than my current result. Finally, through this research and development, we can contribute to the further development of compression technology.

Report

(1 results)

2019 Annual Research Report

Research Products
(9 results)

All 2020 2019

All Journal Article (5 results) (of which Int'l Joint Research: 5 results, Peer Reviewed: 5 results, Open Access: 1 results) Presentation (4 results) (of which Int'l Joint Research: 3 results)

[Journal Article] Energy Compaction-Based Image Compression Using Convolutional AutoEncoder2020
- Author(s)
  Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto
- Journal Title
  
  IEEE Transactions on Multimedia
  
  Volume: 22 Issue: 4 Pages: 860-873
- DOI
  10.1109/tmm.2019.2938345
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Enhanced Intra Prediction for Video Coding by Using Multiple Neural Networks2020
- Author(s)
  Heming Sun, Zhengxue Cheng, Masaru Takeuchi, Jiro Katto
- Journal Title
  
  IEEE Transactions on Multimedia
  
  Volume: NA Issue: 11 Pages: 1-14
- DOI
  10.1109/tmm.2019.2963620
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Methods for Adaptive Video Streaming and Picture Quality Assessment to Improve QoS/QoE Performances2019
- Author(s)
  Kenji KANAI, Bo WEI, Zhengxue CHENG, Masaru TAKEUCHI, Jiro KATTO
- Journal Title
  
  IEICE Transactions on Communications
  
  Volume: E102.B Issue: 7 Pages: 1240-1247
- DOI
  10.1587/transcom.2018ANI0003
- NAID
  130007671372
- ISSN
  0916-8516, 1745-1345
- Year and Date
  2019-07-01
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Perceptual Quality Study on Deep Learning Based Image Compression2019
- Author(s)
  Zhengxue Cheng, Pinar Akyazi, Heming Sun, Jiro Katto, Touradj Ehrahimi
- Journal Title
  
  Proceedings of IEEE International Conference on Image Processing (ICIP)
  
  Volume: 1 Pages: 1-5
- DOI
  10.1109/icip.2019.8803824
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Learning Image and Video Compression through Spatial-Temporal Energy Compaction2019
- Author(s)
  Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto
- Journal Title
  
  Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  
  Volume: 1 Pages: 1-10
- DOI
  10.1109/cvpr.2019.01031
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Presentation] Perceptual Quality Study on Deep Learning based Image Compression2020
- Author(s)
  Zhengxue Cheng
- Organizer
  2020年電子情報通信総合大会
- Related Report
  2019 Annual Research Report
[Presentation] Perceptual Quality Study on Deep Learning Based Image Compression2019
- Author(s)
  Zhengxue Cheng
- Organizer
  IEEE International Conference on Image Processing (ICIP)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Learning Image and Video Compression through Spatial-Temporal Energy Compaction2019
- Author(s)
  Zhengxue Cheng
- Organizer
  2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Deep Residual Learning for Image Compression2019
- Author(s)
  Zhengxue Cheng
- Organizer
  2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshop
- Related Report
  2019 Annual Research Report
- Int'l Joint Research

深層学習による主観的高画質化を目指した新しい圧縮技術の研究

Principal Investigator

CHENG ZHENGXUE 早稲田大学, 基幹理工学研究科, 特別研究員(PD)

¥1,700,000 (Direct Cost: ¥1,700,000)

Current Status of Research Progress

Reason

Report

Research Products

[Journal Article] Energy Compaction-Based Image Compression Using Convolutional AutoEncoder2020

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Enhanced Intra Prediction for Video Coding by Using Multiple Neural Networks2020

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Methods for Adaptive Video Streaming and Picture Quality Assessment to Improve QoS/QoE Performances2019

Author(s)

Journal Title

DOI

NAID

ISSN

Year and Date

Related Report

[Journal Article] Perceptual Quality Study on Deep Learning Based Image Compression2019

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Learning Image and Video Compression through Spatial-Temporal Energy Compaction2019

Author(s)

Journal Title

DOI

Related Report

[Presentation] Perceptual Quality Study on Deep Learning based Image Compression2020

Author(s)

Organizer

Related Report

[Presentation] Perceptual Quality Study on Deep Learning Based Image Compression2019

Author(s)

Organizer

Related Report

[Presentation] Learning Image and Video Compression through Spatial-Temporal Energy Compaction2019

Author(s)

Organizer

Related Report

[Presentation] Deep Residual Learning for Image Compression2019

Author(s)

Organizer

Related Report