学習型静止画像圧縮の実用化に関する研究

Research Project

Project/Area Number	23K16861
Research Category	Grant-in-Aid for Early-Career Scientists
Allocation Type	Multi-year Fund
Review Section	Basic Section 60040:Computer system-related
Research Institution	Yokohama National University
Principal Investigator	孫鶴鳴横浜国立大学, 大学院工学研究院, 准教授 (90835886)
Project Period (FY)	2023-04-01 – 2026-03-31
Project Status	Granted (Fiscal Year 2023)
Budget Amount *help	¥4,680,000 (Direct Cost: ¥3,600,000、Indirect Cost: ¥1,080,000) Fiscal Year 2025: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000) Fiscal Year 2024: ¥1,820,000 (Direct Cost: ¥1,400,000、Indirect Cost: ¥420,000) Fiscal Year 2023: ¥2,210,000 (Direct Cost: ¥1,700,000、Indirect Cost: ¥510,000)
Keywords	Image Compression / 画像圧縮 / 深層学習
Outline of Research at the Start	Learned image compression (LIC) has shown a superior coding ability than the traditional image compression standards. However, current LIC is still not practical due to several problems. This research aims at a practical LIC system by the co-development of both algorithm and architecture.
Outline of Annual Research Achievements	This year, I mainly worked on two topics. First is the algorithm-architecture co-optimization of learned image compression (LIC) on FPGA. Based on a pipelined architecture, the input and output channel parallelism is restricted to ease the routing phase, so that more DSP can be used. After that, the neural network channel is searched to improve the DSP efficiency. With high DSP utilization and efficiency, compared with the recent work, the throughput can be improved by at most 1.5x. Besides, the compression efficiency will not be affected after the neural network search. Second is the privacy of LIC for machine. The overall framework includes the client side which captures image and the cloud side which conducts the machine vision. To avoid the privacy leakage in the cloud side, feature of captured image is generated at the client side, coded by an LIC autoencoder and sent to the cloud side. The cloud side then decodes the feature and performs the machine vision based on the decoded feature. By optimizing the layer number of feature, we can not only reach a good trade-off between rate for the feature transmission and accuracy for the machine vision, but also avoid leaking the privacy information in the cloud.
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason This year, I have presented one domestic conference at IPSJ. In addition, one paper at IEEE international conference ISCAS and one paper at IEEE Journal on Emerging and Selected Topics in Circuits and Systems have been accepted. The domestic conference and journal paper are about the algorithm-architecture co-optimization of LIC. ISCAS paper is about the privacy of LIC on the usage of machine vision. Therefore, the research is undergoing well as scheduled.
Strategy for Future Research Activity	There are two plans. First is to realize the variable bitrate (VBR). Currently, multiple LIC models are required to support VBR, which is very memory consuming. To solve this problem, the most efficient way is to add a gain unit which is learnable through the training. However, adding the gain unit cannot realize an effective result on the high bitrate. To solve this problem, we plan to add the gain unit not only in the bottleneck layer, but also in the intermediate layers. Second is to improve the robustness of LIC network. Since the attacker can use FGSM to generate the adversarial samples. The bit consumption will become larger or the reconstruction quality will become worse. To solve this problem, we plan to use adversarial training to include the adversarial samples in the training.

Report

(1 results)

2023 Research-status Report

Research Products
(6 results)

All 2024 2023 Other

All Int'l Joint Research (1 results) Journal Article (2 results) (of which Int'l Joint Research: 1 results, Peer Reviewed: 2 results, Open Access: 1 results) Presentation (3 results) (of which Int'l Joint Research: 2 results)

[Int'l Joint Research] University of Bristol(英国)
- Related Report
  2023 Research-status Report
[Journal Article] FPGA Codec System of Learned Image Compression with Algorithm-Architecture Co-Optimization2024
- Author(s)
  Sun Heming、Yi Qingyang、Fujita Masahiro
- Journal Title
  
  IEEE Journal on Emerging and Selected Topics in Circuits and Systems
  
  Volume: 14 Issue: 2 Pages: 1-14
- DOI
  10.1109/jetcas.2024.3386328
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] Learned Lossless Image Compression With Combined Channel-Conditioning Models and Autoregressive Modules2023
- Author(s)
  Wang Ran、Liu Jinming、Sun Heming、Katto Jiro
- Journal Title
  
  IEEE Access
  
  Volume: 11 Pages: 73462-73469
- DOI
  10.1109/access.2023.3291591
- Related Report
  2023 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] Privacy-preserving with Flexible Autoencoder for Video Coding for Machines2024
- Author(s)
  Aorui Gou, Heming Sun, Xiaoyang Zeng, Yibo Fan
- Organizer
  IEEE International Symposium on Circuits and Systems (ISCAS)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Learned Image Compression with Algorithm-Architecture Co-Optimization2023
- Author(s)
  Heming Sun, Qingyang Yi, Masahiro Fujita
- Organizer
  IPSJ-SLDM
- Related Report
  2023 Research-status Report
[Presentation] PTS-LIC: Pruning Threshold Searching for Lightweight Learned Image Compression2023
- Author(s)
  Ao Luo, Heming Sun, Jinming Liu, Fangzheng Lin, Jiro Katto
- Organizer
  IEEE International Conference on Visual Communications and Image Professing (VCIP)
- Related Report
  2023 Research-status Report
- Int'l Joint Research

学習型静止画像圧縮の実用化に関する研究

Principal Investigator

孫 鶴鳴 横浜国立大学, 大学院工学研究院, 准教授 (90835886)

¥4,680,000 (Direct Cost: ¥3,600,000、Indirect Cost: ¥1,080,000)

Current Status of Research Progress

Reason

Report

Research Products

[Int'l Joint Research] University of Bristol(英国)

Related Report

[Journal Article] FPGA Codec System of Learned Image Compression with Algorithm-Architecture Co-Optimization2024

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Learned Lossless Image Compression With Combined Channel-Conditioning Models and Autoregressive Modules2023

Author(s)

Journal Title

DOI

Related Report

[Presentation] Privacy-preserving with Flexible Autoencoder for Video Coding for Machines2024

Author(s)

Organizer

Related Report

[Presentation] Learned Image Compression with Algorithm-Architecture Co-Optimization2023

Author(s)

Organizer

Related Report

[Presentation] PTS-LIC: Pruning Threshold Searching for Lightweight Learned Image Compression2023

Author(s)

Organizer

Related Report

孫鶴鳴横浜国立大学, 大学院工学研究院, 准教授 (90835886)