2023 年度実施状況報告書

学習型静止画像圧縮の実用化に関する研究

研究課題

研究課題/領域番号	23K16861
研究機関	横浜国立大学
研究代表者	孫鶴鳴横浜国立大学, 大学院工学研究院, 准教授 (90835886)
研究期間 (年度)	2023-04-01 – 2026-03-31
キーワード	Image Compression
研究実績の概要	This year, I mainly worked on two topics. First is the algorithm-architecture co-optimization of learned image compression (LIC) on FPGA. Based on a pipelined architecture, the input and output channel parallelism is restricted to ease the routing phase, so that more DSP can be used. After that, the neural network channel is searched to improve the DSP efficiency. With high DSP utilization and efficiency, compared with the recent work, the throughput can be improved by at most 1.5x. Besides, the compression efficiency will not be affected after the neural network search. Second is the privacy of LIC for machine. The overall framework includes the client side which captures image and the cloud side which conducts the machine vision. To avoid the privacy leakage in the cloud side, feature of captured image is generated at the client side, coded by an LIC autoencoder and sent to the cloud side. The cloud side then decodes the feature and performs the machine vision based on the decoded feature. By optimizing the layer number of feature, we can not only reach a good trade-off between rate for the feature transmission and accuracy for the machine vision, but also avoid leaking the privacy information in the cloud.
現在までの達成度 (区分)	現在までの達成度 (区分) 2: おおむね順調に進展している理由 This year, I have presented one domestic conference at IPSJ. In addition, one paper at IEEE international conference ISCAS and one paper at IEEE Journal on Emerging and Selected Topics in Circuits and Systems have been accepted. The domestic conference and journal paper are about the algorithm-architecture co-optimization of LIC. ISCAS paper is about the privacy of LIC on the usage of machine vision. Therefore, the research is undergoing well as scheduled.
今後の研究の推進方策	There are two plans. First is to realize the variable bitrate (VBR). Currently, multiple LIC models are required to support VBR, which is very memory consuming. To solve this problem, the most efficient way is to add a gain unit which is learnable through the training. However, adding the gain unit cannot realize an effective result on the high bitrate. To solve this problem, we plan to add the gain unit not only in the bottleneck layer, but also in the intermediate layers. Second is to improve the robustness of LIC network. Since the attacker can use FGSM to generate the adversarial samples. The bit consumption will become larger or the reconstruction quality will become worse. To solve this problem, we plan to use adversarial training to include the adversarial samples in the training.
次年度使用額が生じた理由	Last year, I utilized the GPU server I purchased earlier thus I did not buy a new one. This year, I plan to purchase a new GPU server for my lab.

研究成果
(6件)

すべて 2024 2023 その他

すべて国際共同研究 (1件) 雑誌論文 (2件) (うち国際共著 1件、査読あり 2件、オープンアクセス 1件) 学会発表 (3件) (うち国際学会 2件)

[国際共同研究] University of Bristol(英国)
- 国名
  英国
- 外国機関名
  University of Bristol
[雑誌論文] FPGA Codec System of Learned Image Compression with Algorithm-Architecture Co-Optimization2024
- 著者名/発表者名
  Sun Heming、Yi Qingyang、Fujita Masahiro
- 雑誌名
  
  IEEE Journal on Emerging and Selected Topics in Circuits and Systems
  
  巻: 14 ページ: 1～14
- DOI
  10.1109/JETCAS.2024.3386328
- 査読あり
[雑誌論文] Learned Lossless Image Compression With Combined Channel-Conditioning Models and Autoregressive Modules2023
- 著者名/発表者名
  Wang Ran、Liu Jinming、Sun Heming、Katto Jiro
- 雑誌名
  
  IEEE Access
  
  巻: 11 ページ: 73462～73469
- DOI
  10.1109/ACCESS.2023.3291591
- 査読あり / オープンアクセス / 国際共著
[学会発表] Privacy-preserving with Flexible Autoencoder for Video Coding for Machines2024
- 著者名/発表者名
  Aorui Gou, Heming Sun, Xiaoyang Zeng, Yibo Fan
- 学会等名
  IEEE International Symposium on Circuits and Systems (ISCAS)
- 国際学会
[学会発表] Learned Image Compression with Algorithm-Architecture Co-Optimization2023
- 著者名/発表者名
  Heming Sun, Qingyang Yi, Masahiro Fujita
- 学会等名
  IPSJ-SLDM
[学会発表] PTS-LIC: Pruning Threshold Searching for Lightweight Learned Image Compression2023
- 著者名/発表者名
  Ao Luo, Heming Sun, Jinming Liu, Fangzheng Lin, Jiro Katto
- 学会等名
  IEEE International Conference on Visual Communications and Image Professing (VCIP)
- 国際学会

2023 年度 実施状況報告書

学習型静止画像圧縮の実用化に関する研究

研究代表者

孫 鶴鳴 横浜国立大学, 大学院工学研究院, 准教授 (90835886)

現在までの達成度 (区分)

理由

研究成果

[国際共同研究] University of Bristol(英国)

国名

外国機関名

[雑誌論文] FPGA Codec System of Learned Image Compression with Algorithm-Architecture Co-Optimization2024

著者名/発表者名

雑誌名

DOI

[雑誌論文] Learned Lossless Image Compression With Combined Channel-Conditioning Models and Autoregressive Modules2023

著者名/発表者名

雑誌名

DOI

[学会発表] Privacy-preserving with Flexible Autoencoder for Video Coding for Machines2024

著者名/発表者名

学会等名

[学会発表] Learned Image Compression with Algorithm-Architecture Co-Optimization2023

著者名/発表者名

学会等名

[学会発表] PTS-LIC: Pruning Threshold Searching for Lightweight Learned Image Compression2023

著者名/発表者名

学会等名

2023 年度実施状況報告書

孫鶴鳴横浜国立大学, 大学院工学研究院, 准教授 (90835886)