2022 Fiscal Year Research-status Report

Advancing Deep Learning Using Information Leaks

Research Project

Project/Area Number	21K11971
Research Institution	Meijo University
Principal Investigator	Kazuhiro Hotta, Meijo University, Faculty of Science and Technology, Professor (40345426)
Project Period (FY)	2021-04-01 – 2024-03-31
Keywords	Deep learning / Feedback / Information leak / Segmentation / Transformer
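Outline of Annual Research Achievements	We are pursuing a range of studies from the viewpoint of information leaks. For example, we proposed a method that performs semantic segmentation by feeding the output of a Deep Neural Network back to the input side and running inference again. In addition, 3D CNNs are commonly used for video recognition, and we proposed a method that feeds information back inside such a network; we expect this to make more effective use of time-series information. When performing semantic segmentation on time-series images, images that are temporally close to a labeled image can be segmented with high accuracy. Exploiting this property, we also proposed a method that assigns pseudo-labels in temporal order starting from the labeled image and trains while reusing them as training data. This can be regarded as a framework that leaks information along the time axis. Segmentation accuracy also tends to degrade as the number of classes increases. We therefore proposed a self-distillation method that trains Deep Neural Networks to discriminate between one class and all others, and improves multi-class segmentation accuracy by distilling knowledge from them. This can be described as an information leak from the two-class classifiers. Finally, because many recent reports demonstrate the effectiveness of the Transformer, we are also improving the Transformer that will serve as our base model going forward. Images have no clear notion of words as natural language does, so the Vision Transformer treats local regions cut out of the input image as if they were words. To improve on this, we treated clusters of local regions from the training images as words and incorporated them into the Vision Transformer. This can be regarded as an information leak from Visual Words to the input image.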
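The temporal pseudo-labeling idea described in the outline can be illustrated with a minimal sketch. This is not the project's implementation: the nearest-centroid "model", the 1-D toy frames, and the names `fit_centroids`, `predict`, and `propagate_pseudo_labels` are all assumptions introduced here for illustration, and visiting frames on both sides of the labeled one is also an assumption. The sketch only shows the ordering: frames are pseudo-labeled starting from the labeled frame and moving outward in time, and each newly pseudo-labeled frame is reused as training data before the next frame is processed.

```python
from typing import Dict, List, Sequence, Tuple

Frame = Sequence[float]  # toy stand-in for an image: one feature value per pixel
Labels = List[int]       # one class label per pixel


def fit_centroids(frames: List[Frame], labels: List[Labels]) -> Dict[int, float]:
    """Toy 'training' step (assumption): mean pixel value per class."""
    sums: Dict[int, float] = {}
    counts: Dict[int, int] = {}
    for frame, frame_labels in zip(frames, labels):
        for value, cls in zip(frame, frame_labels):
            sums[cls] = sums.get(cls, 0.0) + value
            counts[cls] = counts.get(cls, 0) + 1
    return {cls: sums[cls] / counts[cls] for cls in sums}


def predict(centroids: Dict[int, float], frame: Frame) -> Labels:
    """Toy 'segmentation' step (assumption): nearest class centroid per pixel."""
    return [min(centroids, key=lambda c: abs(centroids[c] - v)) for v in frame]


def propagate_pseudo_labels(
    frames: List[Frame], labeled_index: int, labels: Labels
) -> Tuple[Dict[int, Labels], List[int]]:
    """Pseudo-label frames outward in time from the single labeled frame.

    After each frame is pseudo-labeled, it is added to the training set and
    the model is refit, so information spreads step by step along the
    time axis.
    """
    train_frames: List[Frame] = [frames[labeled_index]]
    train_labels: List[Labels] = [list(labels)]
    model = fit_centroids(train_frames, train_labels)
    pseudo: Dict[int, Labels] = {labeled_index: list(labels)}

    # Frames closest in time to the labeled frame are processed first.
    order = sorted(
        (i for i in range(len(frames)) if i != labeled_index),
        key=lambda i: abs(i - labeled_index),
    )
    for i in order:
        pseudo[i] = predict(model, frames[i])
        train_frames.append(frames[i])
        train_labels.append(pseudo[i])
        model = fit_centroids(train_frames, train_labels)  # reuse pseudo-labels
    return pseudo, order
```

In this toy setting, a sequence whose brightness drifts gradually over time can still be labeled consistently, because the centroids are updated after every frame rather than being fixed at the labeled frame. A real system would replace the centroid model with a segmentation network and would likely filter pseudo-labels by confidence.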
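Current Status of Research Progress	2: Research has progressed on the whole more than it was originally planned. Reason: We have proposed various methods for a range of image recognition problems from the viewpoint of information leaks and have confirmed their effectiveness. On this basis, the research can be said to be progressing largely as planned.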
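Strategy for Future Research Activity	We will continue to approach image recognition problems such as image classification, semantic segmentation, and video recognition from the viewpoint of information leaks. Because many recent reports show the effectiveness of Transformer-based methods, we will further improve the Transformer from the same viewpoint.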
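Causes of Carryover	Our work was accepted at an international conference, but because the COVID-19 pandemic had not yet subsided, we presented online. As a result, the travel budget went unused and a balance remained. Since in-person international conferences are the norm this year, we will use the balance for travel expenses.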

Research Products
(14 results)

Journal Article (1 result; of which Peer Reviewed: 1, Open Access: 1) / Presentation (13 results; of which Int'l Joint Research: 6)

[Journal Article] Cell image segmentation by using feedback and convolutional LSTM (2022)
- Author(s)
  Eisuke Shibuya, Kazuhiro Hotta
- Journal Title
  The Visual Computer
  Volume: 38, Pages: 3791-3801
- DOI
  10.1007/s00371-021-02221-3
- Peer Reviewed / Open Access
[Presentation] Shuffle Mixing: An Efficient Alternative to Self Attention (2023)
- Author(s)
  R.Furukawa and K.Hotta
- Organizer
  18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
- Int'l Joint Research
[Presentation] Improvement of Vision Transformer Using Word Patches (2023)
- Author(s)
  A.Takama, S.Kato, S.Kamiya, and K.Hotta
- Organizer
  18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
- Int'l Joint Research
[Presentation] Semantic Segmentation by Semi-Supervised Learning Using Time Series Constraint (2023)
- Author(s)
  T.Mano, S.Kato and K.Hotta
- Organizer
  18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
- Int'l Joint Research
[Presentation] Class-wise Knowledge Distillation for Lightweight Segmentation Model (2023)
- Author(s)
  R.Ikedo, K.Nagata, and K.Hotta
- Organizer
  16th International Conference on Bio-inspired Systems and Signal Processing
- Int'l Joint Research
[Presentation] LOCAL EMBEDDING FOR AXIAL ATTENTION (2023)
- Author(s)
  R.Furukawa and K.Hotta
- Organizer
  IEEE International Conference on Image Processing
- Int'l Joint Research
[Presentation] Predicting Human Behavior Using 3D Loop ResNet (2023)
- Author(s)
  Y.Kakamu and K.Hotta
- Organizer
  International Conference on Pattern Recognition
- Int'l Joint Research
[Presentation] Shuffle Mixing: An Efficient Alternative to Self-Attention (2022)
- Author(s)
  古川諒一, 堀田一弘
- Organizer
  Meeting on Image Recognition and Understanding (MIRU)
[Presentation] Semantic Segmentation by Semi-Supervised Learning Using Time-Series Constraints (2022)
- Author(s)
  真野嵩大, 堀田一弘
- Organizer
  Meeting on Image Recognition and Understanding (MIRU)
[Presentation] Improving the Accuracy of Vision Transformer Using Object Queries (2022)
- Author(s)
  髙間斐斗, 堀田一弘
- Organizer
  Meeting on Image Recognition and Understanding (MIRU)
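[Presentation] Mitigating Catastrophic Forgetting and Strengthening Learning in Continual Learning Using Contrastive Learning (2022)
- Author(s)
  永田耕太郎, 堀田一弘
- Organizer
  Meeting on Image Recognition and Understanding (MIRU)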
[Presentation] Improving Semantic Segmentation Accuracy by Self-Distillation (2022)
- Author(s)
  安藤嵩将, 堀田一弘
- Organizer
  Meeting on Image Recognition and Understanding (MIRU)
[Presentation] Semantic Segmentation by Self-Distillation Using an Attention Mechanism (2022)
- Author(s)
  池戸僚汰, 永田耕太郎, 堀田一弘
- Organizer
  Meeting on Image Recognition and Understanding (MIRU)
[Presentation] Local Embedding for Axial Attention (2022)
- Author(s)
  古川諒一, 堀田一弘
- Organizer
  Symposium on Sensing via Image Information (SSII)
