2022 Fiscal Year Annual Research Report

Development of Multimodal Data Retrieval Engine Based on Human Cognitive System

Research Project

Project/Area Number	19H04172
Research Institution	Osaka Gakuin University
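Principal Investigator	Kuniaki Uehara, Osaka Gakuin University, Faculty of Business Administration, Professor (60160206)
Co-Investigator(Kenkyū-buntansha)	Kimiaki Shirahama, Kindai University, Faculty of Informatics, Associate Professor (30467675); Takashi Matsubara, Osaka University, Graduate School of Engineering Science, Associate Professor (70756197)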
Project Period (FY)	2019-04-01 – 2023-03-31
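Keywords	Multimodal data / Deep learning / Machine learning
Outline of Annual Research Achievements	To represent semantic ambiguity in a data-driven multimodal data retrieval engine, we developed embeddings based on probability distribution functions. An entity with an ambiguous meaning admits multiple interpretations, so its meaning has a large spread and, conversely, it does not match any specific entity precisely. In terms of probability distributions, this corresponds to a large variance, which makes the pseudo-distance to other distributions large. Giving embeddings this property causes the variance of the embeddings of ambiguous entities to grow automatically during data-driven training, which in effect discards unnecessary entities. This makes it possible to evaluate the reliability of multimodal data retrieval and to detect unnecessary data that adversely affect retrieval; these results were published as a journal article. We also developed, among others, a method for identifying trajectories along which the meaning of an image changes continuously and a method for detecting anomalies using the ambiguity of data, and these were accepted at international conferences. In addition, we developed a method that progressively associates the words and phrases represented by a phrase-structure tree, obtained by parsing a search query, with regions in a frame. Specifically, starting from the words represented by the leaf nodes of the phrase-structure tree, the results of associating the words (or phrases) of lower nodes with regions are propagated to the association between the phrases of upper nodes and regions. Furthermore, to extract the visual features of regions, we replaced the Bottom-up attention model used so far with VinVL, a model trained on substantially larger data and built on a more advanced network architecture, which greatly improved retrieval accuracy. We then participated in TRECVID 2022, a worldwide video retrieval competition, and achieved the fourth-highest retrieval accuracy among all seven participating research institutions. Finally, we published in international journals studies that apply the method developed in this project for associating heterogeneous data, namely text (words and phrases) and images (regions), to the association between emotion and music and to the association between values at different time points in time-series data.
Research Progress Status	Not entered because FY2022 (Reiwa 4) is the final year of the project.
Strategy for Future Research Activity	Not entered because FY2022 (Reiwa 4) is the final year of the project.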
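
The outline above states that an ambiguous entity corresponds to a distribution with a large variance, and that this makes its pseudo-distance to other distributions large. The following is a minimal sketch of that idea only, not the project's implementation. It assumes diagonal Gaussian embeddings and uses the Kullback-Leibler divergence as the pseudo-distance; the report does not say which distribution family or divergence was used, and the function name and the toy embeddings are hypothetical.

```python
import math


def kl_diag_gaussians(mu_p, var_p, mu_q, var_q):
    """KL(P || Q) between two Gaussians with diagonal covariance.

    Assumption: KL divergence stands in for the "pseudo-distance"
    mentioned in the report; the actual measure is not specified there.
    """
    total = 0.0
    for mp, vp, mq, vq in zip(mu_p, var_p, mu_q, var_q):
        total += 0.5 * (vp / vq + (mq - mp) ** 2 / vq - 1.0 + math.log(vq / vp))
    return total


# Hypothetical 2-D embeddings as (mean, variance) pairs sharing the same mean.
query = ([0.0, 0.0], [0.1, 0.1])      # a specific (narrow) entity
specific = ([0.0, 0.0], [0.1, 0.1])   # another narrow entity at the same place
ambiguous = ([0.0, 0.0], [2.0, 2.0])  # same mean, much larger variance

d_specific = kl_diag_gaussians(*specific, *query)
d_ambiguous = kl_diag_gaussians(*ambiguous, *query)

# The wide (ambiguous) embedding is farther from the narrow query
# even though the two means coincide.
print(d_specific < d_ambiguous)
```

Under these assumptions, the variance alone separates the ambiguous embedding from the query, which is the property the outline relies on for flagging unreliable matches.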

Research Products
(27 results)


Journal Article (4 results) (of which Int'l Joint Research: 2 results, Peer Reviewed: 4 results, Open Access: 3 results); Presentation (23 results) (of which Int'l Joint Research: 8 results)

[Journal Article] Embedding-based Music Emotion Recognition Using Composite Loss (2023)
- Author(s)
  Naoki Takashima, Frederic Li, Marcin Grzegorzek and Kimiaki Shirahama
- Journal Title
  IEEE Access
  Volume: 11 Pages: 36579-36604
- DOI
  10.1109/ACCESS.2023.3265807
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Sleep Stage Classification in Children Using Self-Attention and Gaussian Noise Data Augmentation (2023)
- Author(s)
  Xinyu Huang, Kimiaki Shirahama, Muhammad Tausif Irshad, Muhammad Adeel Nisar, Artur Piet and Marcin Grzegorzek
- Journal Title
  Sensors
  Volume: 23 Pages: -
- DOI
  10.3390/s23073446
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Multi-Modal Entity Alignment Using Uncertainty Quantification for Modality Importance (2023)
- Author(s)
  Kenta Hama and Takashi Matsubara
- Journal Title
  IEEE Access
  Volume: 11 Pages: 28479-28489
- DOI
  10.1109/ACCESS.2023.3259987
- Peer Reviewed / Open Access
[Journal Article] Topology-Aware Flow-Based Point Cloud Generation (2023)
- Author(s)
  Takumi Kimura, Takashi Matsubara, and Kuniaki Uehara
- Journal Title
  IEEE Transactions on Circuits and Systems for Video Technology
  Volume: 32 Pages: 7967-7982
- DOI
  10.1109/TCSVT.2022.3181212
- Peer Reviewed
[Presentation] Deep Curvilinear Editing: Commutative and Nonlinear Image Manipulation for Pretrained Deep Generative Model (2023)
- Author(s)
  Takehiro Aoshima, Takashi Matsubara
- Organizer
  The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023
- Int'l Joint Research
[Presentation] Inverse Heat Dissipation Model for Image Segmentation (2023)
- Author(s)
  Yu Kashihara, Takashi Matsubara
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2023)
- Int'l Joint Research
[Presentation] Learning Attribute Curvilinear Coordinates for Pretrained Deep Generative Model (2023)
- Author(s)
  Takehiro Aoshima, Takashi Matsubara
- Organizer
  RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2023)
- Int'l Joint Research
[Presentation] Domain Adaptation for Japanese Sentence Embedding Models with Contrastive Learning (2023)
- Author(s)
  Zihao Chen, Hisashi Handa and Kimiaki Shirahama
- Organizer
  IEICE General Conference 2023
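[Presentation] Commutative and Nonlinear Image Editing for Deep Generative Models (2023)
- Author(s)
  Takehiro Aoshima, Takashi Matsubara
- Organizer
  Annual Conference of the Japanese Society for Artificial Intelligence (JSAI2023)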
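[Presentation] Distance-Equivariant Convolution for Segmentation of Images Containing Distance Information (2023)
- Author(s)
  丸茂英敬, Takashi Matsubara
- Organizer
  The 67th Annual Conference of the Institute of Systems, Control and Information Engineers (SCI2023)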
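[Presentation] Medical Image Segmentation Using an Inverse Heat Dissipation Model (2023)
- Author(s)
  Yu Kashihara, Takashi Matsubara
- Organizer
  IEICE Technical Committee on Complex Communication Sciences (CCS)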
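[Presentation] Learning Commutative Attribute Vector Fields in the Latent Space of Deep Generative Models (2023)
- Author(s)
  Takehiro Aoshima, Takashi Matsubara
- Organizer
  IEICE Technical Committee on Complex Communication Sciences (CCS)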
[Presentation] Kindai University, Osaka Gakuin University and Osaka University at TRECVID 2022 AVS Task (2022)
- Author(s)
  Kimiaki Shirahama, Kazuma Fujioka, Taichi Shinno, Takashi Matsubara and Kuniaki Uehara
- Organizer
  Proc. of TREC Video Retrieval Evaluation (TRECVID) 2022
- Int'l Joint Research
[Presentation] Application of Denoising Image Restoration to Anomaly Detection (2022)
- Author(s)
  Yu Kashihara, Takashi Matsubara
- Organizer
  International Symposium on Nonlinear Theory and Its Applications (NOLTA2022)
- Int'l Joint Research
[Presentation] Common Space Learning with Gaussian Embedding for Multi-Modal Entity Alignment (2022)
- Author(s)
  Kenta Hama, Takashi Matsubara
- Organizer
  International Symposium on Nonlinear Theory and Its Applications (NOLTA2022)
- Int'l Joint Research
[Presentation] Nonlinear and Commutative Editing in Pretrained GAN Latent Space (2022)
- Author(s)
  Takehiro Aoshima, Takashi Matsubara
- Organizer
  NeurIPS 2022 Workshop on Symmetry and Geometry in Neural Representations
- Int'l Joint Research
[Presentation] Toward Human Cognition-inspired High-Level Decision Making For Hierarchical Reinforcement Learning Agents (2022)
- Author(s)
  Rousslan Fernand Julien Dossa, Takashi Matsubara
- Organizer
  Proceedings of the Decision Awareness in Reinforcement Learning Workshop at the International Conference on Machine Learning 2022
- Int'l Joint Research
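[Presentation] Facial Expression Generation and Emotion Factor Extraction Using StyleGAN3 (2022)
- Author(s)
  水野翔太, Kimiaki Shirahama
- Organizer
  SICE Systems and Information Division Annual Conference 2022 (SSI 2022)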
[Presentation] Retrieval of Similar Questions from QAbot Data based on Transformer Language Model (2022)
- Author(s)
  Zihao Chen, Hisashi Handa and Kimiaki Shirahama
- Organizer
  Technical Meeting on Educational Technology (教育工学研究会)
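[Presentation] Learning Nonlinear Attribute Coordinate Systems in the Latent Space of GANs (2022)
- Author(s)
  Takehiro Aoshima, Takashi Matsubara
- Organizer
  The 25th Information-Based Induction Sciences Workshop (IBIS2022)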
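[Presentation] Multimodal Knowledge Graph Embedding Considering the Importance of Supplementary Information (2022)
- Author(s)
  Kenta Hama, Takashi Matsubara
- Organizer
  The 25th Information-Based Induction Sciences Workshop (IBIS2022)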
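[Presentation] Learning Attribute Vector Fields in the Semantic Space of GANs (2022)
- Author(s)
  Takehiro Aoshima, Takashi Matsubara
- Organizer
  IEICE Technical Committee on Information-Based Induction Sciences and Machine Learning (IBISML)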
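[Presentation] Anomaly Detection with Diffusion Models Without Using Diffusion (2022)
- Author(s)
  Yu Kashihara, Takashi Matsubara
- Organizer
  Annual Conference of the Japanese Society for Artificial Intelligence (JSAI2022)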
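[Presentation] Common Space Learning for Multimodal Knowledge Graphs Using Probability Distributions (2022)
- Author(s)
  Kenta Hama, Takashi Matsubara
- Organizer
  Annual Conference of the Japanese Society for Artificial Intelligence (JSAI2022)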
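[Presentation] Distance-Equivariant Convolution for Segmentation of LiDAR Point Clouds (2022)
- Author(s)
  丸茂英敬, Takashi Matsubara
- Organizer
  Annual Conference of the Japanese Society for Artificial Intelligence (JSAI2022)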
[Presentation] Anomaly Detection Using Diffusion Models Without Diffusion (2022)
- Author(s)
  Yu Kashihara, Takashi Matsubara
- Organizer
  IEICE NOLTA Society Conference 2022
[Presentation] Distance-Equivariant Convolution for Spherically Projected LiDAR Point Clouds (2022)
- Author(s)
  丸茂英敬, Takashi Matsubara
- Organizer
  IEICE NOLTA Society Conference 2022
