2022 Fiscal Year Research-status Report
Development of learning subspace-based methods for pattern recognition
Project/Area Number |
22K17960
|
Research Institution | National Institute of Advanced Industrial Science and Technology |
Principal Investigator |
SALESDESOUZA LINCON, National Institute of Advanced Industrial Science and Technology, Department of Information Technology and Human Factors, Researcher (40912481)
|
Project Period (FY) |
2022-04-01 – 2026-03-31
|
Keywords | Subspace learning / Deep neural networks / Manifold optimization |
Outline of Annual Research Achievements |
In fiscal year 2022, we worked to address the problem that traditional deep neural network frameworks process the images of a set independently, without considering the underlying feature distribution and the variance of the images within the set. To overcome this limitation, we devised a new subspace learning method called the Grassmannian learning mutual subspace method (G-LMSM), a neural network layer that can be integrated into deep neural networks. G-LMSM maps the image set to a low-dimensional input subspace representation, which is then matched against dictionary subspaces using a similarity based on their canonical angles, an interpretable and computationally efficient measure. The key idea of G-LMSM is to learn the dictionary subspaces as points on the Grassmann manifold, a smooth, non-linear manifold that captures the geometric structure of subspaces. This learning is optimized with Riemannian stochastic gradient descent, which is stable, efficient, and theoretically well-grounded. The proposed method was evaluated on three tasks: hand shape recognition, face identification, and facial emotion recognition. Our experimental results showed that G-LMSM outperformed state-of-the-art methods on all three tasks, demonstrating its potential to improve the performance of deep frameworks for object recognition from image sets.
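The canonical-angle matching step described above can be illustrated with a minimal PyTorch sketch. This is not the published G-LMSM implementation: the dictionary bases here are re-orthonormalized with a QR step inside an ordinary layer rather than being updated directly on the Grassmann manifold with Riemannian stochastic gradient descent, and the class name and all dimensions (feature size, subspace dimension, number of dictionary subspaces) are hypothetical choices for the example.

import torch
import torch.nn as nn

class SubspaceSimilarityLayer(nn.Module):
    """Sketch of a subspace-matching layer in the spirit of G-LMSM.

    Dictionary subspaces are stored as unconstrained matrices and
    orthonormalized with QR on the fly; the method described in the
    report instead treats them as points on the Grassmann manifold
    and optimizes them with Riemannian SGD.
    """

    def __init__(self, feature_dim: int, subspace_dim: int, num_dicts: int):
        super().__init__()
        # Learnable dictionary bases: (num_dicts, feature_dim, subspace_dim).
        self.dict_bases = nn.Parameter(
            torch.randn(num_dicts, feature_dim, subspace_dim) * 0.1
        )

    def forward(self, image_set: torch.Tensor) -> torch.Tensor:
        # image_set: (num_images, feature_dim) feature vectors of one set.
        # Input subspace basis U via thin SVD of the set.
        U, _, _ = torch.linalg.svd(image_set.T, full_matrices=False)
        U = U[:, : self.dict_bases.shape[-1]]      # (feature_dim, subspace_dim)

        # Orthonormalize each dictionary basis with a batched QR.
        Q, _ = torch.linalg.qr(self.dict_bases)    # (num_dicts, feature_dim, subspace_dim)

        # Singular values of U^T Q_k are the cosines of the canonical angles
        # between the input subspace and the k-th dictionary subspace.
        M = torch.einsum("fd,kfe->kde", U, Q)      # (num_dicts, subspace_dim, subspace_dim)
        cosines = torch.linalg.svdvals(M)          # (num_dicts, subspace_dim)

        # Similarity = mean squared cosine of the canonical angles.
        return (cosines ** 2).mean(dim=-1)         # (num_dicts,)

if __name__ == "__main__":
    layer = SubspaceSimilarityLayer(feature_dim=64, subspace_dim=5, num_dicts=10)
    image_set = torch.randn(20, 64)                # 20 images, 64-dim features
    print(layer(image_set).shape)                  # torch.Size([10])

Replacing the QR reparameterization with an optimizer that updates the dictionary bases directly on the Grassmann (or Stiefel) manifold would recover the Riemannian stochastic gradient descent scheme described in the report.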
|
Current Status of Research Progress |
2: Research has progressed on the whole more than it was originally planned.
Reason
We were able to combine subspace learning with deep neural networks to improve performance on image set recognition tasks.
|
Strategy for Future Research Activity |
We will investigate new ways to combine subspace learning and deep neural networks that address the limitations of each approach and further improve performance on image set recognition.
|
Causes of Carryover |
Reason: The carryover is intended to cover cloud computing costs in the next fiscal year and conference attendance. Plan: We plan to purchase cloud computing resources capable of processing large-scale data at high speed and to attend conferences to gather information on the latest technologies and/or present our research.
|