2023 Fiscal Year Final Research Report

Statistical Learning with feature extraction and information integration of High-dimensional, large-scale, multi-domain data

Research Project

PDF

Project/Area Number	19H04071
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	一般
Review Section	Basic Section 60030:Statistical science-related
Research Institution	Tokyo Institute of Technology
Principal Investigator	Kanamori Takafumi 東京工業大学, 情報理工学院, 教授 (60334546)
Co-Investigator(Kenkyū-buntansha)	熊谷亘東京大学, 大学院工学系研究科(工学部), 特任助教 (20747167) 竹之内高志政策研究大学院大学, 政策研究科, 教授 (50403340) 松井孝太名古屋大学, 医学系研究科, 講師 (50737111) 川島孝行東京工業大学, 情報理工学院, 助教 (60846210) 武田朗子東京大学, 大学院情報理工学系研究科, 教授 (80361799)
Project Period (FY)	2019-04-01 – 2024-03-31
Keywords	AI / データサイエンス
Outline of Final Research Achievements	This study aims to construct a framework for statistical learning using high-dimensional and large-scale multi-domain data. In the era of big data, diverse and complex data with different sizes, dimensions, and representation formats can be collected across various domains. However, there exists a paradox wherein the relationships between domains are often unclear, leading to a knowledge deficit as data volume increases. To overcome this, it is crucial to extract and integrate features of data while considering the inter-domain relationships. This research focuses on formalizing this task with a focus on inter-domain relationships. It seeks to develop modeling techniques and machine learning algorithms for multi-domain data with heterogeneous structures, aiming to advance the theoretical understanding in this field.
Free Research Field	機械学習
Academic Significance and Societal Importance of the Research Achievements	本研究では，異なるデータサイズ，次元，タイプなどの多様なデータを活用し，予測，推論，構造推定など複数のタスクを行う学習アルゴリズムを，数学的な知見に基づいて提案，開発する．理論的解析により予測精度向上のためのパラメーター調整などが容易になり，飛躍的な性能向上が期待できる．理論的知見に基づくアルゴリズムの実装により，画像，音声，タグその他の情報を含むヘテロなデータからの関連性分析などの精度が大きく向上し，機械学習システムの安全性や信頼性を高める基盤を提供する。