2022 Fiscal Year Annual Research Report

Statistical mechanics approach to high-dimensional machine learning

Planned Research

Project Area	Foundation of "Machine Learning Physics" --- Revolutionary Transformation of Fundamental Physics by A New Field Integrating Machine Learning and Physics
Project/Area Number	22H05117
Research Institution	The University of Tokyo
Principal Investigator	樺島祥介東京大学, 大学院理学系研究科(理学部), 教授 (80260652)
Co-Investigator(Kenkyū-buntansha)	小渕智之京都大学, 情報学研究科, 准教授 (40588448) 吉野元大阪大学, サイバーメディアセンター, 教授 (50335337) 坂田綾香統計数理研究所, 数理・推論研究系, 准教授 (80733071)
Project Period (FY)	2022-06-16 – 2027-03-31
Keywords	レプリカ法 / キャビティ法 / 深層学習 / アルゴリズム開発 / 性能分析
Outline of Annual Research Achievements	課題１：ここまでの研究で、L0正則化付きスパース線形回帰を解くためのマルコフ連鎖モンテカルロ法を用いたアルゴリズムの開発を8割程度完了した。現在，ハイパーパラメータを選択するための規準に関する研究を続けている。また、機械学習における典型的な非凸最適化問題として、混合ガウスモデルに基づく分類問題の理論解析を始めた。半教師有り学習におけるラベルなしデータの効果に関する知見やラベル分布が極端に偏っている場合に汎化性能をどのように高め得るかに関する知見を得ることが目的である．課題２：H. Yoshino (2020)で示したDeep Neural Network(DNN)のレプリカ理論が「密結合」1 << c << N(Nはネットワークの幅、cは個々のパーセプトロンへの入力数）で厳密になることを示した。また教師ー生徒シナリオについて理論・シミュレーションの両面から詳しく解析を行なった。over-pametrizationによってネットワーク中央に液体的な領域が出現するにも関わらず入出力層付近の固体的な領域によって深さL無限大でも汎化性能が維持されることを明らかにした。課題３：グループテストと呼ばれる離散変数のスパース推定に関する研究を行った。これまでdefective itemの数がシステムサイズのsublinearな場合については情報理論的研究が行われているが、本研究ではlinearな場合の研究をおこなった。従来法をこのlinearな場合に適用することは難しいが、我々は統計物理学の方法を導入することで解析できることを示した。解析の結果、linearな場合には変数の完全復元が不可能であることを発見した。そのため、統計的決定理論の枠組みを導入し、変数の状態を最適に決める方法を提案し、その性能を評価した。
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 課題１；当初から予定していた研究であるモンテカルロ法に基づくアルゴリズムの開発をある程度完了した。またそれとは別に新しい研究を始めたが，その研究進捗も順調である．以上から，想定進捗速度を十分満たす状況である。課題２：DNNの統計力学で厳密に解ける模型が得られたことの意義は大きい。「密結合」の極限を念頭にした新たな数値シミュレーションも行い、有意な結果を得ることができた。課題３：グループテストにおける課題や、類似する問題が含む性質が明らかになってきている。研究内容は2本の論文として採択された。学会においても招待講演を含む発表をおこなった。
Strategy for Future Research Activity	課題１：上述のマルコフ連鎖モンテカルロ法を用いたアルゴリズム開発研究を論文化する予定である。開発したアルゴリズムはGithubなどを通じて公開する予定である。混合ガウスモデルに関する理論解析も論文化の目処が立っており、今年度中に論文として投稿できる可能性が高い。それらが済み次第、申請書に記述したRSBAMPの研究へと移行したい。また、深層学習におけるNeural Collapseの発生条件を吟味する理論解析研究を新たに立ち上げることを策定しており、これについても今年度中に一定の成果を得ることを予期している。課題２：引き継き、教師-生徒シナリオに関してmessage passingで学習を行うアルゴリズムの開発を行う。またベイズ最適性を崩した学習シナリオへのレプリカ理論の拡張およびシミュレーションによる解析を行う。また、これまでの教師-生徒シナリオの解析から示唆された学習の空間的不均一性を、MNIST,CIFAR10など現実のデータに基づく学習において調査する研究を行う。課題３：能動学習などの推定方法と組み合わせた場合の解析を引き続きおこなっていくことを予定している。これにより、さらなる推定性能の改善が見込まれる。また大きな計算コストが予想されるため、近似アルゴリズムを用いて実装することも検討している。

Research Products
(24 results)

All 2023 2022 Other

All Int'l Joint Research (1 results) Journal Article (5 results) (of which Int'l Joint Research: 3 results, Peer Reviewed: 5 results, Open Access: 2 results) Presentation (18 results) (of which Int'l Joint Research: 11 results, Invited: 12 results)

[Int'l Joint Research] Zhejiang University UIUC Institute(中国)
- Country Name
  CHINA
- Counterpart Institution
  Zhejiang University UIUC Institute
[Journal Article] Decision Theoretic Cutoff and ROC Analysis for Bayesian Optimal Group Testing2023
- Author(s)
  Ayaka Sakata, Yoshiyuki Kabashima
- Journal Title
  
  IEEE Transaction on Information Theory
  
  Volume: 69 Pages: 5902-5920
- DOI
  10.1109/TIT.2023.3276696
- Peer Reviewed / Open Access
[Journal Article] On Model Selection Consistency of Lasso for High-Dimensional Ising Models2023
- Author(s)
  Xiangming Meng, Tomoyuki Obuchi, Yoshiyuki Kabashima
- Journal Title
  
  Proceedings of Machine Learning Research
  
  Volume: 206 Pages: 6783-6805
- DOI
  10.48550/arXiv.2110.08500
- Peer Reviewed / Int'l Joint Research
[Journal Article] Average case analysis of Lasso under ultra sparse conditions2023
- Author(s)
  Koki Okajima, Xiangming Meng, Takashi Takahashi, Yoshiyuki Kabashima
- Journal Title
  
  Proceedings of Machine Learning Research
  
  Volume: 206 Pages: 11317-11330
- DOI
  10.48550/arXiv.2302.13093
- Peer Reviewed / Int'l Joint Research
[Journal Article] Quantized Compressed Sensing with Score-Based Generative Models2023
- Author(s)
  Xiangming Meng, Yoshiyuki Kabashima
- Journal Title
  
  Proceedings of ICLR 2023
  
  Volume: - Pages: -
- DOI
  10.48550/arXiv.2211.13006
- Peer Reviewed / Int'l Joint Research
[Journal Article] ベイズ的グループテストのカットオフ値評価と ROC 解析2022
- Author(s)
  坂田綾香，樺島祥介
- Journal Title
  
  統計数理
  
  Volume: 70 Pages: 89-114
- Peer Reviewed / Open Access
[Presentation] Spatially Heterogeneous Learning in a Deep Neural Network2023
- Author(s)
  Hajime Yoshino
- Organizer
  Towards a theory of artificial and biological neural networks
- Int'l Joint Research
[Presentation] Random energy model in a pure ferromagnet2023
- Author(s)
  Hajime Yoshino
- Organizer
  Physics of dense and active disordered materials
- Int'l Joint Research / Invited
[Presentation] Statistical inference of an assembly of vectors with a large number of components through their p-body products2023
- Author(s)
  Angelo Giorgio Cavaliere, Riki Nagasawa, Shuta Yokoi, Tomoyuki Obuchi and Hajime Yoshino
- Organizer
  Physics of dense and active disordered materials
- Int'l Joint Research
[Presentation] Spatial evolution of RSB in layered p-spin models2023
- Author(s)
  Yuki Rea Hamano and Hajime Yoshino
- Organizer
  Physics of dense and active disordered materials
- Int'l Joint Research
[Presentation] Decision Theoretic Cutoff and ROC Analysis for Bayesian Optimal Group Testing2023
- Author(s)
  Ayaka Sakata
- Organizer
  Workshop on Functional Inference and Machine Intelligence
- Int'l Joint Research / Invited
[Presentation] Statistical mechanics approach to linear regression2023
- Author(s)
  Yoshiyuki Kabashima
- Organizer
  Workshop on Functional Inference and Machine Intelligence
- Int'l Joint Research / Invited
[Presentation] 拡散モデルに基づく圧縮センシング2023
- Author(s)
  樺島祥介
- Organizer
  公開シンポジウム「データ駆動科学と情報計測の新展開」(DDIMA)
- Invited
[Presentation] On Model Selection Consistency of Lasso for High-Dimensional Ising Models2023
- Author(s)
  Xiangming Meng, Tomoyuki Obuchi, Yoshiyuki Kabashima
- Organizer
  The 26th International Conference on Artificial Intelligence and Statistics (AISTATS)ツ?
- Int'l Joint Research
[Presentation] Average case analysis of Lasso under ultra sparse conditions2023
- Author(s)
  Koki Okajima, Xiangming Meng, Takashi Takahashi, Yoshiyuki Kabashima
- Organizer
  The 26th International Conference on Artificial Intelligence and Statistics (AISTATS)
- Int'l Joint Research
[Presentation] Quantized Compressed Sensing with Score-Based Generative Models2023
- Author(s)
  Xiangming Meng, Yoshiyuki Kabashima
- Organizer
  The 26th International Conference on Artificial Intelligence and Statistics (AISTATS)
- Int'l Joint Research
[Presentation] Statistical Mechanics of a Deep Neural Network2022
- Author(s)
  Hajime Yoshino
- Organizer
  Forum de Physique Statistique a l窶僞cole Normale Superieure
- Int'l Joint Research / Invited
[Presentation] 深層パーセプトロン学習における熱平衡化2022
- Author(s)
  吉野元
- Organizer
  物性研究所スパコン共同利用・CCMS合同研究会「計算物質科学の新展開」
- Invited
[Presentation] 深層学習の統計力学とガラス的な濡れ転移2022
- Author(s)
  吉野元
- Organizer
  非平衡ソフトマター・アモルファス物質の物性解明への力学的自己組織化からの挑戦
- Invited
[Presentation] 深層学習における空間的不均一性2022
- Author(s)
  吉野元
- Organizer
  京都大学理学研究科物理学・宇宙物理学専攻セミナー
- Invited
[Presentation] 深層ニューラルネットワークにおける隠れた多様体模型の解析2022
- Author(s)
  吉野元
- Organizer
  日本物理学会
- Invited
[Presentation] 深層ニューラルネットワークにおけるレプリカ対称性の破れ2022
- Author(s)
  吉野元
- Organizer
  日本物理学会
- Invited
[Presentation] グループテストにおける確率伝搬法と最適カットオフ2022
- Author(s)
  坂田綾香
- Organizer
  離散数学とその応用研究集会
- Invited
[Presentation] Assessing transfer entropy from biochemcal data2022
- Author(s)
  Yoshiyuki Kabashima
- Organizer
  Nobel symposium "Predictability in Science in the age of Big Data
- Int'l Joint Research / Invited

2022 Fiscal Year Annual Research Report

Statistical mechanics approach to high-dimensional machine learning

Principal Investigator

樺島 祥介 東京大学, 大学院理学系研究科(理学部), 教授 (80260652)

Current Status of Research Progress

Reason

Research Products

[Int'l Joint Research] Zhejiang University UIUC Institute(中国)

Country Name

Counterpart Institution

[Journal Article] Decision Theoretic Cutoff and ROC Analysis for Bayesian Optimal Group Testing2023

Author(s)

Journal Title

DOI

[Journal Article] On Model Selection Consistency of Lasso for High-Dimensional Ising Models2023

Author(s)

Journal Title

DOI

[Journal Article] Average case analysis of Lasso under ultra sparse conditions2023

Author(s)

Journal Title

DOI

[Journal Article] Quantized Compressed Sensing with Score-Based Generative Models2023

Author(s)

Journal Title

DOI

[Journal Article] ベイズ的グループテストのカットオフ値評価と ROC 解析2022

Author(s)

Journal Title

[Presentation] Spatially Heterogeneous Learning in a Deep Neural Network2023

Author(s)

Organizer

[Presentation] Random energy model in a pure ferromagnet2023

Author(s)

Organizer

[Presentation] Statistical inference of an assembly of vectors with a large number of components through their p-body products2023

Author(s)

Organizer

[Presentation] Spatial evolution of RSB in layered p-spin models2023

Author(s)

Organizer

[Presentation] Decision Theoretic Cutoff and ROC Analysis for Bayesian Optimal Group Testing2023

Author(s)

Organizer

[Presentation] Statistical mechanics approach to linear regression2023

Author(s)

Organizer

[Presentation] 拡散モデルに基づく圧縮センシング2023

Author(s)

Organizer

[Presentation] On Model Selection Consistency of Lasso for High-Dimensional Ising Models2023

Author(s)

Organizer

[Presentation] Average case analysis of Lasso under ultra sparse conditions2023

Author(s)

Organizer

[Presentation] Quantized Compressed Sensing with Score-Based Generative Models2023

Author(s)

Organizer

[Presentation] Statistical Mechanics of a Deep Neural Network2022

Author(s)

Organizer

[Presentation] 深層パーセプトロン学習における熱平衡化2022

Author(s)

Organizer

[Presentation] 深層学習の統計力学とガラス的な濡れ転移2022

Author(s)

Organizer

[Presentation] 深層学習における空間的不均一性2022

Author(s)

Organizer

[Presentation] 深層ニューラルネットワークにおける隠れた多様体模型の解析2022

Author(s)

Organizer

[Presentation] 深層ニューラルネットワークにおけるレプリカ対称性の破れ2022

Author(s)

Organizer

[Presentation] グループテストにおける確率伝搬法と最適カットオフ2022

Author(s)

Organizer

樺島祥介東京大学, 大学院理学系研究科(理学部), 教授 (80260652)