2022 Fiscal Year Research-status Report

深層統計モデルによる科学的仮説検証のための非漸近推測理論の開発

Research Project

Project/Area Number	21K11780
Research Institution	The University of Tokyo
Principal Investigator	今泉允聡東京大学, 大学院総合文化研究科, 准教授 (90814088)
Project Period (FY)	2021-04-01 – 2024-03-31
Keywords	深層学習 / 統計学 / 統計的推論
Outline of Annual Research Achievements	本研究計画の目的は、深層ニューラルネットワークなどの多層構造や大自由度を持つ統計的モデルに対して、データに依存する不確実性を評価し、推定・予測の誤差を評価したり統計的推論の手法を構成することである。具体的には、ニューラルネットワークの関数表現能力を解析するとともに、データや学習アルゴリズムが持つランダムさ（不確実性）が推定や予測に与える影響を解析する。本年の研究実績は、主に二つに分けられる。一つは層が浅いが非常に多いパラメータを持つモデル（ニューラルネットワーク含む）に対する推定・推論性能の導出、もう一つは層が多いニューラルネットワークに対する推定性能の導出、の2点である。一つ目の層が浅いがパラメータが多いモデルについて、複数の応用的な設定において、学習によって得られるモデルの推定誤差や推論のための漸近分布をおこなった。具体的には、従属性を保つデータのための線形モデル、因果推論に用いられるモーメント制約や内生性を持つモデル、関数データなどの無限次元性を持つモデル、最適輸送を用いた統計モデルのような二重の最適化問題で定義される関数モデルについて、その推定誤差や漸近分布を導出した。これらの応用的研究は、近年発展している過剰パラメータ理論などをより幅広い統計モデルで使えることを示すものである。これらの研究はいくつかの学術雑誌に採択され、またいくつかは査読中である。二つ目の層が多いモデルについては、損失関数の形状に基づく予測誤差（汎化誤差）の特徴付けをおこなった。具体的には、非凸な期待損失関数が局所最適解の近傍で良い性質を持っているときに、学習アルゴリズムがその解の近傍に滞留しかつ良い予測性能を持つことを示した。これは、層の数やパラメータが大きいときにもそれらの影響が予測誤差に直接は影響しないことを示している。この研究は権威ある学術雑誌に採択された。
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 本研究計画は、まず層の浅いモデルで推定や推論を行うという目的については、想定以上に順調に進展していると考えられる。共同研究者や研究室の学生との連携が円滑に進行していることもあり、複数のプロジェクトで既存理論を拡張・応用する研究が進展しており、複数の学会発表や論文公開が実践できている。またそれらのプロジェクトを経ることで、研究室としても知識の蓄積や研究体制の整備を進められている。この体制は来年以降も継続できる見込みである。層の多いモデルの推定・推論については、問題が高度に非自明になるため可視化できる成果はそれほど多くないが、それでも複数の研究方針が確立され、水面化で順調に議論や証明などが進行している。詳細は次項目の「今後の研究の推進方策」に述べるが、層の多いモデルを解析するための理論を複数開発しており、こちらも順調に進展していると考えられる。
Strategy for Future Research Activity	今後の研究課題も、これまで同様以下の二つで進行する。一つは、層の少なくパラメータの数が非常に多いモデルのための理論の応用・拡張である。近年はこの分野の理論がさらに発展しているため、より幅広い統計モデルや推定方法について、新規性のある理論を導出することが可能である。例えばほぼ無限個のパラメータがある統計モデルのためのベイズ推定・推論や、非線形構造を内部にもつセミパラメトリックな統計モデルのための統計的推論手法の開発である。これらは研究室の学生及び共同研究者を中心に成果が出始めており、年内に論文を公開することが可能である。もう一つは、層が多いモデルのための統計的な記述と推論手法の開発である。この領域はまだ確立された基礎理論がないため、どのような方針で推論を行うかが自明でなく、一つ目と比べると困難な問題である。これに対して、いくつかのガウス近似法や推論手法が応用できる可能性が明らかになったため、今後はこれらの理論を深層モデルに応用して推論を行う方法を開発する。
Causes of Carryover	購入する物品の金額などの関係で、微細な金額が次年度仕様になった。次年度においては、旅費や物品購入などに用いる。

Research Products
(42 results)

All 2023 2022 Other

All Int'l Joint Research (2 results) Journal Article (8 results) (of which Int'l Joint Research: 3 results, Peer Reviewed: 8 results, Open Access: 8 results) Presentation (28 results) (of which Int'l Joint Research: 10 results, Invited: 12 results) Book (2 results) Remarks (2 results)

[Int'l Joint Research] Twenty University(オランダ)
- Country Name
  NETHERLANDS
- Counterpart Institution
  Twenty University
[Int'l Joint Research] Max Planck Institute(ドイツ)
- Country Name
  GERMANY
- Counterpart Institution
  Max Planck Institute
[Journal Article] Unified Perspective on Probability Divergence via the Density-Ratio Likelihood: Bridging KL-Divergence and Integral Probability Metrics2023
- Author(s)
  Masahiro Kato, Masaaki Imaizumi, Kentaro Minami
- Journal Title
  
  Proceedings of Machine Learning Research
  
  Volume: 206 Pages: 5271--5298
- Peer Reviewed / Open Access
[Journal Article] On Generalization Bounds for Deep Networks Based on Loss Surface Implicit Regularization2023
- Author(s)
  Masaaki Imaizumi, Johannes Schmidt-Hieber
- Journal Title
  
  IEEE Transactions on Information Theory
  
  Volume: 69 Pages: 1203～1223
- DOI
  10.1109/TIT.2022.3215088
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Instrumental variable regression via kernel maximum moment loss2023
- Author(s)
  Zhang Rui、Imaizumi Masaaki、Sch?lkopf Bernhard、Muandet Krikamol
- Journal Title
  
  Journal of Causal Inference
  
  Volume: 11 Pages: none
- DOI
  10.1515/jci-2022-0073
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Fast Convergence on Perfect Classification for Functional Data2022
- Author(s)
  Tomoya Wakayama, Masaaki Imaizumi
- Journal Title
  
  Statistica Sinica
  
  Volume: to appear Pages: to appear
- Peer Reviewed / Open Access
[Journal Article] Inference for Projection-Based Wasserstein Distances on Finite Spaces2022
- Author(s)
  Ryo Okano, Masaaki Imaizumi
- Journal Title
  
  Statistica Sinica
  
  Volume: to appear Pages: to appear
- Peer Reviewed / Open Access
[Journal Article] Advantage of Deep Neural Networks for Estimating Functions with Singularity on Hypersurfaces2022
- Author(s)
  Masaaki Imaizumi, Kenji Fukumizu
- Journal Title
  
  Journal of Machine Learning Research
  
  Volume: 23 Pages: 1-54
- Peer Reviewed / Open Access
[Journal Article] 超曲面上に特異性を持つ関数推定における深層ニューラルネットワークの優位性2022
- Author(s)
  今泉允聡
- Journal Title
  
  日本統計学会誌
  
  Volume: 52 Pages: 33～51
- DOI
  10.11329/jjssj.52.33
- Peer Reviewed / Open Access
[Journal Article] Learning Causal Models from Conditional Moment Restrictions by Importance Weighting2022
- Author(s)
  Masahiro Kato, Masaaki Imaizumi, Kenichiro McAlinn, Shota Yasui, Haruo Kakehi
- Journal Title
  
  International Conference on Learning Representations
  
  Volume: to appear Pages: none
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] 高次元ガウス近似によるWasserstein距離推定の不確実性評価2022
- Author(s)
  今泉允聡
- Organizer
  Workshop OT 2023
- Invited
[Presentation] High-Dimensional Estimators: Universality and Non-Linearity2022
- Author(s)
  Masaaki Imaizumi
- Organizer
  Workshop on Functional Inference and Machine Intelligence
- Int'l Joint Research / Invited
[Presentation] フィルターが小さい深層CNNの最適近似レート2022
- Author(s)
  佐藤佑真、今泉允聡
- Organizer
  情報論的学習理論と機械学習研究会（IBISML）
[Presentation] 深層学習と過剰パラメータの理論2022
- Author(s)
  今泉允聡
- Organizer
  フォレストワークショップ2023
- Invited
[Presentation] Why does SGD prefer flat minima?: Through the lens of dynamical systems2022
- Author(s)
  Hikaru Ibayashi, Masaaki Imaizumi
- Organizer
  AAAI When Machine Learning meets Dynamical Systems: Theory and Applications
- Int'l Joint Research
[Presentation] 深層学習と過剰パラメータの理論2022
- Author(s)
  今泉允聡
- Organizer
  国立精神・神経医療研究センターの脳病態数理・データ科学セミナーシリーズ
- Invited
[Presentation] Best Arm Identification with a Fixed Budget under a Small Gap2022
- Author(s)
  M.Kato, K.Ariu, M.Imaizumi, M.Uehara, M.Nomura
- Organizer
  2023 ASA Annual Meeting
- Int'l Joint Research
[Presentation] High-dimensional asymptotics for single-index models via approximate message passing2022
- Author(s)
  Y. Uematsu, K. Sawaya, M. Imaizumi
- Organizer
  CMStatistics
- Int'l Joint Research / Invited
[Presentation] Sup-norm convergence of deep network estimator for nonparametric regression with corrected adversarial training2022
- Author(s)
  M.Imaizumi
- Organizer
  CMStatistics
- Int'l Joint Research / Invited
[Presentation] Theory of Deep Learning and Overparmeterization2022
- Author(s)
  M.Imaizumi
- Organizer
  Online Asian Machine Learning School, Asian Conference on Machine Learning
- Int'l Joint Research / Invited
[Presentation] Hypothesis Test and Confidence Analysis with Wasserstein Distance on General Dimension2022
- Author(s)
  M.Imaizumi
- Organizer
  EcoSta
- Int'l Joint Research / Invited
[Presentation] Benign overfitting in stochastic regression2022
- Author(s)
  S.Nakakita, M.Imaizumi
- Organizer
  EcoSta
- Int'l Joint Research / Invited
[Presentation] Semiparametric Contextual Best Arm Identification with a Fixed Budget2022
- Author(s)
  加藤真大, 今泉允聡, 石原卓弥, 北川透
- Organizer
  IBIS2022
[Presentation] Dynamics of Deep Neural Network: A Functional and Diffusion Process Approach2022
- Author(s)
  西澤渉, 今泉允聡
- Organizer
  IBIS2022
[Presentation] 非スパースな高次元漸近論の理論と応用2022
- Author(s)
  今泉允聡
- Organizer
  大規模複雑データの理論と方法論～新たな発展と関連分野への応用～
[Presentation] メカニズムとの学際的統合による新しい分散学習理論基盤の構築2022
- Author(s)
  今泉允聡
- Organizer
  IPSJ連続セミナー2022「その先へ　情報技術が貢献できること」
- Invited
[Presentation] High-Dimensional Asymptotics of Semiparametric Generalized Linear Models via Approximate Message Passing2022
- Author(s)
  澤谷一磨、植松良公、今泉允聡
- Organizer
  統計関連学会連合大会
[Presentation] 操作変数による非スパース高次元な線形回帰モデルの良性過適合2022
- Author(s)
  津田俊樹、今泉允聡
- Organizer
  統計関連学会連合大会
[Presentation] 多変量ガウス分布間回帰2022
- Author(s)
  岡野遼、今泉允聡
- Organizer
  統計関連学会連合大会
[Presentation] 補正付敵対的訓練による深層ニューラルネットワーク推定量のSupノルム収束2022
- Author(s)
  今泉允聡
- Organizer
  統計関連学会連合大会
[Presentation] Benign-Overfitting of Overparameterized Bayesian Linear Regression2022
- Author(s)
  若山智哉、今泉允聡
- Organizer
  統計関連学会連合大会
[Presentation] 線形時系列モデルにおける良性過適合2022
- Author(s)
  仲北祥悟、
- Organizer
  統計関連学会連合大会
[Presentation] 深層学習の原理を明らかにする理論の試み2022
- Author(s)
  今泉允聡
- Organizer
  電子情報通信学会Webinarチュートリアルシリーズ
- Invited
[Presentation] 深層学習の原理に挑む理論の試み2022
- Author(s)
  今泉允聡
- Organizer
  第46回IBISML研究会
[Presentation] 深層学習の原理を明らかにする理論の試み2022
- Author(s)
  Masaaki Imaizumi
- Organizer
  JFFoS
- Invited
[Presentation] 深層学習の原理記述に向けた構造汎化理論スキームの開発2022
- Author(s)
  今泉允聡
- Organizer
  JST創発的研究支援事業「融合の場」第１回公開シンポジウム
[Presentation] Benign Overfitting in Overparameterized Time Series Models2022
- Author(s)
  Shogo Nakakita, Masaaki Imaizumi
- Organizer
  Workshop on the Theory of Overparameterized Machine Learning
- Int'l Joint Research
[Presentation] Benign Overfitting in Conditional Average Treatment Effect Prediction with Linear Regression2022
- Author(s)
  Masahiro Kato, Masaaki Imaizumi
- Organizer
  Workshop on the Theory of Overparameterized Machine Learning
- Int'l Joint Research
[Book] 応用基礎としてのデータサイエンス　AI×データ活用の実践2023
- Author(s)
  北川源四郎、竹村彰通、赤穂昭太郎、今泉允聡、内田誠一、清智也、高野渉、辻真吾、原尚幸、久野遼平、松原仁、宮地充子、森畑明昌、宿久洋
- Total Pages
  384
- Publisher
  講談社
- ISBN
  4065307899
[Book] ニューラルネットワークとディープラーニング2022
- Author(s)
  Charu C. Aggarwal、竹村　彰通、今泉　允聡、李　鍾賛、今井　貴史、今井　徹、紅林　亘、齋藤　邦彦、健山　智子、寺田　裕、西出　俊、西出　亮
- Total Pages
  520
- Publisher
  学術図書出版社
- ISBN
  4780607140
[Remarks] 深層学習の原理を説明する新理論――ニューラルネットワークのエネルギー曲面上の滞留現象
- URL
  https://www.u-tokyo.ac.jp/focus/ja/press/z0109_00065.html
[Remarks] 深層学習が優位性を発揮する特異データ空間の存在を証明
- URL
  https://research-er.jp/articles/view/111826

2022 Fiscal Year Research-status Report

深層統計モデルによる科学的仮説検証のための非漸近推測理論の開発

Principal Investigator

今泉 允聡 東京大学, 大学院総合文化研究科, 准教授 (90814088)

Current Status of Research Progress

Reason

Research Products

[Int'l Joint Research] Twenty University(オランダ)

Country Name

Counterpart Institution

[Int'l Joint Research] Max Planck Institute(ドイツ)

Country Name

Counterpart Institution

[Journal Article] Unified Perspective on Probability Divergence via the Density-Ratio Likelihood: Bridging KL-Divergence and Integral Probability Metrics2023

Author(s)

Journal Title

[Journal Article] On Generalization Bounds for Deep Networks Based on Loss Surface Implicit Regularization2023

Author(s)

Journal Title

DOI

[Journal Article] Instrumental variable regression via kernel maximum moment loss2023

Author(s)

Journal Title

DOI

[Journal Article] Fast Convergence on Perfect Classification for Functional Data2022

Author(s)

Journal Title

[Journal Article] Inference for Projection-Based Wasserstein Distances on Finite Spaces2022

Author(s)

Journal Title

[Journal Article] Advantage of Deep Neural Networks for Estimating Functions with Singularity on Hypersurfaces2022

Author(s)

Journal Title

[Journal Article] 超曲面上に特異性を持つ関数推定における深層ニューラルネットワークの優位性2022

Author(s)

Journal Title

DOI

[Journal Article] Learning Causal Models from Conditional Moment Restrictions by Importance Weighting2022

Author(s)

Journal Title

[Presentation] 高次元ガウス近似によるWasserstein距離推定の不確実性評価2022

Author(s)

Organizer

[Presentation] High-Dimensional Estimators: Universality and Non-Linearity2022

Author(s)

Organizer

[Presentation] フィルターが小さい深層CNNの最適近似レート2022

Author(s)

Organizer

[Presentation] 深層学習と過剰パラメータの理論2022

Author(s)

Organizer

[Presentation] Why does SGD prefer flat minima?: Through the lens of dynamical systems2022

Author(s)

Organizer

[Presentation] 深層学習と過剰パラメータの理論2022

Author(s)

Organizer

[Presentation] Best Arm Identification with a Fixed Budget under a Small Gap2022

Author(s)

Organizer

[Presentation] High-dimensional asymptotics for single-index models via approximate message passing2022

Author(s)

Organizer

[Presentation] Sup-norm convergence of deep network estimator for nonparametric regression with corrected adversarial training2022

Author(s)

Organizer

[Presentation] Theory of Deep Learning and Overparmeterization2022

Author(s)

Organizer

[Presentation] Hypothesis Test and Confidence Analysis with Wasserstein Distance on General Dimension2022

Author(s)

Organizer

[Presentation] Benign overfitting in stochastic regression2022

Author(s)

Organizer

[Presentation] Semiparametric Contextual Best Arm Identification with a Fixed Budget2022

Author(s)

Organizer

[Presentation] Dynamics of Deep Neural Network: A Functional and Diffusion Process Approach2022

今泉允聡東京大学, 大学院総合文化研究科, 准教授 (90814088)

[Book] 応用基礎としてのデータサイエンス　AI×データ活用の実践2023