2023 Fiscal Year Research-status Report

Effective models constructed by neural networks with symmetries

Research Project

Project/Area Number	22K03539
Research Institution	The University of Tokyo
Principal Investigator	永井佑紀東京大学, 情報基盤センター, 准教授 (20587026)
Co-Investigator(Kenkyū-buntansha)	富谷昭夫東京女子大学, 現代教養学部, 講師 (50837185)
Project Period (FY)	2022-04-01 – 2025-03-31
Keywords	機械学習 / 自己学習モンテカルロ法 / Transformer / ニューラルネットワーク
Outline of Annual Research Achievements	近年、生成AIと呼ばれる大規模言語モデルが大きく性能を伸ばし、さまざまな分野へと波及している。この大規模言語モデルの基本ネットワークアーキテクチャはTransformerであり、そのビルディングブロックはAttention層である。言語モデルにおいて非常に高い性能を上げているこれらのアーキテクチャは、物理系のモデルにおいても同様に高い性能が得られる可能性がある。一方で、大規模言語モデルは数十億以上のパラメータを持ち、訓練のためには大規模な計算機資源が必要であり、かつ、推論でも高性能な計算資源が必要であるため、物理系のシミュレーションを高速化するためにはそのままでは困難があると想定された。そこで、本科研費のテーマである「対称性」をネットワーク構造に持たせることで、Transformerネットワークのパラメータ数を劇的に減らすことを試みた。スピン系においてこの方法はうまくいっており、現在論文を投稿中である。
Current Status of Research Progress	Current Status of Research Progress 1: Research has progressed more than it was originally planned. Reason Transformerアーキテクチャに物理系の対称性を取り込む方法を発見し、最新の機械学習手法である大規模言語モデルと同等のネットワークアーキテクチャの構成に成功したため。
Strategy for Future Research Activity	系の物理的対称性を保ったTransformerとAttention機構を組み込んだニューラルネットワークが出来上がったので、この方法がさまざまな物理系で使えるかどうかを調べ、大規模言語モデルのような高い性能を持つかどうかを調べる。
Causes of Carryover	GPUを用いてニューラルネットワークを訓練と推論を行うため、最新のGPUを搭載した計算機をR6年度の前半に導入する予定である。計算機の購入のために次年度使用額が0より大きくなっている。

Research Products
(11 results)

All 2024 2023

All Journal Article (3 results) (of which Peer Reviewed: 3 results) Presentation (8 results) (of which Int'l Joint Research: 3 results, Invited: 3 results)

[Journal Article] High-temperature atomic diffusion and specific heat in quasicrystals2024
- Author(s)
  Yuki Nagai, Yutaka Iwasaki, Koichi Kitahara, Yoshiki Takagiwa, Kaoru Kimura, and Motoyuki Shiga
- Journal Title
  
  Physical Review Letters
  
  Volume: 132 Pages: 196301-1,6
- DOI
  10.1103/PhysRevLett.132.196301
- Peer Reviewed
[Journal Article] Equivariant transformer is all you need2023
- Author(s)
  Akio Tomiya, Yuki Nagai
- Journal Title
  
  PoS LATTICE2023
  
  Volume: - Pages: 001
- Peer Reviewed
[Journal Article] Sparse modeling approach to extract spectral functions with covariance of Euclidean-time correlators of lattice QCD2023
- Author(s)
  Junichi Takahashi, Hiroshi Ohno, Akio Tomiya
- Journal Title
  
  PoS LATTICE2023
  
  Volume: - Pages: 028
- Peer Reviewed
[Presentation] Application of Julia in Particle Physics -Toward Large-Scale Computations of Lattice QCD-2023
- Author(s)
  Akio Tomiya
- Organizer
  20 June 2023, Julia in mathematics and physics
[Presentation] Self-learning Monte Carlo2023
- Author(s)
  Akio Tomiya
- Organizer
  Kick-off Meeting for the Accelerated Program in Fugaku
[Presentation] Development of machine learning methods for simulation of quantum theory of lattice fields2023
- Author(s)
  Akio Tomiya
- Organizer
  HPCI Computational Science Forum
[Presentation] Advances in Lattice QCD with Machine Learning2023
- Author(s)
  Akio Tomiya
- Organizer
  JPS Meeting (Symposium)
- Invited
[Presentation] Flow based sampling for 3- and 4-dim. model2023
- Author(s)
  Akio Tomiya
- Organizer
  The Future is non-perturbative
- Int'l Joint Research / Invited
[Presentation] Machine learning and Lattice QCD2023
- Author(s)
  Akio Tomiya
- Organizer
  Challenges and opportunities in Lattice QCD simulations and related fields
- Int'l Joint Research / Invited
[Presentation] Atomic diffusion due to hyperatomic fluctuation for quasicrystals and their approximants2023
- Author(s)
  Yuki Nagai
- Organizer
  International conference on complex orders in condensed matter: aperiodic order, local order, electronic order, hidden order
- Int'l Joint Research
[Presentation] Juliaによる科学技術計算:大規模並列計算について2023
- Author(s)
  永井佑紀
- Organizer
  数学と物理におけるJuliaの活用

2023 Fiscal Year Research-status Report

Effective models constructed by neural networks with symmetries

Principal Investigator

永井 佑紀 東京大学, 情報基盤センター, 准教授 (20587026)

Current Status of Research Progress

Reason

Research Products

[Journal Article] High-temperature atomic diffusion and specific heat in quasicrystals2024

Author(s)

Journal Title

DOI

[Journal Article] Equivariant transformer is all you need2023

Author(s)

Journal Title

[Journal Article] Sparse modeling approach to extract spectral functions with covariance of Euclidean-time correlators of lattice QCD2023

Author(s)

Journal Title

[Presentation] Application of Julia in Particle Physics -Toward Large-Scale Computations of Lattice QCD-2023

Author(s)

Organizer

[Presentation] Self-learning Monte Carlo2023

Author(s)

Organizer

[Presentation] Development of machine learning methods for simulation of quantum theory of lattice fields2023

Author(s)

Organizer

[Presentation] Advances in Lattice QCD with Machine Learning2023

Author(s)

Organizer

[Presentation] Flow based sampling for 3- and 4-dim. model2023

Author(s)

Organizer

[Presentation] Machine learning and Lattice QCD2023

Author(s)

Organizer

[Presentation] Atomic diffusion due to hyperatomic fluctuation for quasicrystals and their approximants2023

Author(s)

Organizer

[Presentation] Juliaによる科学技術計算:大規模並列計算について2023

Author(s)

Organizer

永井佑紀東京大学, 情報基盤センター, 准教授 (20587026)