• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Construction of numerical linear algebra based on lattice H-matrices and its high-performance implementation on modern architectures

Research Project

Project/Area Number 21H03447
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Review Section Basic Section 60090:High performance computing-related
Research InstitutionJapan Agency for Marine-Earth Science and Technology

Principal Investigator

Ida Akihiro  国立研究開発法人海洋研究開発機構, 付加価値情報創生部門(地球情報科学技術センター), 主任研究員 (80742121)

Co-Investigator(Kenkyū-buntansha) 横田 理央  東京工業大学, 学術国際情報センター, 教授 (20760573)
塙 敏博  東京大学, 情報基盤センター, 教授 (30308283)
岩下 武史  京都大学, 学術情報メディアセンター, 教授 (30324685)
大島 聡史  九州大学, 情報基盤研究開発センター, 准教授 (40570081)
星野 哲也  名古屋大学, 情報基盤センター, 准教授 (40775946)
平石 拓  京都橘大学, 工学部, 専任講師 (60528222)
Project Period (FY) 2021-04-01 – 2024-03-31
Project Status Completed (Fiscal Year 2023)
Budget Amount *help
¥17,290,000 (Direct Cost: ¥13,300,000、Indirect Cost: ¥3,990,000)
Fiscal Year 2023: ¥5,980,000 (Direct Cost: ¥4,600,000、Indirect Cost: ¥1,380,000)
Fiscal Year 2022: ¥5,980,000 (Direct Cost: ¥4,600,000、Indirect Cost: ¥1,380,000)
Fiscal Year 2021: ¥5,330,000 (Direct Cost: ¥4,100,000、Indirect Cost: ¥1,230,000)
Keywords近似計算 / 低ランク / H行列法 / 数値線形代数 / 行列近似 / 高性能計算 / 並列計算 / アルゴリズム / 混合精度演算
Outline of Research at the Start

本研究では、科学技術計算およびデータ科学計算に必要な数値線形代数の行列演算について、格子H行列に基づき実行するアルゴリズム群を開発する。
格子H行列は、研究代表者らが提案した低ランク構造行列法の一種であり、従来型のH行列では困難なアルゴリズムの開発を見通し良く行うことができる。計算に必要な密行列演算を格子H行列による近似演算に置き換え、計算機メモリと演算量の大幅な削減により、計算の大規模化・高速化を実現する。
アルゴリズム開発は、最新の計算機アーキテクチャで構成されるクラスタ計算機を意識して行う。実装に際しては、混合精度演算・動的負荷分散などを含め最適な手法を研究する。

Outline of Final Research Achievements

We conducted research and development aimed at constructing a numerical linear algebra system based on the lattice H-matrix. We proposed an algorithm to calculate all eigenvalues for the BLR matrix, a special case of the lattice H-matrix. Research on high-performance implementation of the lattice H-matrix method was carried out. By adding efficient work-stealing functions to task parallelization languages, we successfully improved the computational performance of H-matrix partitioning and low-rank structured matrix generation on distributed memory parallel computers. Furthermore, we developed an H-matrix-vector multiplication computation method that achieves over 85% of the theoretical limit performance on computing nodes using various latest CPU architectures. Additionally, we developed a method for fast QR decomposition of BLR matrices using the MIG feature of the latest GPUs.

Academic Significance and Societal Importance of the Research Achievements

格子H行列法は、大規模な科学技術計算を少ない計算量とメモリ使用量で実地するための手法であり、多くの実問題(超電導解析、地震波動解析、マイクロマグネティクス計算など)で使用されている。本研究では、格子H行列法の理論的枠組みを拡張し、最新計算機アーキテクチャの性能を引出す数値計算アルゴリズムを開発した。この成果は、計算科学の理論的枠組みを拡張した学術的意義に加え、実応用シミュレーションを大規模化・高速化する社会的意義も大きい。

Report

(4 results)
  • 2023 Annual Research Report   Final Research Report ( PDF )
  • 2022 Annual Research Report
  • 2021 Annual Research Report
  • Research Products

    (51 results)

All 2024 2023 2022 2021 Other

All Journal Article (22 results) (of which Int'l Joint Research: 3 results,  Peer Reviewed: 13 results,  Open Access: 6 results) Presentation (28 results) (of which Int'l Joint Research: 15 results,  Invited: 6 results) Remarks (1 results)

  • [Journal Article] Optimize Efficiency of Utilizing Systems by Dynamic Core Binding2024

    • Author(s)
      Kawai Masatoshi、Ida Akihiro、Hanawa Toshihiro、Hoshino Tetsuya
    • Journal Title

      Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops

      Volume: - Pages: 77-82

    • DOI

      10.1145/3636480.3637221

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Algebraic Partition Construction Method for Hierarchical Matrices2024

    • Author(s)
      Ida Akihiro
    • Journal Title

      IEEE Transactions on Magnetics

      Volume: 60 Issue: 3 Pages: 1-4

    • DOI

      10.1109/tmag.2023.3305333

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed
  • [Journal Article] An inherently parallel H2-ULV factorization for solving dense linear systems on GPUs2024

    • Author(s)
      Ma Qianxiang、Yokota Rio
    • Journal Title

      The International Journal of High Performance Computing Applications

      Volume: 1 Issue: 4 Pages: 1-10

    • DOI

      10.1177/10943420241242021

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed
  • [Journal Article] QR Factorization of?Block Low-Rank Matrices on Multi-instance GPU2023

    • Author(s)
      Ohshima Satoshi、Ida Akihiro、Yokota Rio、Yamazaki Ichitaro
    • Journal Title

      In Proceedings of the 23rd International Conference on Parallel and Distributed Computing, Applications and Technologies

      Volume: 1 Pages: 359-369

    • DOI

      10.1007/978-3-031-29927-8_28

    • ISBN
      9783031299261, 9783031299278
    • Related Report
      2023 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Dynamic Core Binding for Load Balancing of Applications Parallelized with MPI/OpenMP2023

    • Author(s)
      Kawai Masatoshi、Ida Akihiro、Hanawa Toshihiro、Nakajima Kengo
    • Journal Title

      Proc. of Intl. Conf. on Computational Sci. 2023

      Volume: - Pages: 378-394

    • DOI

      10.1007/978-3-031-36024-4_30

    • ISBN
      9783031360237, 9783031360244
    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Hierarchical Matrix Calculation for FPGA using SYCL2023

    • Author(s)
      Yu Yijie, Toshihiro Hanawa
    • Journal Title

      情報処理学会研究報告

      Volume: 2023-HPC-192(34) Pages: 1-7

    • Related Report
      2023 Annual Research Report
  • [Journal Article] CUDA Fortran+MIG+UVMを用いたBLR行列QR分解の大規模高速化2023

    • Author(s)
      大島聡史, 伊田明弘, 河合直聡, 横田理央, Ichitaro Yamazaki
    • Journal Title

      情報処理学会 研究報告

      Volume: 2023-HPC-190 Pages: 1-8

    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Journal Article] Computing the k-th Eigenvalue of Symmetric H2-Matrices2023

    • Author(s)
      Apriansyah M. Ridwan、Yokota Rio
    • Journal Title

      International Conference on Parallel Processing (ICPP)

      Volume: N/A Pages: 1-10

    • DOI

      10.1145/3605573.3605607

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed
  • [Journal Article] O(N) distributed direct factorization of structured dense matrices using runtime systems.2023

    • Author(s)
      Deshmukh Sameer、Yokota Rio、Bosilca George、Ma Qinxiang
    • Journal Title

      International Conference on Parallel Processing (ICPP)

      Volume: N/A Pages: 1-10

    • DOI

      10.1145/3605573.3605606

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Subspace Correction Preconditioning for Solving a Sequence of Asymmetric Linear Systems Using the Bi-CGSTAB Method2023

    • Author(s)
      Tamori Hirotoshi、Fukaya Takeshi、Iwashita Takeshi
    • Journal Title

      Journal of Information Processing

      Volume: 31 Issue: 0 Pages: 875-884

    • DOI

      10.2197/ipsjjip.31.875

    • ISSN
      1882-6652
    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Solving Block Low-Rank Matrix Eigenvalue Problems2022

    • Author(s)
      Ida Akihiro
    • Journal Title

      Journal of Information Processing

      Volume: 30 Issue: 0 Pages: 538-551

    • DOI

      10.2197/ipsjjip.30.538

    • ISSN
      1882-6652
    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Optimizations of H-matrix-vector Multiplication for Modern Multi-core Processors2022

    • Author(s)
      Hoshino Tetsuya、Ida Akihiro、Hanawa Toshihiro
    • Journal Title

      2022 IEEE International Conference on Cluster Computing (CLUSTER)

      Volume: 2022 Pages: 462-472

    • DOI

      10.1109/cluster51413.2022.00056

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Parallelization of Matrix Partitioning in Hierarchical Matrix Construction on Distributed Memory Systems2022

    • Author(s)
      Zhengyang Bai, Tasuku Hiraishi, Akihiro Ida, Masahiro Yasugi
    • Journal Title

      Journal of Information Processing

      Volume: 30 Issue: 0 Pages: 742-754

    • DOI

      10.2197/ipsjjip.30.742

    • ISSN
      1882-6652
    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Large-scale earthquake sequence simulations on 3-D non-planar faults using the boundary element method accelerated by lattice H-matrices2022

    • Author(s)
      Ozawa So、Ida Akihiro、Hoshino Tetsuya、Ando Ryosuke
    • Journal Title

      Geophysical Journal International

      Volume: 232 Issue: 3 Pages: 1471-1481

    • DOI

      10.1093/gji/ggac386

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Construction of Hierarchical Matrix on Distributed Memory Systems using a Task Parallel Language2022

    • Author(s)
      Bai Zhengyang、Hiraishi Tasuku、Ida Akihiro、Yasugi Masahiro、Fukazawa Keiichiro
    • Journal Title

      In Proceedings of 2022 The Tenth International Symposium on Computing and Networking Workshops (CANDARW)

      Volume: 1 Pages: 48-54

    • DOI

      10.1109/candarw57323.2022.00058

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 対称ブロック低ランク行列の固有値問題解法2022

    • Author(s)
      伊田明弘
    • Journal Title

      SWoPP2022(HPC研究分科会)

      Volume: 2022-HPC-185 Pages: 1-8

    • Related Report
      2022 Annual Research Report
  • [Journal Article] 対称ブロック低ランク行列の精度保証付き固有値問題解法2022

    • Author(s)
      伊田 明弘、荻田 武史、 横田 理央
    • Journal Title

      日本応用数理学会(2022年度年会)

      Volume: 1 Pages: 1-2

    • Related Report
      2022 Annual Research Report
  • [Journal Article] Fortran標準規格do concurrentを用いたGPUオフローディング手法の評価2022

    • Author(s)
      星野 哲也, 河合 直聡, 三木 洋平, 塙 敏博, 中島 研吾
    • Journal Title

      情報処理学会研究報告(Web)

      Volume: 2022-HPC-183 Pages: 1-8

    • Related Report
      2021 Annual Research Report
  • [Journal Article] 近似逆行列前処理における前処理行列生成部の簡略化とスレッド並列化2022

    • Author(s)
      鈴木謙吾, 深谷猛, 岩下武史
    • Journal Title

      情報処理学会研究報告(Web)

      Volume: 2021-HPC-182 Pages: 1-8

    • Related Report
      2021 Annual Research Report
  • [Journal Article] A64FXにおける階層型行列演算の性能評価2021

    • Author(s)
      星野哲也 , 伊田明弘 , 塙敏博
    • Journal Title

      情報処理学会研究報告(Web)

      Volume: 2021-HPC-180 Pages: 1-8

    • Related Report
      2021 Annual Research Report
  • [Journal Article] Large-scale earthquake sequence simulations of 3D geometrically complex faults using the boundary element method accelerated by lattice H-matrices on distributed memory computer systems2021

    • Author(s)
      So Ozawa, Akihiro Ida, Tetsuya Hoshino, Ryosuke Ando
    • Journal Title

      arXiv preprint

      Volume: - Pages: 1-26

    • Related Report
      2021 Annual Research Report
    • Open Access
  • [Journal Article] ブロック低ランク行列の近似固有値計算2021

    • Author(s)
      伊田明弘
    • Journal Title

      日本応用数理学会2021年年会講演予稿集

      Volume: - Pages: 1-2

    • Related Report
      2021 Annual Research Report
  • [Presentation] OpenACCを用いた地震シミュレーションのGPU並列化2024

    • Author(s)
      百武尚輝,星野哲也,小澤 創,伊田明弘,安藤亮輔,河合直聡,永井 亨,片桐孝洋
    • Organizer
      情報処理学会 第86回全国大会
    • Related Report
      2023 Annual Research Report
  • [Presentation] Considering multi process calculations on current GPU2024

    • Author(s)
      Satoshi Ohshima
    • Organizer
      ATAT in HPSC 2024
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Hierarchical Matrix Calculation for FPGA using SYCL2023

    • Author(s)
      Yu Yijie, Toshihiro Hanawa
    • Organizer
      報処理学会ハイパフォーマンスコンピューティング研究会
    • Related Report
      2023 Annual Research Report
  • [Presentation] Optimizations of H-matrix-vector Multiplication for Modern Multi-core Processors2023

    • Author(s)
      Tetsuya Hoshino, Akihiro Ida, Toshihiro Hanawa
    • Organizer
      日本地球惑星科学連合大会(JpGU)2023
    • Related Report
      2023 Annual Research Report
    • Invited
  • [Presentation] Optimizations of H-matrix-vector Multiplication for Modern Multi-core Processors2023

    • Author(s)
      Tetsuya Hoshino, Akihiro Ida, Toshihiro Hanawa
    • Organizer
      ICIAM 2023 (International Council for Industrial and Applied Mathematics 2023)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Dynamic Rupture Simulation Using FDP Method Accelerated by Lattice H-matrices2023

    • Author(s)
      Takumi Miyajima, Ryosuke Ando, Akihiro Ida
    • Organizer
      ICIAM 2023 (International Council for Industrial and Applied Mathematics 2023)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Acceleration of dynamic rupture simulation using FDP-LH matrices2023

    • Author(s)
      宮嶌巧光、安藤亮輔、伊田明弘
    • Organizer
      日本地球惑星科学連合大会(JpGU)2023
    • Related Report
      2023 Annual Research Report
  • [Presentation] 時空間境界積分方程式法の高速化手法FDP=Lattice H行列法の近似精度評価2023

    • Author(s)
      宮嶌 拓光、安藤 亮輔、伊田 明弘
    • Organizer
      日本地震学会 2023 年度秋季大会
    • Related Report
      2023 Annual Research Report
  • [Presentation] Acceleration of the Spatiotemporal Boundary Integral Equation Method Using the FDP=Lattice H Matrices Method2023

    • Author(s)
      Takumi Miyajima、Ryosuke Ando, Akihiro Ida
    • Organizer
      Advancing Earth and Space Sciences 2023(AGU23)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] 一万計算コア超時代のGPUに向けたプログラム最適化と自動チューニングを考える2023

    • Author(s)
      大島 聡史、伊田 明弘、横田 理央、Yamazaki Ichitaro
    • Organizer
      第15回 自動チューニング技術の現状と応用に関するシンポジウム(ATTA2023)
    • Related Report
      2023 Annual Research Report
  • [Presentation] QR Factorization of Block Low-rank Matrices on Multiple-/Multi-Instance GPUs2023

    • Author(s)
      Satoshi Ohshima
    • Organizer
      2024 Conference on Advanced Topics and Auto Tuning in High-Performance Scientific Computing
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] BLR-QR on GPU:マルチインスタンスGPUを用いた多数の小密行列計算の高速化2022

    • Author(s)
      大島聡史,伊田明弘,横田理央,Yamazaki Ichitaro
    • Organizer
      第14回 自動チューニング技術の現状と応用に関するシンポジウム(ATTA2022)
    • Related Report
      2022 Annual Research Report
  • [Presentation] High-resolution Simulation of Earthquake Recurrence Enabled by Optimization for Multi-core CPUs and Large-scale Parallelization2022

    • Author(s)
      Ryosuke Ando, So Ozawa, Akihiro Ida, Tetsuya Hoshino, Kazunori Muramatsu, Ryoya Matsushima, Masatoshi Kawai, Toshihiro Hanawa
    • Organizer
      The International Conference for High Performance Computing, Networking, Storage, and Analysis 2023 (SC23)
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research
  • [Presentation] CholeskyQRとBCGS2による非縦長行列のQR分解2022

    • Author(s)
      門倉陣之介, 深谷猛, 岩下武史
    • Organizer
      The 6th cross-disciplinary Workshop on Computing Systems, Infrastructures, and Programming (xSIG 2022)
    • Related Report
      2022 Annual Research Report
  • [Presentation] A new version of AINV preconditioning simplified by using nonzero element positions of a coefficient matrix2022

    • Author(s)
      Kengo Suzuki, Takeshi Fukaya and Takeshi Iwashita
    • Organizer
      15th World Congress on Computational Mechanics 8th Asian Pacific Congress on Compuational Mechanics (WCCM-APCOM YOKOHAMA2022)
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Parallelization of Matrix Partitioning in Hierarchical Matrix Construction Using Node-aware Work Stealing2022

    • Author(s)
      Zhengyang Bai、Tasuku Hiraishi、Akihiro Ida、Masahiro Yasugi
    • Organizer
      情報処理学会 第138回プログラミング研究発表会
    • Related Report
      2021 Annual Research Report
  • [Presentation] Numerical Evaluation of Dynamic Core Binding Library with H-matrix Application2022

    • Author(s)
      Masatoshi Kawai, Akihiro Ida
    • Organizer
      2022 Conference on Advanced Topics and Auto Tuning in High-Performance Scientific Computing
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Optimizations of Lattice H-matrix-vector Multiplication for Modern Supercomputers2022

    • Author(s)
      Tetsuya Hoshino, Akihiro Ida, Toshihiro Hanawa
    • Organizer
      SIAM Conference on Parallel Processing for Scientific Computing 2022 (SIAM PP 22)
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Optimizations of H-matrix-vector Multiplication for A64FX2022

    • Author(s)
      Tetsuya Hoshino, Akihiro Ida, Toshihiro Hanawa
    • Organizer
      2022 Conference on Advanced Topics and Auto Tuning in High-Performance Scientific Computing (ATAT in HPSC 2022)
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Acceleration of O(N) Solvers for Large Dense Matrices, Conference on Advanced Topics and Auto Tuning in High Performance Scientific Computing2022

    • Author(s)
      Sameer Deshmukh
    • Organizer
      2022 Conference on Advanced Topics and Auto Tuning in High-Performance Scientific Computing (ATAT in HPSC 2022)
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Parallel QR Factorization of Block Low-rank Matrices2022

    • Author(s)
      Muhammad Ridwan Apriansyah
    • Organizer
      2022 Conference on Advanced Topics and Auto Tuning in High-Performance Scientific Computing (ATAT in HPSC 2022)
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] AIterative Refinement with Hierarchical Low-rank Preconditioners Using Mixed Precision2022

    • Author(s)
      Thomas Spendlhofer
    • Organizer
      Conference on Advanced Topics and Auto Tuning in High Performance Scientific Computing
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Offloading Integer GMRES Method to Accelerators2022

    • Author(s)
      Yijie Yu, Toshihiro Hanawa
    • Organizer
      HPC Asia 2022
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Presentation] QR Factorization of Block Low-rank Matrices on Multi-Instance GPU2022

    • Author(s)
      Satoshi Ohshima
    • Organizer
      Conference on Advanced Topics and Auto Tuning in High Performance Scientific Computing
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] 格子H行列法とFDP法による3D弾性波動伝播の高速計算2021

    • Author(s)
      伊田明弘、安藤亮輔、佐藤大裕、小澤創、星野哲也
    • Organizer
      SWoPP2021(「行列・固有値問題の解法とその応用」研究部会)
    • Related Report
      2021 Annual Research Report
  • [Presentation] 格子H行列を用いた3次元非平面断層の超大規模地震サイクルシミュレーション2021

    • Author(s)
      小澤創、伊田明弘、星野哲也、安藤亮輔
    • Organizer
      日本地震学会(2021年度秋季大会)
    • Related Report
      2021 Annual Research Report
  • [Presentation] 最近のマルチコアCPU環境における疎行列ベクトル積の性能に関する一考察2021

    • Author(s)
      深谷猛, 岩下武史, 中島浩
    • Organizer
      SWoPP2021(「行列・固有値問題の解法とその応用」研究部会)
    • Related Report
      2021 Annual Research Report
  • [Presentation] GPU に適した近似逆行列前処理の簡略化手法2021

    • Author(s)
      鈴木謙吾, 深谷猛, 岩下武史
    • Organizer
      日本応用数理学会(2021年度年会)
    • Related Report
      2021 Annual Research Report
  • [Remarks] HACApK code

    • URL

      https://github.com/Post-Peta-Crest/ppOpenHPC/tree/MATH/HACApK

    • Related Report
      2023 Annual Research Report 2022 Annual Research Report 2021 Annual Research Report

URL: 

Published: 2021-04-28   Modified: 2025-01-30  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi