High-performance computing and data analysis support leveraging unused cores

Research Project

Project/Area Number	20H00580
Research Category	Grant-in-Aid for Scientific Research (A)
Allocation Type	Single-year Grants
Section	一般
Review Section	Medium-sized Section 60:Information science, computer engineering, and related fields
Research Institution	The University of Tokyo
Principal Investigator	Hanawa Toshihiro 東京大学, 情報基盤センター, 教授 (30308283)
Co-Investigator(Kenkyū-buntansha)	下川辺隆史東京大学, 情報基盤センター, 准教授 (40636049) 星野哲也名古屋大学, 情報基盤センター, 准教授 (40775946) 三木洋平東京大学, 情報基盤センター, 助教 (70734375) 伊田明弘国立研究開発法人海洋研究開発機構, 付加価値情報創生部門(地球情報科学技術センター), 主任研究員 (80742121)
Project Period (FY)	2020-04-01 – 2023-03-31
Project Status	Completed (Fiscal Year 2023)
Budget Amount *help	¥45,110,000 (Direct Cost: ¥34,700,000、Indirect Cost: ¥10,410,000) Fiscal Year 2022: ¥9,620,000 (Direct Cost: ¥7,400,000、Indirect Cost: ¥2,220,000) Fiscal Year 2021: ¥19,760,000 (Direct Cost: ¥15,200,000、Indirect Cost: ¥4,560,000) Fiscal Year 2020: ¥15,730,000 (Direct Cost: ¥12,100,000、Indirect Cost: ¥3,630,000)
Keywords	高性能計算 / メニーコア / 余剰コア / MPI / GPU / OpenMP / 非同期IO / メニーコアプロセッサ / プロファイリング / メニーコア・マルチコアプロセッサ / 計算科学 / 並列処理 / 電力制御
Outline of Research at the Start	近年のCPUの性能向上は，チップ内コア数の増加によって支えられているが，計算パターンによっては全コアを使うと却って性能を低下させる場合もあり，そのような場合には多くのコアを敢えて使わずに残すことになる。本研究提案では，そのような「余剰コア」に対して，CPUで実行している主計算を支援する役割を与えることで，システム全体の性能改善や，電力制御，付加機能を低オーバヘッドで実現することを目的とし，ユーザレベルで支援機能を制御するためのフレームワーク "UTHelper"を開発する。さらに，実際のスーパーコンピュータ上でUTHelperを用いてアプリケーションに適用し，有効性を実証する。
Outline of Final Research Achievements	This research aims to improve the overall system performance and realize additional functions such as power control and profiling functions with low overhead by giving "extra cores" that do not directly contribute to the performance improvement of high-performance computation a role in supporting the main computation running on the CPU. We studied "UTHelper," a framework to realize such support functions at the user level. As a result, we realized profiling and parallelism change during execution without modifying the user program, in situ analysis using extra cores, load balancing using dynamic core allocation to speed up lattice H-matrix operations, inter-GPU communication using extra cores, and utilization of idle arithmetic units through time-space blocking.
Academic Significance and Societal Importance of the Research Achievements	近年の高性能計算システムにおいてCPUのコア数が増加し続けているが，全コアを使うのが最善だと誤解されることも多く，実際には消費電力や放熱，メモリバンド幅等の観点でコアをあえて使用せず残した方がよい場合がある．つまり「余剰コア」が生じる．本研究では，いくつかの実アプリケーションに対し，これらの余剰コアを補助的に用い，主計算を支援することによって，全体として性能向上を実現できることや，性能を落とさずに付加機能を実現できることを実例として示すことができた．

Report

(5 results)

2023 Final Research Report ( PDF )
2022 Annual Research Report
2021 Annual Research Report
2020 Comments on the Screening Results Annual Research Report

Research Products
(33 results)

All 2024 2023 2022 2021 2020 Other

All Journal Article (19 results) (of which Int'l Joint Research: 1 results, Peer Reviewed: 6 results, Open Access: 3 results) Presentation (11 results) (of which Int'l Joint Research: 8 results, Invited: 2 results) Remarks (3 results)

[Journal Article] Algebraic Partition Construction Method for Hierarchical Matrices2024
- Author(s)
  Ida Akihiro
- Journal Title
  
  IEEE Transactions on Magnetics
  
  Volume: 60 Issue: 3 Pages: 1-4
- DOI
  10.1109/tmag.2023.3305333
- Related Report
  2022 Annual Research Report
- Peer Reviewed
[Journal Article] Optimize Efficiency of Utilizing Systems by Dynamic Core Binding2024
- Author(s)
  Kawai Masatoshi、Ida Akihiro、Hanawa Toshihiro、Hoshino Tetsuya
- Journal Title
  
  Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops
  
  Volume: - Pages: 77-82
- DOI
  10.1145/3636480.3637221
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Pipelined task control using OpenMP2024
- Author(s)
  Tianya Wu, Toshihro Hanawa
- Journal Title
  
  情報処理学会研究報告（ハイパフォーマンスコンピューティング）
  
  Volume: 2024-HPC-193 (29) Pages: 1-7
- Related Report
  2022 Annual Research Report
[Journal Article] Decoupling of Simulation on Heterogeneous Clusters Using WaitIO2024
- Author(s)
  Wenrui Mai, Toshihiro Hanawa
- Journal Title
  
  情報処理学会研究報告（ハイパフォーマンスコンピューティング）
  
  Volume: 2024-HPC-193 (23) Pages: 1-6
- Related Report
  2022 Annual Research Report
[Journal Article] NVIDIA H100 PCIeおよびAMD MI210におけるN体計算コードの性能評価2023
- Author(s)
  三木洋平, 塙敏博
- Journal Title
  
  情報処理学会研究報告（ハイパフォーマンスコンピューティング）
  
  Volume: 2023-HPC-190 (28) Pages: 1-13
- Related Report
  2022 Annual Research Report
[Journal Article] Dynamic Core Binding for Load Balancing of Applications Parallelized with MPI/OpenMP2023
- Author(s)
  Kawai Masatoshi、Ida Akihiro、Hanawa Toshihiro、Nakajima Kengo
- Journal Title
  
  Proc. of Intl. Conf. on Computational Sci. 2023
  
  Volume: - Pages: 378-394
- DOI
  10.1007/978-3-031-36024-4_30
- ISBN
  9783031360237, 9783031360244
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] 複数GPUでの埋め込み境界-格子ボルツマン法の計算の最適化と性能モデルの構築2023
- Author(s)
  畠山昂, 下川辺隆史
- Journal Title
  
  情報処理学会研究報告（ハイパフォーマンスコンピューティング）
  
  Volume: 2023-HPC-188 (17) Pages: 1-8
- Related Report
  2022 Annual Research Report
[Journal Article] oneAPIを用いた様々なデバイス上でのステンシル計算の実装2023
- Author(s)
  佐久間大我、下川辺隆史、大森拓郎
- Journal Title
  
  第28回計算工学講演会
  
  Volume: 28 Pages: 716-718
- Related Report
  2022 Annual Research Report
[Journal Article] Accelerating lattice Boltzmann method simulation with GPU computation using C++ standard language parallelism2023
- Author(s)
  Ziheng Yuan, Takashi Shimokawabe
- Journal Title
  
  第28回計算工学講演会
  
  Volume: 28 Pages: 725-729
- Related Report
  2022 Annual Research Report
[Journal Article] Solving Block Low-Rank Matrix Eigenvalue Problems2022
- Author(s)
  Ida Akihiro
- Journal Title
  
  Journal of Information Processing
  
  Volume: 30 Issue: 0 Pages: 538-551
- DOI
  10.2197/ipsjjip.30.538
- ISSN
  1882-6652
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] 対称ブロック低ランク行列の固有値問題解法2022
- Author(s)
  伊田　明弘
- Journal Title
  
  情報処理学会研究報告（ハイパフォーマンスコンピューティング）
  
  Volume: 2022-HPC-185 () Pages: 1-8
- Related Report
  2022 Annual Research Report
[Journal Article] 複数GPUを用いる際の埋め込み境界-格子ボルツマン法の性能向上2022
- Author(s)
  畠山昂, 下川辺隆史
- Journal Title
  
  第27回計算工学講演会論文集
  
  Volume: 27 Pages: 975-980
- Related Report
  2022 Annual Research Report
[Journal Article] OpenMP Offloadingを用いたGPUでの格子ボルツマン法実行における性能評価2022
- Author(s)
  大森拓郎, 下川辺隆史, 朝比祐一
- Journal Title
  
  第27回計算工学講演会論文集
  
  Volume: 27 Pages: 981-986
- Related Report
  2022 Annual Research Report
[Journal Article] Large-scale earthquake sequence simulations on 3-D non-planar faults using the boundary element method accelerated by lattice H-matrices2022
- Author(s)
  Ozawa So、Ida Akihiro、Hoshino Tetsuya、Ando Ryosuke
- Journal Title
  
  Geophysical Journal International
  
  Volume: 232 Issue: 3 Pages: 1471-1481
- DOI
  10.1093/gji/ggac386
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Optimizations of H-matrix-vector Multiplication for Modern Multi-core Processors2022
- Author(s)
  Hoshino Tetsuya、Ida Akihiro、Hanawa Toshihiro
- Journal Title
  
  2022 IEEE International Conference on Cluster Computing (CLUSTER)
  
  Volume: 2022 Pages: 462-472
- DOI
  10.1109/cluster51413.2022.00056
- Related Report
  2022 Annual Research Report
- Peer Reviewed
[Journal Article] 余剰コアを活用したOpenMP TaskによるIn-situ解析の実現2022
- Author(s)
  赤沢龍哉, 塙敏博, 三木洋平
- Journal Title
  
  情報処理学会研究報告ハイパフォーマンスコンピューティング
  
  Volume: 2022-HPC-183 Pages: 1-10
- Related Report
  2021 Annual Research Report
[Journal Article] A64FXにおけるテンポラルブロッキングの実装と性能評価2021
- Author(s)
  星野哲也, 塙敏博
- Journal Title
  
  情報処理学会研究報告ハイパフォーマンスコンピューティング
  
  Volume: 2021-HPC-178 Pages: 1-8
- Related Report
  2020 Annual Research Report
[Journal Article] 余剰コアの活用に向けた実行中プロファイリング手法の検討2020
- Author(s)
  工藤純，塙敏博
- Journal Title
  
  情報処理学会研究報告ハイパフォーマンスコンピューティング
  
  Volume: 2020-HPC-177 (7) Pages: 1-8
- Related Report
  2020 Annual Research Report
[Journal Article] 不均一なコア割付による動的負荷分散手法の検討2020
- Author(s)
  河合直聡, 伊田明弘, 中島研吾
- Journal Title
  
  日本応用数理学会2020年年会講演予稿集
  
  Volume: - Pages: 1-2
- Related Report
  2020 Annual Research Report
[Presentation] Accelerating Lattice Boltzmann method with C++ standard language parallel algorithm2024
- Author(s)
  Ziheng Yuan and Takashi Shimokawabe
- Organizer
  International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia) 2024 (poster)
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] N体シミュレーションコードのGPU実装 --これまでとこれから--2023
- Author(s)
  三木洋平
- Organizer
  シミュレーション天文学のこれまでとこれから -ハードウェア・アプリケーション・サイエンス-
- Related Report
  2022 Annual Research Report
[Presentation] Accelerating lattice Boltzmann method with GPU and C++ standard parallelization2023
- Author(s)
  Ziheng Yuan, Takashi Shimokawabe
- Organizer
  10th International Congress on Industrial and Applied Mathematics
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] Optimizations of H-matrix-vector Multiplication for Modern Multi-core Processors2023
- Author(s)
  Tetsuya Hoshino, Akihiro Ida, Toshihiro Hanawa
- Organizer
  10th International Congress on Industrial and Applied Mathematics
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] Optimizations of H-matrix-vector Multiplication for Modern Multi-core Processors2023
- Author(s)
  Tetsuya Hoshino, Akihiro Ida, Toshihiro Hanawa
- Organizer
  Japan Geoscience Union Meeting 2023
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] AMD製GPU・NVIDIA製GPU両対応direct N-body codeの実装と性能評価2022
- Author(s)
  三木洋平, 塙敏博, 河合直聡, 星野哲也
- Organizer
  日本天文学会 2022年秋季年会
- Related Report
  2022 Annual Research Report
[Presentation] Performance Optimization Of Lattice Boltzmann Method On A64FX2022
- Author(s)
  Takuro Omori, Takashi Shimokawabe
- Organizer
  15th World Congress on Computational Mechanics & 8th Asian Pacific Congress on Computational Mechanics
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] Performance improvement of immersed boundary-lattice Boltzmann method on multiple GPUs2022
- Author(s)
  Akira Hatakeyama, Takashi Shimokawabe
- Organizer
  15th World Congress on Computational Mechanics & 8th Asian Pacific Congress on Computational Mechanics
- Related Report
  2022 Annual Research Report
- Int'l Joint Research
[Presentation] Numerical Evaluation of Dynamic Core Binding Library with H-matrix Application2022
- Author(s)
  Masatoshi Kawai, Akihiro Ida
- Organizer
  2022 Conference on Advanced Topics and Auto Tuning in High-Performance Scientific Computing
- Related Report
  2021 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] メニーコアシステムにおける余剰コア有効利用に向けて2022
- Author(s)
  塙敏博
- Organizer
  第28回自動チューニング研究会オープンアカデミックセッション（ATOS28）
- Related Report
  2021 Annual Research Report
- Invited
[Presentation] Multi-GPU computing of moving boundary flow using lattice Boltzmann method2022
- Author(s)
  Akira Hatakeyama, Takashi Shimokawabe
- Organizer
  International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia) 2022, poster
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Remarks] UT-Helper: Support for Utilizing Unused cores
- URL
  https://www.cc.u-tokyo.ac.jp/en/public/sc22.php
- Related Report
  2021 Annual Research Report
[Remarks] 次世代スパコンに向けた演算加速・利用者支援
- URL
  https://www.youtube.com/watch?v=Q5xqOzpKXW0
- Related Report
  2020 Annual Research Report
[Remarks] SuperComputing 2020
- URL
  https://www.cc.u-tokyo.ac.jp/public/sc20.php
- Related Report
  2020 Annual Research Report

High-performance computing and data analysis support leveraging unused cores

Principal Investigator

Hanawa Toshihiro 東京大学, 情報基盤センター, 教授 (30308283)

¥45,110,000 (Direct Cost: ¥34,700,000、Indirect Cost: ¥10,410,000)

Report

Research Products

[Journal Article] Algebraic Partition Construction Method for Hierarchical Matrices2024

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Optimize Efficiency of Utilizing Systems by Dynamic Core Binding2024

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Pipelined task control using OpenMP2024

Author(s)

Journal Title

Related Report

[Journal Article] Decoupling of Simulation on Heterogeneous Clusters Using WaitIO2024

Author(s)

Journal Title

Related Report

[Journal Article] NVIDIA H100 PCIeおよびAMD MI210におけるN体計算コードの性能評価2023

Author(s)

Journal Title

Related Report

[Journal Article] Dynamic Core Binding for Load Balancing of Applications Parallelized with MPI/OpenMP2023

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] 複数GPUでの埋め込み境界-格子ボルツマン法の計算の最適化と性能モデルの構築2023

Author(s)

Journal Title

Related Report

[Journal Article] oneAPIを用いた様々なデバイス上でのステンシル計算の実装2023

Author(s)

Journal Title

Related Report

[Journal Article] Accelerating lattice Boltzmann method simulation with GPU computation using C++ standard language parallelism2023

Author(s)

Journal Title

Related Report

[Journal Article] Solving Block Low-Rank Matrix Eigenvalue Problems2022

Author(s)

Journal Title

DOI

ISSN

Related Report

[Journal Article] 対称ブロック低ランク行列の固有値問題解法2022

Author(s)

Journal Title

Related Report

[Journal Article] 複数GPUを用いる際の埋め込み境界-格子ボルツマン法の性能向上2022

Author(s)

Journal Title

Related Report

[Journal Article] OpenMP Offloadingを用いたGPUでの格子ボルツマン法実行における性能評価2022

Author(s)

Journal Title

Related Report

[Journal Article] Large-scale earthquake sequence simulations on 3-D non-planar faults using the boundary element method accelerated by lattice H-matrices2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Optimizations of H-matrix-vector Multiplication for Modern Multi-core Processors2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] 余剰コアを活用したOpenMP TaskによるIn-situ解析の実現2022

Author(s)

Journal Title

Related Report

[Journal Article] A64FXにおけるテンポラルブロッキングの実装と性能評価2021

Author(s)