ExaPath: Hierarchical Routing for Next-Gen Supercomputers and Beyond

研究課題

研究課題/領域番号	19H04119
研究種目	基盤研究(B)
配分区分	補助金
応募区分	一般
審査区分	小区分60090:高性能計算関連
研究機関	国立研究開発法人理化学研究所
研究代表者	ドンケイェンス国立研究開発法人理化学研究所, 計算科学研究センター, チームリーダー (70815480)
研究分担者	遠藤敏夫東京工業大学, 学術国際情報センター, 教授 (80396788)
研究期間 (年度)	2019-04-01 – 2024-03-31
研究課題ステータス	完了 (2023年度)
配分額 *注記	17,160千円 (直接経費: 13,200千円、間接経費: 3,960千円) 2023年度: 3,250千円 (直接経費: 2,500千円、間接経費: 750千円) 2022年度: 3,250千円 (直接経費: 2,500千円、間接経費: 750千円) 2021年度: 3,510千円 (直接経費: 2,700千円、間接経費: 810千円) 2020年度: 3,250千円 (直接経費: 2,500千円、間接経費: 750千円) 2019年度: 3,900千円 (直接経費: 3,000千円、間接経費: 900千円)
キーワード	HPC interconnects / routing algorithms / network design / artificial intelligance / message passing / routing / hierarchical / supercomputing
研究開始時の研究の概要	The research objective is the invention and development of a novel type of algorithms, which calculate the communication paths within supercomputer networks. These novel algorithms will be hierarchical to overcome scalability challenges of existing algorithms, which are insufficient for future system.
研究成果の概要	現代社会は、人工知能などのサイエンス分野に対応するため、ますます大きな計算能力を求めている。最近では、スーパーコンピューターがより大規模なシステムへとスケールアウトし始めている。これらのシステムのバックボーンは、適切なルーティングを必要とする相互接続ネットワークである。このプロセスは、グーグルマップがどの道を走ればいいかを教えてくれるのに似ている。スーパーコンピューターでは、ルーティングがメッセージに進むべき道を指示する。我々のプロジェクトは、何千台、何百万台ものコンピュータを接続する非常に複雑なネットワークのための新しいルーティング・アプローチを設計することを目的としている。
研究成果の学術的意義や社会的意義	Our developed routing algorithms, and methods to make supercomputer interconnects faster, will help other scientists to accelerate their workflows. Meaning, with optimal routing, the supercomputers can finish more scientific simulations, and hence the scientists can get more results in shorter time.

報告書

(6件)

研究成果
(25件)

すべて 2024 2023 2022 2021 2020 2019 その他

すべて雑誌論文 (11件) (うち国際共著 10件、査読あり 11件、オープンアクセス 1件) 学会発表 (11件) (うち国際学会 7件、招待講演 3件) 備考 (3件)

[雑誌論文] A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network2024
- 著者名/発表者名
  N. Blach, M. Besta, D.D. Sensi, J. Domke, H. Harake, S. Li, P. Iff, M. Konieczny, K. Lakhotia, A. Kubicek, M. Ferrari, F. Petrini, T. Hoefler
- 雑誌名
  
  21st USENIX Symposium on Networked Systems Design and Implementation (NSDI '24)
  
  巻: 0 ページ: 1025-1044
- 関連する報告書
  2023 実績報告書
- 査読あり / 国際共著
[雑誌論文] Myths and legends in high-performance computing2023
- 著者名/発表者名
  Matsuoka Satoshi、Domke Jens、Wahib Mohamed、Drozd Aleksandr、Hoefler Torsten
- 雑誌名
  
  The International Journal of High Performance Computing Applications
  
  巻: 37 号: 3-4 ページ: 245-259
- DOI
  10.1177/10943420231166608
- 関連する報告書
  2023 実績報告書
- 査読あり / オープンアクセス / 国際共著
[雑誌論文] High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs2023
- 著者名/発表者名
  Moses William S.、Ivanov Ivan R.、Domke Jens、Endo Toshio、Doerfert Johannes、Zinenko Oleksandr
- 雑誌名
  
  28th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '23
  
  巻: 0 ページ: 119-134
- DOI
  10.1145/3572848.3577475
- 関連する報告書
  2022 実績報告書
- 査読あり / 国際共著
[雑誌論文] Parallel Optimizations and Transformations of GPU Kernels Using a High-Level representation in MLIR/Polygeist2023
- 著者名/発表者名
  I.R. Ivanov, W.S. Moses, J. Domke, T. Endo
- 雑誌名
  
  IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2023
  
  巻: 0 ページ: 1-1
- 関連する報告書
  2022 実績報告書
- 査読あり / 国際共著
[雑誌論文] High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs2022
- 著者名/発表者名
  W.S. Moses, I.R. Ivanov, J. Domke, T. Endo, J. Doerfert, O. Zinenko
- 雑誌名
  
  2022 LLVM Developers' Meeting
  
  巻: 0 ページ: 1-1
- 関連する報告書
  2022 実績報告書
- 査読あり / 国際共著
[雑誌論文] Automatic translation of CUDA code into high performance CPU code using LLVM IR transformations2022
- 著者名/発表者名
  I.R. Ivanov, J. Domke, T. Endo
- 雑誌名
  
  The 4rd R-CCS International Symposium (RCCS-IS4)
  
  巻: 0 ページ: 1-1
- 関連する報告書
  2021 実績報告書
- 査読あり
[雑誌論文] A64FX - Your Compiler You Must Decide!2021
- 著者名/発表者名
  J. Domke
- 雑誌名
  
  2021 IEEE International Conference on Cluster Computing (CLUSTER), EAHPC Workshop
  
  巻: 0 ページ: 1-5
- 関連する報告書
  2021 実績報告書
- 査読あり / 国際共著
[雑誌論文] High-Performance Routing With Multipathing and Path Diversity in Ethernet and HPC Networks2021
- 著者名/発表者名
  Maciej Besta, Jens Domke, Marcel Schneider, Marek Konieczny, Salvatore Di Girolamo, Timo Schneider, Ankit Singla, Torsten Hoefler
- 雑誌名
  
  IEEE Transactions on Parallel and Distributed Systems
  
  巻: 32 号: 4 ページ: 1-14
- DOI
  10.1109/tpds.2020.3035761
- 関連する報告書
  2020 実績報告書
- 査読あり / 国際共著
[雑誌論文] Improved failover for HPC interconnects through localised routing restoration2021
- 著者名/発表者名
  Ivan R. Ivanov, Jens Domke, Akihiro Nomura, Toshio Endo
- 雑誌名
  
  The 3rd R-CCS International Symposium (RCCS-IS3)
  
  巻: 0
- 関連する報告書
  2020 実績報告書
- 査読あり / 国際共著
[雑誌論文] HyperX Topology: First at-scale Implementation and Comparison to the Fat-Tree2019
- 著者名/発表者名
  Domke Jens、Matsuoka Satoshi、Ivanov Ivan R.、Tsushima Yuki、Yuki Tomoya、Nomura Akihiro、Miura Shin'ichi、McDonald Nie、Floyd Dennis L.、Dube Nicolas
- 雑誌名
  
  Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
  
  巻: SC'19 ページ: 1-23
- DOI
  10.1145/3295500.3356140
- 関連する報告書
  2019 実績報告書
- 査読あり / 国際共著
[雑誌論文] The First Supercomputer with HyperX Topology: A Viable Alternative to Fat-Trees?2019
- 著者名/発表者名
  Domke Jens、Matsuoka Satoshi、Radanov Ivan、Tsushima Yuki、Yuki Tomoya、Nomura Akihiro、Miura Shin'ichi、McDonald Nic、Floyd Dennis Lee、Dube Nicolas
- 雑誌名
  
  2019 IEEE Symposium on High-Performance Interconnects (HOTI)
  
  巻: HOTI'26 ページ: 4-4
- DOI
  10.1109/hoti.2019.00013
- 関連する報告書
  2019 実績報告書
- 査読あり / 国際共著
[学会発表] Advanced Architecture "Playgrounds" Past Lessons and Future Accesses of Testbeds ... an update by RIKEN R-CCS2023
- 著者名/発表者名
  J. Domke
- 学会等名
  International Conference for High Performance Computing, Networking, Storage and Analysis (SC '23)
- 関連する報告書
  2023 実績報告書
- 招待講演
[学会発表] Working with Proxy-Applications: Interesting Findings, Lessons Learned, and Future Directions2022
- 著者名/発表者名
  J. Domke
- 学会等名
  Benchmarking in the Data Center: Expanding to the Cloud (workshop) held in conjunction with PPoPP 2022: Principles and Practice of Parallel Programming 2022
- 関連する報告書
  2022 実績報告書
- 国際学会
[学会発表] Octopodes A candidate to replace Mini Apps and Motifs?2022
- 著者名/発表者名
  J. Domke
- 学会等名
  14th JLESC Workshop
- 関連する報告書
  2022 実績報告書
[学会発表] MocCUDA: Running Cuda Codes on Fugaku2022
- 著者名/発表者名
  J. Domke
- 学会等名
  SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP ’22)
- 関連する報告書
  2021 実績報告書
- 国際学会
[学会発表] MocCUDA: Running CUDA codes on Fugaku2021
- 著者名/発表者名
  Jens Domke
- 学会等名
  12th JLESC Workshop
- 関連する報告書
  2020 実績報告書
- 国際学会
[学会発表] Improved failover for HPC interconnects through localised routing restoration2021
- 著者名/発表者名
  Ivan R. Ivanov
- 学会等名
  The 3rd R-CCS International Symposium (RCCS-IS3)
- 関連する報告書
  2020 実績報告書
- 国際学会
[学会発表] The Bright Future for HPC Interconnects -- Opportunities, Challenges, and Misconceptions in Deployment and Management of Large-Scale Networks2020
- 著者名/発表者名
  Jens Domke
- 学会等名
  Focus Session: Leveraging Silicon Photonics in HPC to Meet Future Exascale Needs in 36th ISC High Performance (ISC ’21)
- 関連する報告書
  2020 実績報告書
- 国際学会
[学会発表] HyperX Topology: First at-scale Implementation and Comparison to the Fat-Tree2019
- 著者名/発表者名
  Domke Jens
- 学会等名
  International Conference for High Performance Computing, Networking, Storage and Analysis (SC'19)
- 関連する報告書
  2019 実績報告書
- 国際学会
[学会発表] The First Supercomputer with HyperX Topology: A Viable Alternative to Fat-Trees?2019
- 著者名/発表者名
  Domke Jens
- 学会等名
  2019 IEEE Symposium on High-Performance Interconnects
- 関連する報告書
  2019 実績報告書
- 国際学会
[学会発表] The First Supercomputer with HyperX Topology: A Viable Alternative to Fat-Trees?2019
- 著者名/発表者名
  Domke Jens
- 学会等名
  The 179th R-CCS Cafe
- 関連する報告書
  2019 実績報告書
- 招待講演
[学会発表] First At-Scale HyperX Implementation: A Compelling Alternative to Fat-Trees?2019
- 著者名/発表者名
  Domke Jens
- 学会等名
  High Performance Consortium for Advanced Scientific and Technical Computing (HP-CAST 32)
- 関連する報告書
  2019 実績報告書
- 招待講演
[備考] MocCUDA
- URL
  https://gitlab.com/domke/MocCUDA
- 関連する報告書
  2022 実績報告書
[備考] Repo for thesis of localised routing restoration:
- URL
  https://gitlab.com/ivanradanov/localisedrerouting
- 関連する報告書
  2020 実績報告書
[備考] TSUBAME2 HyperX experiment
- URL
  https://gitlab.com/domke/t2hx
- 関連する報告書
  2019 実績報告書

ExaPath: Hierarchical Routing for Next-Gen Supercomputers and Beyond

研究代表者

ドンケ イェンス 国立研究開発法人理化学研究所, 計算科学研究センター, チームリーダー (70815480)

17,160千円 (直接経費: 13,200千円、間接経費: 3,960千円)

報告書

研究成果

[雑誌論文] A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network2024

著者名/発表者名

雑誌名

関連する報告書

[雑誌論文] Myths and legends in high-performance computing2023

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs2023

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] Parallel Optimizations and Transformations of GPU Kernels Using a High-Level representation in MLIR/Polygeist2023

著者名/発表者名

雑誌名

関連する報告書

[雑誌論文] High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs2022

著者名/発表者名

雑誌名

関連する報告書

[雑誌論文] Automatic translation of CUDA code into high performance CPU code using LLVM IR transformations2022

著者名/発表者名

雑誌名

関連する報告書

[雑誌論文] A64FX - Your Compiler You Must Decide!2021

著者名/発表者名

雑誌名

関連する報告書

[雑誌論文] High-Performance Routing With Multipathing and Path Diversity in Ethernet and HPC Networks2021

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] Improved failover for HPC interconnects through localised routing restoration2021

著者名/発表者名

雑誌名

関連する報告書

[雑誌論文] HyperX Topology: First at-scale Implementation and Comparison to the Fat-Tree2019

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] The First Supercomputer with HyperX Topology: A Viable Alternative to Fat-Trees?2019

著者名/発表者名

雑誌名

DOI

関連する報告書

[学会発表] Advanced Architecture "Playgrounds" Past Lessons and Future Accesses of Testbeds ... an update by RIKEN R-CCS2023

著者名/発表者名

学会等名

関連する報告書

[学会発表] Working with Proxy-Applications: Interesting Findings, Lessons Learned, and Future Directions2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] Octopodes A candidate to replace Mini Apps and Motifs?2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] MocCUDA: Running Cuda Codes on Fugaku2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] MocCUDA: Running CUDA codes on Fugaku2021

著者名/発表者名

学会等名

関連する報告書

[学会発表] Improved failover for HPC interconnects through localised routing restoration2021

著者名/発表者名

学会等名

関連する報告書

[学会発表] The Bright Future for HPC Interconnects -- Opportunities, Challenges, and Misconceptions in Deployment and Management of Large-Scale Networks2020

ドンケイェンス国立研究開発法人理化学研究所, 計算科学研究センター, チームリーダー (70815480)