Creating out of GPU core computation techniques and their deployment over cycle-sharing systems

Research Project

Project/Area Number	23K28061
Project/Area Number (Other)	23H03371 (2023)
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Multi-year Fund (2024) Single-year Grants (2023)
Section	一般
Review Section	Basic Section 60050:Software-related
Research Institution	Osaka University
Principal Investigator	伊野文彦大阪大学, 大学院情報科学研究科, 教授 (90346172)
Co-Investigator(Kenkyū-buntansha)	置田真生大阪大学, 大学院情報科学研究科, 准教授 (50563988) 桝井晃基大阪大学, 大学院情報科学研究科, 助教 (70897793)
Project Period (FY)	2023-04-01 – 2027-03-31
Project Status	Granted (Fiscal Year 2024)
Budget Amount *help	¥18,590,000 (Direct Cost: ¥14,300,000、Indirect Cost: ¥4,290,000) Fiscal Year 2026: ¥1,950,000 (Direct Cost: ¥1,500,000、Indirect Cost: ¥450,000) Fiscal Year 2025: ¥9,100,000 (Direct Cost: ¥7,000,000、Indirect Cost: ¥2,100,000) Fiscal Year 2024: ¥2,210,000 (Direct Cost: ¥1,700,000、Indirect Cost: ¥510,000) Fiscal Year 2023: ¥5,330,000 (Direct Cost: ¥4,100,000、Indirect Cost: ¥1,230,000)
Keywords	アウトオブコア計算 / GPU計算 / 高速化 / 高性能計算
Outline of Research at the Start	本研究では，Graphics Processing Unit（GPU）メモリに乗り切らない大規模データに対する高速処理を目的として，GPU向けアウトオブコア計算技術を創出する．そのために，CPU・GPU間のデータ転送量を削減できるデータ再利用技術やGPU向けオンザフライ並列圧縮技術を開発し，量子シミュレータなどの実応用を，日常的に使われているGPUによるサイクル共有計算基盤上に展開することで，GPU向けアウトオブコア計算技術の有用性を示す．
Outline of Annual Research Achievements	本研究の目的は，GPUメモリに乗り切らない大規模データを高速処理できるアウトオブコア計算技術を記述系とともに創出することである．本技術は，これまで避けられてきたCPU・GPU間のデータ転送を敢えて活用するだけでなく，データ再利用技術やオンザフライ並列圧縮技術による効率化を施す．さらに，本技術を家庭やオフィスで日常的に使われているGPUによるサイクル共有システム上に量子シミュレータなどの大規模応用を展開する．これら未解明課題の解決により，GPUの弱点であるメモリ容量の制約を克服し，GPU向けアウトオブコア計算技術の有用性を示す．令和5年度は以下の課題に取り組んだ．まず，NVIDIA社のGPU向け開発環境Compute Unified Device Architecture（CUDA）で記述されたステンシル計算を拡張し，データ再利用技術やオンザフライ並列圧縮技術を組み込んだ実装を開発した．開発した実装は，データ再利用技術の効果を高めるために，GPUメモリ使用量を削減できるバッファ方式を持つ．実験の結果，この方式はメモリ使用量を21%削減し，データ再利用の範囲を拡げることができた．これにより2.1倍の高速化を達成した．次に，ルーフラインモデルに基づく性能モデルを構築し，オンザフライ圧縮による高速化が見込めるステンシルの形状を分析した．結果，参照数の少ない星型ステンシルに対してはデータ圧縮による性能改善が見込めることが確認できた．一方，参照範囲の広い箱型ステンシルはGPU上の計算が性能を支配していて，データ圧縮による性能改善は難しいことが判明した．最後に，現有設備の持つ8台のGPUをPCIe4.0バスで接続できる最新のGPUに更新し，来年度以降の開発を遂行し性能を評価するためのテストベッドを構築した．
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 当初計画における順番を変更し，特定のステンシル計算においてデータ再利用ならびにオンザフライ圧縮を先行して実現した．一方，記述系の実現については準備を進めているところである．一部を先行して実現できたことから，進捗は計画通りである．
Strategy for Future Research Activity	当初の計画にしたがい，GPUメモリに乗り切らない大規模データを高速処理できるアウトオブコア計算技術を実現していく．また，最適化手法に関する成果を国内外で発表し，他の研究者からの意見を反映していく予定である．

Report

(1 results)

2023 Annual Research Report

Research Products
(7 results)

All 2023 Other

All Int'l Joint Research (1 results) Journal Article (2 results) (of which Int'l Joint Research: 2 results, Peer Reviewed: 2 results) Presentation (3 results) (of which Int'l Joint Research: 1 results) Remarks (1 results)

[Int'l Joint Research] 上海交通大学/重慶郵電大学(中国)
- Related Report
  2023 Annual Research Report
[Journal Article] A compression-based memory-efficient optimization for out-of-core GPU stencil computation2023
- Author(s)
  Jingcheng Shen, Linbo Long, Xin Deng, Masao Okita, Fumihiko Ino
- Journal Title
  
  The Journal of Supercomputing
  
  Volume: － Issue: 10 Pages: 11055-11077
- DOI
  10.1007/s11227-023-05103-8
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Parallel Heuristic Methods to Accelerate Best Equivocation Code Generation2023
- Author(s)
  Li Yanchen、Zhang Ke、Ino Fumihiko
- Journal Title
  
  IEEE Access
  
  Volume: 11 Pages: 44136-44149
- DOI
  10.1109/access.2023.3272864
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Presentation] PRF: A Fast Parallel Relaxed Flooding Algorithm for Voronoi Diagram Generation on GPU2023
- Author(s)
  Jue Wang, Fumihiko Ino, and Jing Ke
- Organizer
  37th IEEE International Symposium on Parallel and Distributed Processing (IPDPS 2023)
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] 自由視点画像を高速に生成するためのオンラインレンダリング2023
- Author(s)
  荒川雄登, 置田真生, 伊野文彦
- Organizer
  第22回情報科学技術フォーラム
- Related Report
  2023 Annual Research Report
[Presentation] 高周波電磁界シミュレーションにおけるICCG法の並列化2023
- Author(s)
  小山雄世, 桝井晃基, 伊野文彦
- Organizer
  情報処理学会ハイパフォーマンスコンピューティング研究会
- Related Report
  2023 Annual Research Report
[Remarks] 大阪大学　大学院情報科学研究科　並列処理工学講座
- URL
  http://www-ppl.ist.osaka-u.ac.jp/
- Related Report
  2023 Annual Research Report

Creating out of GPU core computation techniques and their deployment over cycle-sharing systems

Principal Investigator

伊野 文彦 大阪大学, 大学院情報科学研究科, 教授 (90346172)

¥18,590,000 (Direct Cost: ¥14,300,000、Indirect Cost: ¥4,290,000)

Current Status of Research Progress

Reason

Report

Research Products

[Int'l Joint Research] 上海交通大学/重慶郵電大学(中国)

Related Report

[Journal Article] A compression-based memory-efficient optimization for out-of-core GPU stencil computation2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Parallel Heuristic Methods to Accelerate Best Equivocation Code Generation2023

Author(s)

Journal Title

DOI

Related Report

[Presentation] PRF: A Fast Parallel Relaxed Flooding Algorithm for Voronoi Diagram Generation on GPU2023

Author(s)

Organizer

Related Report

[Presentation] 自由視点画像を高速に生成するためのオンラインレンダリング2023

Author(s)

Organizer

Related Report

[Presentation] 高周波電磁界シミュレーションにおけるICCG法の並列化2023

Author(s)

Organizer

Related Report

[Remarks] 大阪大学 大学院情報科学研究科 並列処理工学講座

URL

Related Report

伊野文彦大阪大学, 大学院情報科学研究科, 教授 (90346172)

[Remarks] 大阪大学　大学院情報科学研究科　並列処理工学講座