ＧＰＵスパコンのための３倍・４倍精度線形演算ライブラリの開発に関する研究

Research Project

Project/Area Number	13J01290
Research Category	Grant-in-Aid for JSPS Fellows
Allocation Type	Single-year Grants
Section	国内
Research Field	Software
Research Institution	The Institute of Physical and Chemical Research (2014) University of Tsukuba (2013)
Principal Investigator	椋木大地独立行政法人理化学研究所, 計算科学研究機構, 特別研究員(PD)
Project Period (FY)	2013-04-01 – 2015-03-31
Project Status	Completed (Fiscal Year 2014)
Budget Amount *help	¥2,070,000 (Direct Cost: ¥1,800,000、Indirect Cost: ¥270,000) Fiscal Year 2014: ¥1,170,000 (Direct Cost: ¥900,000、Indirect Cost: ¥270,000) Fiscal Year 2013: ¥900,000 (Direct Cost: ¥900,000)
Keywords	GPU / BLAS / 拡張精度 / 4倍精度演算 / 拡張精度演算 / 疎行列 / 線形計算 / 反復解法
Outline of Annual Research Achievements	本研究の目的は，GPUスパコンにおける3倍・4倍精度演算の実用化を目的として，GPUにおける高性能な3倍・4倍精度線形計算ライブラリの実現に向けた基礎研究を行うことであった．本年度は主として，GPUにおける複数の演算精度に対応した線形計算ライブラリの効率的な実装手法に関する研究を行った，その結果として，複数のNVIDIA GPUアーキテクチャに対応した高速な行列ベクトル積ルーチン（GEMV）の実装手法を開発した．本実装ではGPUにおけるプログラムの実行メカニズムをモデル化し，実行効率が最大となるようなスレッドブロックサイズを自動的に決定するオンライン自動チューニングを採用する．これにより既存の実装と比べ，実行環境や問題サイズに依存して生じる性能の変動を防ぎ，常に高い性能を維持できる．本手法は，ある線形計算を行うプログラム（例えばBLASルーチンなど）において演算精度が異なる複数バージョンを実装・最適化する上で有効であると考えられる．またこの他に，4倍精度演算手法の応用として，倍精度演算性能が単精度演算性能の1/32であるNVIDIA社の最新GPUにおいて，ソフトウェアエミュレーションによる疑似倍精度演算を実装し，倍精度行列積ルーチン（DGEMM）においてハードウェア処理による実装を上回る性能が得られることを示した．本年度に開発したGPU向けソフトウェアの一部は，オープンソースのライブラリとしてウェブ上で公開しており，今後も開発を継続する予定である．
Research Progress Status	26年度が最終年度であるため、記入しない。
Strategy for Future Research Activity	26年度が最終年度であるため、記入しない。

Report

(2 results)

2014 Annual Research Report
2013 Annual Research Report

Research Products
(12 results)

All 2015 2014 2013

All Journal Article (5 results) (of which Acknowledgement Compliant: 3 results, Peer Reviewed: 3 results) Presentation (7 results)

[Journal Article] Fast Implementation of General Matrix-Vector Multiplication (GEMV) on Kepler GPUs2015
- Author(s)
  Daichi Mukunoki, Toshiyuki Imamura, Daisuke Takahashi
- Journal Title
  
  Proc. 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP2015)
  
  Volume: なし Pages: 642-650
- DOI
  10.1109/pdp.2015.66
- Related Report
  2014 Annual Research Report
- Peer Reviewed / Acknowledgement Compliant
[Journal Article] NVIDIA GPUにおけるGEMVカーネルの自動チューニング2015
- Author(s)
  椋木大地，今村俊幸，高橋大介
- Journal Title
  
  計算工学講演会論文集
  
  Volume: 20
- NAID
  40020496105
- Related Report
  2014 Annual Research Report
- Acknowledgement Compliant
[Journal Article] MaxwellアーキテクチャGPUにおける擬似倍精度演算を用いたDGEMMの実装と評価2014
- Author(s)
  椋木大地，今村俊幸
- Journal Title
  
  情報処理学会研究報告：ハイパフォーマンスコンピューティング
  
  Volume: 2014-HPC-147 Pages: 1-6
- Related Report
  2014 Annual Research Report
- Acknowledgement Compliant
[Journal Article] Using Quadruple Precision Arithmetic to Accelerate Krylov Subspace Methods on GPUs2014
- Author(s)
  Daichi Mukunoki, Daisuke Takahashi
- Journal Title
  
  Parallel Processing and Applied Mathematics-PPAM 2013, Part I, LNCS 8384
  
  Volume: (印刷中) Pages: 632-642
- DOI
  10.1007/978-3-642-55224-3_59
- ISBN
  9783642552236, 9783642552243
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Journal Article] Optimization of Sparse Matrix-vector Multiplication for CRS Format on NVIDIA Kepler Architecture GPUs2013
- Author(s)
  Daichi Mukunoki, Daisuke Takahashi
- Journal Title
  
  Computational Science and Its Applications-ICCSA 2013, LNCS 7975
  
  Volume: 7975 Pages: 211-223
- DOI
  10.1007/978-3-642-39640-3_15
- ISBN
  9783642396397, 9783642396403
- Related Report
  2013 Annual Research Report
- Peer Reviewed
[Presentation] High-Performance GEMV and SYMV with Auto-Tuning for Performance Stabilization on Multiple GPU Generations2015
- Author(s)
  Daichi Mukunoki, Toshiyuki Imamura, Daisuke Takahashi
- Organizer
  GPU Technology Conference (GTC 2015)
- Place of Presentation
  San Jose (USA)
- Year and Date
  2015-03-16 – 2015-03-20
- Related Report
  2014 Annual Research Report
[Presentation] Kepler・MaxwellアーキテクチャGPUにおける性能が行列形状に依存しない高速なGEMVの実装2015
- Author(s)
  椋木大地，今村俊幸，高橋大介
- Organizer
  Annual Meeting on Advanced Computing System and Infrastructure (ACSI 2015)
- Place of Presentation
  つくば国際会議場（茨城県つくば市）
- Year and Date
  2015-01-27
- Related Report
  2014 Annual Research Report
[Presentation] KeplerアーキテクチャGPUにおける高速なSGEMVの実装2014
- Author(s)
  椋木大地，今村俊幸，高橋大介
- Organizer
  GTC Japan 2014
- Place of Presentation
  東京ミッドタウンホール＆カンファレンス（東京都）
- Year and Date
  2014-07-16
- Related Report
  2014 Annual Research Report
[Presentation] Linear Algebra Operations using Quadruple-precision Arithmetic on GPU2014
- Author(s)
  Daichi Mukunoki, Daisuke Takahashi
- Organizer
  GPU Technology Conference (GTC 2014)
- Place of Presentation
  San Jose, USA
- Year and Date
  2014-03-24
- Related Report
  2013 Annual Research Report
[Presentation] Using Quadruple Precision Arithmetic to Accelerate Krylov Subspace Methods on GPUs2013
- Author(s)
  Daichi Mukunoki, Daisuke Takahashi
- Organizer
  10th International Conference on Parallel Processing and Applied Mathematics (PPAM 2013)
- Place of Presentation
  Warszawa, Poland
- Year and Date
  2013-09-10
- Related Report
  2013 Annual Research Report
[Presentation] GPUにおける4倍精度浮動小数点演算を用いたクリロフ部分空間法の高速化2013
- Author(s)
  椋木大地, 高橋大介
- Organizer
  2013年並列/分散/協調処理に関する『北九州』サマー・ワークショップ(SWoPP北九州2013)
- Place of Presentation
  北九州市
- Year and Date
  2013-08-02
- Related Report
  2013 Annual Research Report
[Presentation] Optimization of Sparse Matrix-vector Multiplication for CRS Format on NVIDIA Kepler Architecture GPUs2013
- Author(s)
  Daichi Mukunoki, Daisuke Takahashi
- Organizer
  13th International Conference on Computational Science and Its Applications (ICCSA 2013)
- Place of Presentation
  Ho Chi Minh City, Vietnam
- Year and Date
  2013-06-27
- Related Report
  2013 Annual Research Report

ＧＰＵスパコンのための３倍・４倍精度線形演算ライブラリの開発に関する研究

Principal Investigator

椋木 大地 独立行政法人理化学研究所, 計算科学研究機構, 特別研究員(PD)

¥2,070,000 (Direct Cost: ¥1,800,000、Indirect Cost: ¥270,000)

Report

Research Products

[Journal Article] Fast Implementation of General Matrix-Vector Multiplication (GEMV) on Kepler GPUs2015

Author(s)

Journal Title

DOI

Related Report

[Journal Article] NVIDIA GPUにおけるGEMVカーネルの自動チューニング2015

Author(s)

Journal Title

NAID

Related Report

[Journal Article] MaxwellアーキテクチャGPUにおける擬似倍精度演算を用いたDGEMMの実装と評価2014

Author(s)

Journal Title

Related Report

[Journal Article] Using Quadruple Precision Arithmetic to Accelerate Krylov Subspace Methods on GPUs2014

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] Optimization of Sparse Matrix-vector Multiplication for CRS Format on NVIDIA Kepler Architecture GPUs2013

Author(s)

Journal Title

DOI

ISBN

Related Report

[Presentation] High-Performance GEMV and SYMV with Auto-Tuning for Performance Stabilization on Multiple GPU Generations2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Kepler・MaxwellアーキテクチャGPUにおける性能が行列形状に依存しない高速なGEMVの実装2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] KeplerアーキテクチャGPUにおける高速なSGEMVの実装2014

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Linear Algebra Operations using Quadruple-precision Arithmetic on GPU2014

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Using Quadruple Precision Arithmetic to Accelerate Krylov Subspace Methods on GPUs2013

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] GPUにおける4倍精度浮動小数点演算を用いたクリロフ部分空間法の高速化2013

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Optimization of Sparse Matrix-vector Multiplication for CRS Format on NVIDIA Kepler Architecture GPUs2013

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

椋木大地独立行政法人理化学研究所, 計算科学研究機構, 特別研究員(PD)