2011 Fiscal Year Annual Research Report

GPUによるFFT計算の自動チューニング手法の研究

Research Project

Project/Area Number	22680002
Research Institution	Tokyo Institute of Technology
Principal Investigator	額田彰東京工業大学, 学術国際情報センター, 産学官連携研究員 (40545688)
Keywords	高速フーリエ変換 / GPU / CUDA / 自動チューニング
Research Abstract	GPGPUで現在主流となっているのはCUDA環境であるが,NVIDIA製GPUのみを対象としているという制限がある.他のベンダのGPUやメニーコアプロセッサも対象とする共通のプログラム環境の中でCUDAに一番近いものがOpenCLである.OpenCL環境ではCUDAとほぼ同じ機能が提供されており,特にFFTの計算を実行するカーネル部分のコードは似ている.唯一の差異はFFTで利用する三角関数のテープルの扱いで,CUDAではテクスチャメモリを利用していたがOpenCLではコンスタントメモリを使用した.AMD製RADEON HD7970を用いてCUDA+NVIDIA製GPUを超える性能を実現することができた.一方,性能のポータビリティはOpenCLでは確保されていない.NVIDIA製GPUではCUDA版と比べるとOpenCL版の性能は現時点ではかなり劣る.今後ドライバの成熟を待てばこの差は縮まることが期待される. さらに複数GPUへの対応も進めている.特にGPUのデバイスメモリ容量は限られるため,実アプリケーションでは容量を確保するために複数GPUを利用するケースも少なくない.GPU間の転送はPCI-ExpressインターフェイスやInfiniBandなどのノード間インターコネクトの性能に大きく依存し,実行時間の大部分をGPU間のall-to-all通信が占めることになる.CUDAではversion4.0からこのGPU間通信を強化しており,PCI-Expressネットワークを介したGPU問の直接通信や,InfinlBand HCAとの協調動作などがサポートされた.これらの機能を活用することで,通信が全く必要ないシングルGPU実行時と比べても4GPU搭載システムで約2倍,64ノード64GPUのクラスタで最大13倍の性能向上を実現した.通信関連の各種自動最適化は今後の課題である。

Research Products
(7 results)

All 2012 2011 Other

All Journal Article (1 results) (of which Peer Reviewed: 1 results) Presentation (5 results) Remarks (1 results)

[Journal Article] スーパーコンピュータTSUBAME 2.0におけるLinpack性能1ペタフロップス超の達成2011
- Author(s)
  遠藤敏夫, 額田彰, 松岡聡
- Journal Title
  
  情報処理学会論文誌コンピューティングシステム
  
  Volume: Vol.4,No.4 Pages: 169-179
- Peer Reviewed
[Presentation] High Performance 3-D FFT using multiple CUDA GPUs2012
- Author(s)
  Akira Nukada, Yutaka Maruyama, Satoshi Matsuoka
- Organizer
  Fifth Workshop on General Purpose Processing using Graphics Processing Units (GPGPU-5)in conjunction with ACM ASPLOS XVII
- Place of Presentation
  London, UK
- Year and Date
  2012-03-03
[Presentation] Peta-scale Phase-Field Simulation for Dendritic Solidification on the TSUBAME 2.0 Supercomputer2011
- Author(s)
  Takashi Shimokawabe, Takayuki Aoki, Tomohiro Takaki, Akinori Yamanaka, Akira Nukada, Toshio Endo, Naoya Maruyama, Satoshi Matsuoka
- Organizer
  2011 ACM/IEEE International Conference for High Performance, Networking, Storage, and Analysis (SC'11)
- Place of Presentation
  Seattle, WA, USA
- Year and Date
  2011-11-15
[Presentation] Hamming Color Code for Dense and Robust One-shot 3D Scanning2011
- Author(s)
  Shuntaro Yamazaki, Akira Nukada, Masaaki Mochimaru
- Organizer
  2011 British Machine Vision Conference
- Place of Presentation
  Dundee, Scotland
- Year and Date
  2011-08-30
[Presentation] Fast Fourier Transform for AMD GPUs2011
- Author(s)
  Akira Nukada
- Organizer
  AMD Fusion Developer Summit 2011
- Place of Presentation
  Bellevue, WA, USA
- Year and Date
  2011-06-15
[Presentation] NVCR : A Transparent Checkpoint-Restart Library for NVIDIA CUDA2011
- Author(s)
  Akira Nukada, Hiroyuki Takizawa, Satoshi Matsuoka
- Organizer
  20th Heterogeneity in Computing Workshop (HCW 2011), in conjunction with IEEE IPDPS 2011
- Place of Presentation
  Anchorage, AK, USA
- Year and Date
  2011-05-16
[Remarks]
- URL
  http://matsu-www.is.titech.ac.jp/~nukada/nufft/

2011 Fiscal Year Annual Research Report

GPUによるFFT計算の自動チューニング手法の研究

Principal Investigator

額田 彰 東京工業大学, 学術国際情報センター, 産学官連携研究員 (40545688)

Research Products

[Journal Article] スーパーコンピュータTSUBAME 2.0におけるLinpack性能1ペタフロップス超の達成2011

Author(s)

Journal Title

[Presentation] High Performance 3-D FFT using multiple CUDA GPUs2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Peta-scale Phase-Field Simulation for Dendritic Solidification on the TSUBAME 2.0 Supercomputer2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Hamming Color Code for Dense and Robust One-shot 3D Scanning2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Fast Fourier Transform for AMD GPUs2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] NVCR : A Transparent Checkpoint-Restart Library for NVIDIA CUDA2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Remarks]

URL

額田彰東京工業大学, 学術国際情報センター, 産学官連携研究員 (40545688)