Auto-tuning FFT using GPU
Project/Area Number |
22680002
|
Research Category |
Grant-in-Aid for Young Scientists (A)
|
Allocation Type | Single-year Grants |
Research Field |
Software
|
Research Institution | Tokyo Institute of Technology |
Principal Investigator |
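NUKADA Akira, Tokyo Institute of Technology, Global Scientific Information and Computing Center, Industry-Academia-Government Collaboration Researcher (40545688)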
|
Project Period (FY) |
2010 – 2011
|
Project Status |
Completed (Fiscal Year 2011)
|
Budget Amount |
¥11,700,000 (Direct Cost: ¥9,000,000, Indirect Cost: ¥2,700,000)
Fiscal Year 2011: ¥5,200,000 (Direct Cost: ¥4,000,000, Indirect Cost: ¥1,200,000)
Fiscal Year 2010: ¥6,500,000 (Direct Cost: ¥5,000,000, Indirect Cost: ¥1,500,000)
|
Keywords | Software engineering / Fast Fourier transform / GPU / CUDA / Auto-tuning / OpenCL |
Research Abstract |
We developed NukadaFFT, an auto-tuning FFT library for NVIDIA CUDA GPUs. It outperforms NVIDIA's CUFFT library in many cases. We also implemented multi-GPU versions, both for a single node with multiple GPUs and for multiple nodes, and achieved further speed-up.
|
Report
(3 results)
Research Products
(34 results)
[Presentation] High Performance 3-D FFT using multiple CUDA GPUs (2012)
Author(s)
Akira Nukada, Yutaka Maruyama, Satoshi Matsuoka
Organizer
In Proceedings of the Fifth Workshop on General Purpose Processing using Graphics Processing Units (GPGPU-5), in conjunction with ACM ASPLOS XVII
Place of Presentation
London, UK, ACM Press
Year and Date
2012-03-03
Related Report
[Presentation] Peta-scale Phase-Field Simulation for Dendritic Solidification on the TSUBAME 2.0 Supercomputer (2011)
Author(s)
Takashi Shimokawabe, Takayuki Aoki, Tomohiro Takaki, Akinori Yamanaka, Akira Nukada, Toshio Endo, Naoya Maruyama, and Satoshi Matsuoka
Organizer
In Proc. of 2011 ACM/IEEE International Conference for High Performance, Networking, Storage, and Analysis (SC'11)
Place of Presentation
Seattle, ACM Press
Year and Date
2011-11-15
Related Report
[Presentation] Peta-scale Phase-Field Simulation for Dendritic Solidification on the TSUBAME 2.0 Supercomputer (2011)
Author(s)
Takashi Shimokawabe, Takayuki Aoki, Tomohiro Takaki, Akinori Yamanaka, Akira Nukada, Toshio Endo, Naoya Maruyama, Satoshi Matsuoka
Organizer
2011 ACM/IEEE International Conference for High Performance, Networking, Storage, and Analysis (SC'11)
Place of Presentation
Seattle, WA, USA
Year and Date
2011-11-15
Related Report
[Presentation] An 80-Fold Speedup, 15.0 TFlops, Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code (2010)
Author(s)
Takashi Shimokawabe, Takayuki Aoki, Chiashi Muroi, Junichi Ishida, Kohei Kawano, Toshio Endo, Akira Nukada, Naoya Maruyama, and Satoshi Matsuoka
Organizer
In Proc. of the 2010 ACM/IEEE conference on Supercomputing (SC'10)
Place of Presentation
New Orleans, IEEE Press
Year and Date
2010-11-17
Related Report