
Auto-tuning FFT using GPU

Research Project

Project/Area Number 22680002
Research Category

Grant-in-Aid for Young Scientists (A)

Allocation Type Single-year Grants
Research Field Software
Research Institution Tokyo Institute of Technology

Principal Investigator

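NUKADA Akira  Tokyo Institute of Technology, Global Scientific Information and Computing Center, Industry-Academia-Government Collaboration Researcher (40545688)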

Project Period (FY) 2010 – 2011
Project Status Completed (Fiscal Year 2011)
Budget Amount
¥11,700,000 (Direct Cost: ¥9,000,000, Indirect Cost: ¥2,700,000)
Fiscal Year 2011: ¥5,200,000 (Direct Cost: ¥4,000,000, Indirect Cost: ¥1,200,000)
Fiscal Year 2010: ¥6,500,000 (Direct Cost: ¥5,000,000, Indirect Cost: ¥1,500,000)
Keywords Software Engineering / Fast Fourier Transform / GPU / CUDA / Auto-tuning / OpenCL
Research Abstract

We developed NukadaFFT, an auto-tuning FFT library for NVIDIA CUDA GPUs, which outperforms NVIDIA's CUFFT library in many cases. We also implemented a multi-GPU version that runs both on a single node with multiple GPUs and across multiple nodes, achieving further speed-up.

Report

(3 results)
  • 2011 Annual Research Report   Final Research Report ( PDF )
  • 2010 Annual Research Report
  • Research Products

    (34 results)


All Journal Article (5 results) (of which Peer Reviewed: 5 results) Presentation (24 results) Book (1 result) Remarks (4 results)

  • [Journal Article] Achieving Over 1 Petaflops Linpack Performance on the TSUBAME 2.0 Supercomputer (2011)

    • Author(s)
      Toshio Endo, Akira Nukada, Satoshi Matsuoka
    • Journal Title

      IPSJ Transactions on Advanced Computing Systems

      Volume: Vol.4, No.4(ACS35) Pages: 169-179

    • Related Report
      2011 Final Research Report
    • Peer Reviewed
  • [Journal Article] Achieving Over 1 Petaflops Linpack Performance on the TSUBAME 2.0 Supercomputer (2011)

    • Author(s)
      Toshio Endo, Akira Nukada, Satoshi Matsuoka
    • Journal Title

      IPSJ Transactions on Advanced Computing Systems

      Volume: Vol.4, No.4 Pages: 169-179

    • NAID

      40019259212

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Fast Fourier Transform Using CUDA, 応用数理 (2010)

    • Author(s)
      Akira Nukada
    • Journal Title

      応用数理学会 (Japan Society for Industrial and Applied Mathematics)

      Volume: Vol.20, No.2 Pages: 37-43

    • Related Report
      2011 Final Research Report
    • Peer Reviewed
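  • [Journal Article] Linpack Evaluation of the TSUBAME Supercomputer with Heterogeneous Accelerators, 応用数理 (2010)

    • Author(s)
      Toshio Endo, Akira Nukada, Satoshi Matsuoka
    • Journal Title

      応用数理学会 (Japan Society for Industrial and Applied Mathematics)

      Volume: Vol.20, No.2 Pages: 29-36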

    • Related Report
      2011 Final Research Report
    • Peer Reviewed
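  • [Journal Article] Fast Fourier Transform Using CUDA (2010)

    • Author(s)
      Akira Nukada
    • Journal Title

      応用数理 (Bulletin of the Japan Society for Industrial and Applied Mathematics)

      Volume: Vol.20, No.2 Pages: 37-43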

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Presentation] High Performance 3-D FFT using multiple CUDA GPUs (2012)

    • Author(s)
      Akira Nukada, Yutaka Maruyama, Satoshi Matsuoka
    • Organizer
      In Proceedings of the Fifth Workshop on General Purpose Processing using Graphics Processing Units (GPGPU-5) in conjunction with ACM ASPLOS XVII
    • Place of Presentation
      London, UK, ACM Press
    • Year and Date
      2012-03-03
    • Related Report
      2011 Final Research Report
  • [Presentation] High Performance 3-D FFT using multiple CUDA GPUs (2012)

    • Author(s)
      Akira Nukada, Yutaka Maruyama, Satoshi Matsuoka
    • Organizer
      Fifth Workshop on General Purpose Processing using Graphics Processing Units (GPGPU-5) in conjunction with ACM ASPLOS XVII
    • Place of Presentation
      London, UK
    • Year and Date
      2012-03-03
    • Related Report
      2011 Annual Research Report
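  • [Presentation] Operation of the Green Supercomputer TSUBAME2.0 in Response to the Power Crisis (2011)

    • Author(s)
      Toshio Endo, Satoshi Matsuoka, Akira Nukada, 長坂真路, 四津匡康
    • Organizer
      IPSJ SIG Technical Reports
    • Place of Presentation
      Sapporo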
    • Year and Date
      2011-11-28
    • Related Report
      2011 Final Research Report
  • [Presentation] Peta-scale Phase-Field Simulation for Dendritic Solidification on the TSUBAME 2.0 Supercomputer (2011)

    • Author(s)
      Takashi Shimokawabe, Takayuki Aoki, Tomohiro Takaki, Akinori Yamanaka, Akira Nukada, Toshio Endo, Naoya Maruyama, and Satoshi Matsuoka
    • Organizer
      In Proc. of 2011 ACM/IEEE International Conference for High Performance, Networking, Storage, and Analysis (SC'11)
    • Place of Presentation
      Seattle, ACM Press
    • Year and Date
      2011-11-15
    • Related Report
      2011 Final Research Report
  • [Presentation] Peta-scale Phase-Field Simulation for Dendritic Solidification on the TSUBAME 2.0 Supercomputer (2011)

    • Author(s)
      Takashi Shimokawabe, Takayuki Aoki, Tomohiro Takaki, Akinori Yamanaka, Akira Nukada, Toshio Endo, Naoya Maruyama, Satoshi Matsuoka
    • Organizer
      2011 ACM/IEEE International Conference for High Performance, Networking, Storage, and Analysis (SC'11)
    • Place of Presentation
      Seattle, WA, USA
    • Year and Date
      2011-11-15
    • Related Report
      2011 Annual Research Report
  • [Presentation] Hamming Color Code for Dense and Robust One-shot 3D Scanning (2011)

    • Author(s)
      Shuntaro Yamazaki, Akira Nukada, Masaaki Mochimaru
    • Organizer
      In Proc. of the 2011 British Machine Vision Conference
    • Place of Presentation
      Dundee, Scotland, Springer
    • Year and Date
      2011-08-30
    • Related Report
      2011 Final Research Report
  • [Presentation] Hamming Color Code for Dense and Robust One-shot 3D Scanning (2011)

    • Author(s)
      Shuntaro Yamazaki, Akira Nukada, Masaaki Mochimaru
    • Organizer
      2011 British Machine Vision Conference
    • Place of Presentation
      Dundee, Scotland
    • Year and Date
      2011-08-30
    • Related Report
      2011 Annual Research Report
  • [Presentation] Fast Fourier Transform for AMD GPUs (2011)

    • Author(s)
      Akira Nukada
    • Organizer
      AMD Fusion Developer Summit 2011
    • Place of Presentation
      Bellevue, WA
    • Year and Date
      2011-06-15
    • Related Report
      2011 Final Research Report
  • [Presentation] Fast Fourier Transform for AMD GPUs (2011)

    • Author(s)
      Akira Nukada
    • Organizer
      AMD Fusion Developer Summit 2011
    • Place of Presentation
      Bellevue, WA, USA
    • Year and Date
      2011-06-15
    • Related Report
      2011 Annual Research Report
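  • [Presentation] Achieving Over 1 Petaflops Linpack Performance on the TSUBAME 2.0 Supercomputer (2011)

    • Author(s)
      Toshio Endo, Akira Nukada, Satoshi Matsuoka
    • Organizer
      Symposium on Advanced Computing Systems and Infrastructures (SACSIS2011)
    • Place of Presentation
      Akihabara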
    • Year and Date
      2011-05-27
    • Related Report
      2011 Final Research Report
  • [Presentation] NVCR: A Transparent Checkpoint-Restart Library for NVIDIA CUDA (2011)

    • Author(s)
      Akira Nukada, Hiroyuki Takizawa, and Satoshi Matsuoka
    • Organizer
      In Proc. of 20th Heterogeneity in Computing Workshop (HCW 2011), in conjunction with IPDPS 2011
    • Place of Presentation
      Anchorage, AK, USA
    • Year and Date
      2011-05-16
    • Related Report
      2011 Final Research Report
  • [Presentation] NVCR: A Transparent Checkpoint-Restart Library for NVIDIA CUDA (2011)

    • Author(s)
      Akira Nukada, Hiroyuki Takizawa, Satoshi Matsuoka
    • Organizer
      20th Heterogeneity in Computing Workshop (HCW 2011), in conjunction with IEEE IPDPS 2011
    • Place of Presentation
      Anchorage, AK, USA
    • Year and Date
      2011-05-16
    • Related Report
      2011 Annual Research Report
  • [Presentation] Low-overhead diskless checkpoint for hybrid computing systems (2010)

    • Author(s)
      Leonardo Bautista Gomez, Akira Nukada, Naoya Maruyama, Franck Cappello and Satoshi Matsuoka
    • Organizer
      In Proc. of International Conference on High Performance Computing (HiPC 2010)
    • Place of Presentation
      Goa, India
    • Year and Date
      2010-12-20
    • Related Report
      2011 Final Research Report
  • [Presentation] Efficient PageRank on GPU Clusters (2010)

    • Author(s)
      Ali Cevahir, Cevdet Aykanat, Ata Turk, B. Barla Cambazoglu, Akira Nukada and Satoshi Matsuoka
    • Organizer
      IPSJ SIG Technical Reports
    • Place of Presentation
      Sapporo
    • Year and Date
      2010-12-17
    • Related Report
      2011 Final Research Report
  • [Presentation] Performance Evaluation of the Heterogeneous Supercomputer TSUBAME 2.0 Using Linpack (2010)

    • Author(s)
      Toshio Endo, Akira Nukada, Satoshi Matsuoka
    • Organizer
      IPSJ SIG Technical Reports
    • Place of Presentation
      Sapporo
    • Year and Date
      2010-12-16
    • Related Report
      2011 Final Research Report
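  • [Presentation] Model-Based Optimization of Power Efficiency on GPUs (2010)

    • Author(s)
      Hitoshi Nagasaka, Naoya Maruyama, Akira Nukada, Toshio Endo, Satoshi Matsuoka
    • Organizer
      IPSJ SIG Technical Reports
    • Place of Presentation
      Sapporo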
    • Year and Date
      2010-12-16
    • Related Report
      2011 Final Research Report
  • [Presentation] An 80-Fold Speedup, 15.0 TFlops, Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code (2010)

    • Author(s)
      Takashi Shimokawabe, Takayuki Aoki, Chiashi Muroi, Junichi Ishida, Kohei Kawano, Toshio Endo, Akira Nukada, Naoya Maruyama and Satoshi Matsuoka
    • Organizer
      In Proc. of the 2010 ACM/IEEE conference on Supercomputing (SC'10)
    • Place of Presentation
      New Orleans, IEEE Press
    • Year and Date
      2010-11-17
    • Related Report
      2011 Final Research Report
  • [Presentation] NukadaFFT: An Auto-Tuning FFT Library for CUDA GPUs (2010)

    • Author(s)
      Akira Nukada and Satoshi Matsuoka
    • Organizer
      NVIDIA GPU Technology Conference 2010
    • Place of Presentation
      Research Summit Poster, San Jose
    • Year and Date
      2010-09-22
    • Related Report
      2011 Final Research Report
  • [Presentation] Statistical Power Modeling of GPU Kernels Using Performance Counters (2010)

    • Author(s)
      Hitoshi Nagasaka, Naoya Maruyama, Akira Nukada, Toshio Endo and Satoshi Matsuoka
    • Organizer
      Proceedings of the First International Green Computing Conference (IGCC'10)
    • Place of Presentation
      Chicago
    • Year and Date
      2010-08-17
    • Related Report
      2011 Final Research Report
  • [Presentation] High Performance Conjugate Gradient Solver on Multi-GPU Clusters Using Hypergraph Partitioning (2010)

    • Author(s)
      Ali Cevahir, Akira Nukada, and Satoshi Matsuoka
    • Organizer
      Computer Science. Research and Development
    • Place of Presentation
      Hamburg, Germany
    • Year and Date
      2010-05-31
    • Related Report
      2011 Final Research Report
  • [Presentation] Fast Fourier Transform using CUDA GPUs (2010)

    • Author(s)
      Akira Nukada and Satoshi Matsuoka
    • Organizer
      ETHZ-Tokyo Tech Workshop: Computing with GPUs, Cells, and Multicores
    • Place of Presentation
      Zurich, Switzerland
    • Year and Date
      2010-05-11
    • Related Report
      2011 Final Research Report 2010 Annual Research Report
  • [Presentation] Linpack Evaluation on a Supercomputer with Heterogeneous Accelerators (2010)

    • Author(s)
      Toshio Endo, Akira Nukada, Satoshi Matsuoka, and Naoya Maruyama
    • Organizer
      In Proceedings of 24th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2010)
    • Place of Presentation
      Atlanta
    • Year and Date
      2010-04-21
    • Related Report
      2011 Final Research Report
  • [Presentation] A High-Performance Fault-Tolerant Software Framework for Memory on Commodity GPUs (2010)

    • Author(s)
      Naoya Maruyama, Akira Nukada, and Satoshi Matsuoka
    • Organizer
      In Proceedings of 24th IEEE International Parallel and Distributed Processing Symposium
    • Place of Presentation
      Atlanta
    • Year and Date
      2010-04-20
    • Related Report
      2011 Final Research Report
  • [Presentation] NukadaFFT: An Auto-Tuning FFT Library for CUDA GPUs (2010)

    • Author(s)
      Akira Nukada, Satoshi Matsuoka
    • Organizer
      NVIDIA GPU Technology Conference 2010
    • Place of Presentation
      San Jose, CA, USA
    • Related Report
      2010 Annual Research Report
  • [Book] Chapter 11 of "Software Automatic Tuning: From Concepts to the State-of-the-Art Results" (2010)

    • Author(s)
      Tamito Kajiyama, Akira Nukada, Reiji Suda, Hidehiko Hasegawa, and Akira Nishida
    • Related Report
      2011 Final Research Report
  • [Remarks] The NukadaFFT library software, part of the results of this research, is available at the URL below

    • URL

      http://matsu-www.is.titech.ac.jp/~nukada/nufft/

    • Related Report
      2011 Final Research Report
  • [Remarks]

    • URL

      http://matsu-www.is.titech.ac.jp/~nukada/nufft/

    • Related Report
      2011 Annual Research Report
  • [Remarks] The library software is distributed via the web page above

    • URL

      http://matsu-www.is.titech.ac.jp/~nukada/nufft/

    • Related Report
      2010 Annual Research Report
  • [Remarks] 263 downloads since the release in September (as of March 24)

    • Related Report
      2010 Annual Research Report

Published: 2010-08-23   Modified: 2016-04-21  
