Research on Integration of Communication and Computation by Tightly Coupled Accelerators

Research Project

Project/Area Number	15K00166
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Research Field	High performance computing
Research Institution	The University of Tokyo
Principal Investigator	Hanawa Toshihiro 東京大学, 情報基盤センター, 准教授 (30308283)
Project Period (FY)	2015-04-01 – 2019-03-31
Project Status	Completed (Fiscal Year 2018)
Budget Amount *help	¥4,680,000 (Direct Cost: ¥3,600,000、Indirect Cost: ¥1,080,000) Fiscal Year 2017: ¥1,040,000 (Direct Cost: ¥800,000、Indirect Cost: ¥240,000) Fiscal Year 2016: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000) Fiscal Year 2015: ¥2,860,000 (Direct Cost: ¥2,200,000、Indirect Cost: ¥660,000)
Keywords	FPGA / 演算加速装置 / PCI Express / OpenCL / OpenACC / 演算と通信の融合 / GPUクラスタ / 高性能インタコネクト / 密結合演算加速アーキテクチャ / 密結合演算加速機構アーキテクチャ
Outline of Final Research Achievements	Tightly Coupled Accelerators (TCA) architecture, which realizes direct communication among accelerators such as GPUs, is effective for improving strong-scaling performance thanks to low-latency of TCA. In the present study, the feasibility study was performed for the purpose of realization of highly efficient computation by fusion of fast communication using TCA and FPGA computation. Several kernels including numerical algorithms were described for accelerator by OpenCL, and higher performance could be achieved by further modification toward highly pipelined manner. Automatic conversion from OpenACC to OpenCL was also investigated. However, since drastic modification is required from traditional description manner, it is considered that automatic optimization is too complicated.
Academic Significance and Societal Importance of the Research Achievements	演算加速器向けのプログラミング言語であるOpenCLを用いたFPGA実装がFeasibleであることを示した。しかしGPUのようにデータ並列の記述では性能が得られず、FPGAのアーキテクチャを考慮し記述する必要がある。OpenCLのカーネルを分割し、パイプライン方式での制御に変更することで行列積については高い性能が得られた。また、通常のソフトウェア最適化技術と逆行する、冗長な記述や、ループ中での分岐などがFPGAで有効である。今後に向けた最新インタフェースとして、CPUとキャッシュ一貫性を持つFPGA接続、3次元積層メモリに関して性能確認を行い、現状の各5倍、30倍程度のバンド幅が期待できる。

Report

(5 results)

2018 Annual Research Report Final Research Report ( PDF )
2017 Research-status Report
2016 Research-status Report
2015 Research-status Report

Research Products
(5 results)

All 2018 2017 2016

All Presentation (5 results) (of which Int'l Joint Research: 1 results)

[Presentation] OpenCLを用いたFPGAによる階層型行列計算2018
- Author(s)
  塙敏博，伊田明弘，星野哲也
- Organizer
  情報処理学会ハイパフォーマンス研究会
- Related Report
  2017 Research-status Report
[Presentation] Design Experience on Intel HARP2 Platform using OpenCL2018
- Author(s)
  Toshihiro Hanawa, Taisuke Boku
- Organizer
  FPGAs in High Performance Computing Mini Symposium, SIAM-PP18
- Related Report
  2017 Research-status Report
- Int'l Joint Research
[Presentation] 階層型行列計算のFPGAへの適用2017
- Author(s)
  塙敏博，伊田明弘，星野哲也
- Organizer
  情報処理学会ハイパフォーマンス研究会
- Related Report
  2017 Research-status Report
[Presentation] FPGAを用いた階層型行列ベクトル積2016
- Author(s)
  塙敏博伊田明弘大島聡史河合直聡
- Organizer
  情報処理学会ハイパフォーマンス研究会
- Place of Presentation
  キッセイ文化ホール(長野県・松本市)
- Year and Date
  2016-08-08
- Related Report
  2016 Research-status Report
[Presentation] FPGAを用いた疎行列数値計算の性能評価2016
- Author(s)
  大島聡史 , 塙敏博 , 片桐孝洋 , 中島研吾
- Organizer
  情報処理学会ハイパフォーマンス研究会
- Place of Presentation
  ホテル椿舘/にぎたつ会館（愛媛県松山市）
- Year and Date
  2016-03-01
- Related Report
  2015 Research-status Report

Research on Integration of Communication and Computation by Tightly Coupled Accelerators

Principal Investigator

Hanawa Toshihiro 東京大学, 情報基盤センター, 准教授 (30308283)

¥4,680,000 (Direct Cost: ¥3,600,000、Indirect Cost: ¥1,080,000)

Report

Research Products

[Presentation] OpenCLを用いたFPGAによる階層型行列計算2018

Author(s)

Organizer

Related Report

[Presentation] Design Experience on Intel HARP2 Platform using OpenCL2018

Author(s)

Organizer

Related Report

[Presentation] 階層型行列計算のFPGAへの適用2017

Author(s)

Organizer

Related Report

[Presentation] FPGAを用いた階層型行列ベクトル積2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] FPGAを用いた疎行列数値計算の性能評価2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report