2008 Fiscal Year Annual Research Report

Research on Parallel Programming Models for GPGPU

Research Project

Project/Area Number	20240002
Research Institution	Osaka University
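Principal Investigator	萩原兼一 Osaka University, Graduate School of Information Science and Technology, Professor (00133140)
Co-Investigator (Kenkyū-buntansha)	伊野文彦 Osaka University, Graduate School of Information Science and Technology, Associate Professor (90346172)
Keywords	GPGPU / CUDA / cone-beam CT reconstruction / nucleotide sequence search / all-pairs shortest path length problem / transmission cross coefficient convolution / shared memory / coalesced access
Research Abstract	The GPU (Graphics Processing Unit) is attracting attention as a parallel computer with 128 to 240 processors. GPUs were initially programmed with the graphics languages OpenGL/Cg, but in 2006 a different programming method, CUDA, was announced. In this research we clarified which design techniques lead to efficient CUDA programs, and compared OpenGL/Cg programs with CUDA programs to determine which yields the more efficient GPU program. The applications studied were a) cone-beam CT reconstruction in image processing, b) nucleotide sequence search in bioinformatics, c) all-pairs shortest path length search in graph theory, and d) transmission cross coefficient convolution, which is widely used in simulation.
1. For a), we compared the CUDA program with the OpenGL/Cg program developed by the principal investigator's group, which has already been adopted in a commercial device. Although the OpenGL/Cg program performs better in some parts, the CUDA program is more efficient overall. The same holds for b).
2. Applications b) through d) require many executions of the same processing in which only part of the input data differs. We obtained insights into how to improve overall throughput by executing such processing simultaneously on the GPU.
3. For a) through d), we obtained useful insights into how to use the small-capacity shared memory effectively, how to choose the number of processes in relation to it, and how to choose the process granularity.
4. Coalesced access, in which processors operating in SIMD fashion access a contiguous region of GPU memory in an orderly manner, is known to improve data access performance. For d), we confirmed the effect of shuffling the input data to achieve it.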
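The coalesced access and input shuffling mentioned in the abstract can be illustrated with a simplified model. This is only a sketch: the report does not describe the actual shuffle used for d), so the record-major to field-major reordering below is an assumed, commonly used example. The group size of 32 threads, the 4-byte words, and the 128-byte segment size are likewise illustrative parameters, and the real coalescing rules depend on the GPU generation. The model counts how many aligned memory segments one group of threads touches when each thread reads one word.

```python
def segments_touched(addresses, segment_bytes=128):
    """Count the distinct aligned memory segments covered by byte addresses."""
    return len({addr // segment_bytes for addr in addresses})


def record_major_addresses(field, num_fields, group_size=32, word_bytes=4):
    """Thread t reads `field` of record t; records are stored back to back."""
    return [(t * num_fields + field) * word_bytes for t in range(group_size)]


def field_major_addresses(field, num_records, group_size=32, word_bytes=4):
    """Thread t reads `field` of record t after the shuffle; each field is contiguous."""
    return [(field * num_records + t) * word_bytes for t in range(group_size)]


def shuffle_to_field_major(data, num_fields):
    """Reorder a flat record-major list into field-major order (hypothetical shuffle)."""
    num_records = len(data) // num_fields
    return [data[r * num_fields + f]
            for f in range(num_fields)
            for r in range(num_records)]


if __name__ == "__main__":
    # Strided reads (8 words per record) spread over many segments.
    print(segments_touched(record_major_addresses(field=0, num_fields=8)))
    # Contiguous reads after the shuffle fall into a single segment.
    print(segments_touched(field_major_addresses(field=0, num_records=1024)))
```

In this model the strided layout touches eight segments per group while the shuffled layout touches one, which is the kind of reduction in memory traffic that coalesced access aims for.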

Research Products
(13 results)


Journal Article (3 results, of which Peer Reviewed: 3 results) / Presentation (9 results) / Remarks (1 result)

[Journal Article] Optimization Techniques for Parallel Biophysical Simulations Generated by insilicoIDE (2009)
- Author(s)
  Eric Heien
- Journal Title
  IPSJ Transactions on Advanced Computing Systems Vol. 2, No. 2 (to appear)
- Peer Reviewed
[Journal Article] Fast Cone-Beam Reconstruction on the GPU: Packing into RGBA Data for Circular-Orbit Scanners (2008)
- Author(s)
  吉田征司
- Journal Title
  IPSJ Transactions on Advanced Computing Systems Vol. 1, No. 3
  Pages: 41-53
- Peer Reviewed
[Journal Article] A Resource Selection System for Cycle Stealing in GPU Grids (2008)
- Author(s)
  Yuki Kotani
- Journal Title
  Journal of Grid Computing Vol. 6, No. 4
  Pages: 399-416
- Peer Reviewed
[Presentation] Real-Time Total Variation Minimization Using the GPU (2009)
- Author(s)
  Toshihiro Matsuda
- Organizer
  Proceedings of Computer Assisted Radiology and Surgery : 23rd International Congress and Exhibition(CARS 2009)
- Place of Presentation
  Berlin
- Year and Date
  2009-06-23
[Presentation] PyMW - a Python Module for Desktop Grid and Volunteer Computing (2009)
- Author(s)
  Eric M. Heien
- Organizer
  Proceedings of the 3rd Workshop on Desktop Grids and Volunteer Computing Systems (PCGrid 2009)
- Place of Presentation
  Rome
- Year and Date
  2009-05-29
[Presentation] Harnessing the Power of Idle GPUs for Acceleration of Biological Sequence Alignment (2009)
- Author(s)
  Fumihiko Ino
- Organizer
  Proceedings of the 2nd Workshop on Large-Scale Parallel Processing (LSPP 2009)
- Place of Presentation
  Rome
- Year and Date
  2009-05-29
[Presentation] PyMW - a Python Module for Parallel Master Worker Computing (2009)
- Author(s)
  Eric M. Heien
- Organizer
  Proceedings of the 1st International Conference on Parallel, Distributed and Grid Computing for Engineering (PARENG 2009)
- Place of Presentation
  Pecs
- Year and Date
  2009-04-15
[Presentation] RGBA Packing for Fast Cone Beam Reconstruction on the GPU (2009)
- Author(s)
  Fumihiko Ino
- Organizer
  Proceedings of the SPIE Medical Imaging (MI 2009)
- Place of Presentation
  Orlando
- Year and Date
  2009-02-12
[Presentation] Fast Cone Beam Reconstruction Using the CUDA-enabled GPU (2008)
- Author(s)
  Yusuke Okitsu
- Organizer
  Proceedings of 15th International Conference on High Performance Computing (HiPC 2008)
- Place of Presentation
  Bangalore
- Year and Date
  2008-12-16 to 2008-12-19
[Presentation] A Task Parallel Algorithm for Computing the Costs of All-Pairs Shortest Paths on the CUDA-compatible GPU (2008)
- Author(s)
  Tomohiro Okuyama
- Organizer
  Proceedings of the 6th International Symposium on Parallel and Distributed Processing and Applications (ISPA 2008)
- Place of Presentation
  Sydney
- Year and Date
  2008-12-08 to 2008-12-13
[Presentation] Design and Implementation of the Smith-Waterman Algorithm on the CUDA-Compatible GPU (2008)
- Author(s)
  Yuma Munekawa
- Organizer
  Proceedings of the 8th IEEE International Conference on Bioinformatics and Bioengineering (BIBE 2008)
- Place of Presentation
  Athens
- Year and Date
  2008-10-06 to 2008-10-12
[Presentation] Computing Low Latency Batches with Unreliable Workers in Volunteer Computing Environments (2008)
- Author(s)
  Eric Martin Heien
- Organizer
  Proceedings of the 2nd Workshop on Desktop Grids and Volunteer Computing Systems (PCGrid 2008)
- Place of Presentation
  Miami
- Year and Date
  2008-04-15
[Remarks]
- URL
  http://www-hagi.ist.osaka-u.ac.jp/
