2012 Fiscal Year Research-status Report

Development of GPGPU Infrastructure Software for Accelerating Evolutionary Computation (進化計算の高速化を実現するＧＰＧＰＵ基盤ソフトウェアの開発)

Research Project

Project/Area Number	23500285
Research Institution	Osaka Prefecture University
Principal Investigator	Noriyuki Fujimoto (藤本典幸), Osaka Prefecture University, Graduate School of Science, Professor (90294165)
Keywords	Evolutionary computation / Parallel processing / GPGPU
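Research Abstract	The aim of this research is to develop, ahead of the rest of the world, general techniques and a methodology for running evolutionary computation, applicable to a wide variety of problems, at high speed on GPUs, a parallel computing platform that is attracting attention now and is promising for the future. Evolutionary computation is a generic term for algorithms that search for solutions by repeatedly applying operations to a population of individuals; it includes several methods, such as genetic algorithms. A genetic algorithm repeatedly applies genetic operators to multiple individuals, each representing a candidate solution. Because the processing of each individual is independent, there is obvious parallelism across individuals. However, for many problems, such as the quadratic assignment problem and the traveling salesman problem, a well-performing genetic algorithm combined with local search uses a population of only several tens to several hundreds of individuals. A naive parallelization that runs the processing of one individual in one thread therefore limits the whole genetic algorithm to at most several tens to several hundreds of threads, which falls short of the tens of thousands of threads that a CUDA-capable GPU requires. Consequently, although GPUs are attracting attention as a computing platform with a very good price-performance ratio, it is difficult to apply them naively to evolutionary computation. In fiscal year 2012 (Heisei 24), continuing from the previous year, we studied methods that, in addition to exploiting the parallelism across individuals, also extract the parallelism within the processing of a single individual, so that the genetic algorithm as a whole is processed with a high degree of parallelism.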
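The thread-count argument in the abstract can be illustrated with a small CPU-side model. The sketch below is an assumption made for illustration, not the authors' implementation: it takes QAP cost evaluation as the per-individual work and assigns one work item to each (individual, i, j) term instead of one per individual. The names `qap_cost` and `fine_grained_costs` and the toy matrices are hypothetical.

```python
# CPU-side model of two-level parallelism for QAP cost evaluation.
# Assumption: one work item per (individual, i, j) term. The report does not
# state the decomposition the authors actually used.

def qap_cost(flow, dist, perm):
    """Sequential QAP cost: sum over i, j of flow[i][j] * dist[perm[i]][perm[j]]."""
    n = len(perm)
    return sum(flow[i][j] * dist[perm[i]][perm[j]]
               for i in range(n) for j in range(n))

def fine_grained_costs(flow, dist, population):
    """Evaluate every individual using one work item per (individual, i, j)."""
    n = len(flow)
    num_threads = len(population) * n * n
    costs = [0] * len(population)
    for tid in range(num_threads):      # each iteration models one GPU thread
        k, rest = divmod(tid, n * n)    # which individual this thread serves
        i, j = divmod(rest, n)          # which term of that individual's cost
        perm = population[k]
        # On a GPU this accumulation would be a per-individual reduction.
        costs[k] += flow[i][j] * dist[perm[i]][perm[j]]
    return costs, num_threads

# Toy instance with 3 facilities and 2 individuals (hypothetical data).
flow = [[0, 3, 1], [3, 0, 2], [1, 2, 0]]
dist = [[0, 5, 4], [5, 0, 6], [4, 6, 0]]
population = [[0, 1, 2], [2, 0, 1]]
costs, num_threads = fine_grained_costs(flow, dist, population)
print(costs, num_threads)
```

Under this decomposition the number of independent work items grows with the square of the problem size: a hypothetical population of 100 individuals on a 100-facility instance yields 100 × 100 × 100 = 1,000,000 work items, compared with 100 threads for one thread per individual.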
Current Status of Research Progress	2: Research has progressed on the whole more than it was originally planned. Reason: We are studying how to apply GPUs to fast solution methods for the quadratic assignment problem (QAP), one of the hardest of the optimal assignment problems. Like the TSP, the QAP is used as a benchmark problem for combinatorial optimization methods, and it has many real-world applications, such as the optimal placement of hospital departments and optimal factory location planning, so a fast solution method is of practical value. For solving the QAP, combining evolutionary computation with local search is known to be effective. This fiscal year we studied a method for parallelizing and accelerating ACO (Ant Colony Optimization) using multiple GPUs. We also studied how to implement a histogram-based EDA (estimation of distribution algorithm) on a GPU. The results were reported at the international conference CEC 2012 [1]. [1] Shigeyoshi Tsutsui and Noriyuki Fujimoto. Implementation of histogram based sampling algorithm within an EDA scheme with CUDA, Proceedings of the IEEE Congress on Evolutionary Computation (CEC 2012), pp. 1-8, IEEE, 2012.
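As a rough illustration of what histogram-based sampling in an EDA can look like for permutation problems such as the QAP, the sketch below builds a node histogram (how often each value occurs at each position) and samples a new permutation from it. This is a generic sequential sketch under stated assumptions, not the algorithm of [1] and not a GPU implementation; `node_histogram`, `sample_permutation`, and the `bias` parameter are hypothetical names.

```python
import random

def node_histogram(population, n, bias=1.0):
    """hist[pos][val] = bias + number of individuals with value val at position pos."""
    hist = [[bias] * n for _ in range(n)]
    for perm in population:
        for pos, val in enumerate(perm):
            hist[pos][val] += 1
    return hist

def sample_permutation(hist, rng):
    """Sample a permutation position by position, proportionally to the histogram."""
    n = len(hist)
    unused = list(range(n))
    perm = []
    for pos in range(n):
        # Roulette-wheel selection restricted to values not yet placed.
        weights = [hist[pos][v] for v in unused]
        val = rng.choices(unused, weights=weights, k=1)[0]
        perm.append(val)
        unused.remove(val)
    return perm

rng = random.Random(0)  # fixed seed so the run is reproducible
population = [[0, 1, 2, 3], [0, 2, 1, 3], [1, 0, 2, 3]]  # hypothetical parents
hist = node_histogram(population, 4)
child = sample_permutation(hist, rng)
print(child)
```

The `bias` term keeps every value selectable at every position, so the sampler can still produce assignments that no current individual contains.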
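Strategy for Future Research Activity	In algorithms such as numerical computation, for which many successful uses of GPUs have been reported, the access pattern to large arrays is either determined statically or, even when determined dynamically, is not random. This makes shared memory and cache memory relatively easy to use. In a genetic algorithm, by contrast, the access pattern to the large array that stores the genetic information of many individuals is determined dynamically and randomly. Making effective use of the GPU's cache memory and shared memory is therefore a challenging problem for genetic algorithms. In this research we will therefore develop a software cache technique that accelerates random access to large arrays. Large arrays have to be placed in VRAM, but if a technique (a software cache) is available that dynamically loads the parts with high locality of reference into the shared memory of each multiprocessor, the execution of the genetic algorithm as a whole should be accelerated substantially. Because GPU VRAM and shared memory have performance characteristics known as coalesced access and bank conflicts, developing this software cache technique is a challenging task.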
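The software cache idea can be modeled on the CPU as follows. The report does not describe the cache design, so everything here is an assumption: a direct-mapped, tile-granular cache, with a Python list standing in for the array in VRAM and a small slot table standing in for one multiprocessor's shared memory. `SoftwareCache` and its parameters are hypothetical.

```python
class SoftwareCache:
    """Direct-mapped, tile-granular cache in front of a large backing array."""

    def __init__(self, backing, tile_size, num_slots):
        self.backing = backing            # stands in for the large array in VRAM
        self.tile_size = tile_size
        self.num_slots = num_slots
        self.tags = [None] * num_slots    # tile number held by each slot
        self.slots = [None] * num_slots   # stands in for shared memory
        self.hits = 0
        self.misses = 0

    def read(self, index):
        tile = index // self.tile_size
        slot = tile % self.num_slots      # direct mapping: each tile has one slot
        if self.tags[slot] == tile:
            self.hits += 1
        else:
            # Miss: load the whole contiguous tile, replacing the slot's old tile.
            self.misses += 1
            start = tile * self.tile_size
            self.slots[slot] = self.backing[start:start + self.tile_size]
            self.tags[slot] = tile
        return self.slots[slot][index % self.tile_size]

genes = list(range(1000, 1064))           # hypothetical gene array of 64 entries
cache = SoftwareCache(genes, tile_size=8, num_slots=2)
values = [cache.read(i) for i in (3, 5, 4, 40, 3, 19, 3)]
print(values, cache.hits, cache.misses)
```

Fetching whole contiguous tiles is the access shape that coalescing favors, and the last two reads show the cost of a direct mapping: indices 3 and 19 compete for the same slot, so the final read of index 3 misses again.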
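Expenditure Plans for the Next FY Research Funding	Just over 220,000 yen was carried over to the next fiscal year because the new-architecture GPU that was due to be released during the fiscal year was pushed back to the end of the fiscal year by delays in the schedule for shrinking the semiconductor manufacturing process. GPU performance improves rapidly (a new GPU with substantially higher performance is commercialized roughly once every 1 to 1.5 years), so a computer equipped with the latest GPU is needed to advance the research effectively. This fiscal year in particular is a year in which the GPU architecture is renewed: a high-end GPU based on the new architecture was released in March, but it was hard to obtain at the end of the fiscal year and is expected to become easier to obtain from June onward. We therefore plan to purchase one GPU compute server. General-purpose computing on GPUs is a new topic that has emerged over the last few years, so gathering information through direct exchange with other researchers at conferences is essential for advancing the research effectively. In addition to presenting research results, we therefore plan about three domestic trips and about two overseas trips for information gathering. When evaluating the performance of the GPU programs we develop, comparison with other GPU programs is a matter of course, but they should also be compared with CPU programs that perform as well as possible. We therefore plan to purchase an annual license for the Intel compiler, which generates high-performance object code for CPUs.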

Research Products
(10 results)

All Journal Article (5 results) (of which Peer Reviewed: 4 results) Presentation (5 results)

[Journal Article] GPU Acceleration of the Highly Simplified MAC Method (Highly Simplified MAC法のGPUを用いた高速化) (2013)
- Author(s)
  河南克也, 藤本典幸
- Journal Title
  Proceedings of the 2013 IEICE General Conference
  Volume: none Pages: 11-11
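[Journal Article] Proposal and Evaluation of a Method for Accelerating Multiple-Precision Integer Multiplication on GPUs (GPU による多倍長整数乗算の高速化手法の提案とその評価) (2012)
- Author(s)
  北野晃司, 藤本典幸
- Journal Title
  Proceedings of the 10th Symposium on Advanced Computing Systems and Infrastructures (SACSIS)
  Volume: none Pages: 185-192
- Peer Reviewed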
[Journal Article] Implementation of Histogram Based Sampling Algorithm within an EDA Scheme with CUDA (2012)
- Author(s)
  Shigeyoshi Tsutsui, Noriyuki Fujimoto
- Journal Title
  Proceedings of IEEE World Congress on Computational Intelligence
  Volume: none Pages: 1-8
- DOI
  10.1109/CEC.2012.6256444
- Peer Reviewed
[Journal Article] On the effect of using multiple GPUs in solving QAPs with CUDA (2012)
- Author(s)
  Shigeyoshi Tsutsui, Noriyuki Fujimoto
- Journal Title
  the Genetic and Evolutionary Computation Conference (GECCO)
  Volume: none Pages: 629-630
- DOI
  10.1145/2330784.2330893
- Peer Reviewed
[Journal Article] GPU Acceleration of BCP Procedure for SAT Algorithms (2012)
- Author(s)
  Hironori Fujii, Noriyuki Fujimoto
- Journal Title
  Proceedings of International Conference on Parallel and Distributed Processing Techniques and Applications
  Volume: II Pages: 764-770
- Peer Reviewed
[Presentation] GPU Acceleration of the Highly Simplified MAC Method (Highly Simplified MAC法のGPUを用いた高速化) (2013)
- Author(s)
  河南克也, 藤本典幸
- Organizer
  IEICE General Conference
- Place of Presentation
  Gifu
- Year and Date
  2013-03-19 to 2013-03-22
[Presentation] GPU Acceleration of BCP Procedure for SAT Algorithms (2012)
- Author(s)
  Hironori Fujii, Noriyuki Fujimoto
- Organizer
  Proceedings of International Conference on Parallel and Distributed Processing Techniques and Applications
- Place of Presentation
  Las Vegas, USA
- Year and Date
  2012-07-16 to 2012-07-19
[Presentation] On the effect of using multiple GPUs in solving QAPs with CUDA (2012)
- Author(s)
  Shigeyoshi Tsutsui, Noriyuki Fujimoto
- Organizer
  the Genetic and Evolutionary Computation Conference (GECCO)
- Place of Presentation
  Philadelphia, USA
- Year and Date
  2012-07-07 to 2012-07-11
[Presentation] Implementation of Histogram Based Sampling Algorithm within an EDA Scheme with CUDA (2012)
- Author(s)
  Shigeyoshi Tsutsui, Noriyuki Fujimoto
- Organizer
  Proceedings of IEEE World Congress on Computational Intelligence
- Place of Presentation
  Brisbane, Australia
- Year and Date
  2012-06-10 to 2012-06-15
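[Presentation] Proposal and Evaluation of a Method for Accelerating Multiple-Precision Integer Multiplication on GPUs (GPU による多倍長整数乗算の高速化手法の提案とその評価) (2012)
- Author(s)
  北野晃司, 藤本典幸
- Organizer
  The 10th Symposium on Advanced Computing Systems and Infrastructures (SACSIS)
- Place of Presentation
  Kobe
- Year and Date
  2012-05-16 to 2012-05-18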
