2004 Fiscal Year Annual Research Report

Efficient Implementation Techniques for Sparse Matrix Linear Algebra Libraries Using Distributed Shared Memory

Research Project

Project/Area Number	16016225
Research Institution	The University of Tokyo
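Principal Investigator	Akira Nishida (西田晃), The University of Tokyo, Graduate School of Information Science and Technology, Research Associate (60302808)
Keywords	Sparse matrix linear algebra library / Parallel processing / Software distributed shared memory / PC cluster / High-performance network / Commodity technology / InfiniBand / PCI Express
Research Abstract	This fiscal year we obtained the following results on implementation techniques for sparse matrix linear algebra operations using distributed shared memory, and on methods for realizing software distributed shared memory using PCI Express and InfiniBand. Implementation techniques for sparse numerical linear algebra operations using distributed shared memory: In iterative solvers for large sparse matrices, vector-vector operations are an important kernel that accounts for most of the computation. However, these operations are dominated by memory references, which makes it difficult to exploit cache memory, and it is known that good performance is hard to obtain from parallel processing on scalar architectures. We therefore evaluated existing distributed shared memory systems and found that running a vector operation benchmark at varying degrees of parallelism reveals the characteristics of a machine comparatively clearly. From these results we also showed that an efficient implementation of vector operations requires an architecture built with attention to memory bandwidth and network performance, from a viewpoint different from that of ordinary applications. Realization of software distributed shared memory using PCI Express and InfiniBand: PCI Express is a next-generation serial interface standard that is compatible with the conventional PCI bus I/O standard, and it entered practical use in 2004. Up to 32 lanes, each with a bandwidth of 2.5 Gb/s per direction, can be combined, giving an effective bandwidth of up to 16 GB/s (the total over both directions). InfiniBand, with a maximum bandwidth of 6.4 GB/s, is one high-speed network technology for which PCI Express support is planned, and combining these with suitable compute nodes should make it possible to build a high-performance software distributed shared memory. Based on an evaluation of memory performance, we chose the AMD Opteron processor for the compute nodes and built the computing environment needed for the experiments by March. On the software side we plan to port Omni SCASH; in parallel, we examined the technical issues in supporting the above network technologies and CPU architecture in cooperation with an InfiniBand vendor.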
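The PCI Express figures quoted in the abstract can be checked with a short calculation. The sketch below is illustrative only and is not part of the reported work: it assumes the 8b/10b line coding used by first-generation PCI Express (8 payload bits per 10 transmitted bits), and the function name `pcie_bandwidth_gbytes` is hypothetical. Under that assumption, 32 lanes at 2.5 Gb/s give 8 GB/s per direction, so the 16 GB/s figure corresponds to both directions combined.

```python
# Sketch of first-generation PCI Express link arithmetic.
# Assumptions (not stated in the report): 8b/10b line coding; names are hypothetical.

LANE_RATE_GBIT = 2.5          # raw signalling rate per lane, per direction (Gb/s)
ENCODING_EFFICIENCY = 8 / 10  # 8b/10b coding: 8 data bits carried per 10 line bits
BITS_PER_BYTE = 8


def pcie_bandwidth_gbytes(lanes: int, bidirectional: bool = False) -> float:
    """Return the effective bandwidth in GB/s of a link with the given lane count."""
    if lanes < 1 or lanes > 32:
        raise ValueError("first-generation PCI Express links have 1 to 32 lanes")
    per_direction = lanes * LANE_RATE_GBIT * ENCODING_EFFICIENCY / BITS_PER_BYTE
    return per_direction * (2 if bidirectional else 1)


if __name__ == "__main__":
    # Tabulate per-direction and combined bandwidth for common link widths.
    for lanes in (1, 4, 8, 16, 32):
        one_way = pcie_bandwidth_gbytes(lanes)
        both = pcie_bandwidth_gbytes(lanes, bidirectional=True)
        print(f"x{lanes:<2d} {one_way:5.2f} GB/s per direction, {both:5.2f} GB/s combined")
```

Protocol overhead above the line coding (packet headers, flow control) is ignored here, so achievable throughput would be somewhat lower than these values.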

Research Products
(6 results)

[Journal Article] Performance Evaluation of Parallel AMG Preconditioned Conjugate Gradient Methods for Large Scale Eigenproblems (2004)
- Author(s)
  A. Nishida
- Journal Title
  IPSJ SIG Notes 2004(20)
  Pages: 205-210
[Journal Article] Parallel AMG Preconditioned Conjugate Residual Methods for Nonsymmetric Eigenproblems and its Evaluation (2004)
- Author(s)
  A. Nishida
- Journal Title
  IPSJ SIG Notes 2004(81)
  Pages: 85-90
[Journal Article] The Evaluation of The Aggregate Creation Orders: Smoothed Aggregation Algebraic MultiGrid Method (2004)
- Author(s)
  A. Fujii, A. Nishida, Y. Oyanagi
- Journal Title
  Proceedings of International Symposium on High Performance Computational Science and Engineering (CD-ROM)
[Journal Article] Performance Evaluation of Low Level Multithreaded BLAS Kernels on Intel Processor based cc-NUMA Systems (2003)
- Author(s)
  A. Nishida, Y. Oyanagi
- Journal Title
  Lecture Notes in Computer Science 2858
  Pages: 500-510
[Journal Article] Parallel AMG Algorithm by Domain Decomposition (2003)
- Author(s)
  A. Fujii, A. Nishida, Y. Oyanagi
- Journal Title
  IPSJ Transactions on Advanced Computing Systems Vol. 44, No. SIG 6
  Pages: 9-17
[Journal Article] Improvement and evaluation of Smoothed Aggregation MG for anisotropic problems (2003)
- Author(s)
  A. Fujii, A. Nishida, Y. Oyanagi
- Journal Title
  Proceedings of Symposium on Advanced Computing Systems and Infrastructures
  Pages: 137-144
