2001 Fiscal Year Final Research Report Summary

Performance analysis on hybrid parallel programming on SMP cluster system

Research Project

Project/Area Number	12680327
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	計算機科学
Research Institution	University of Tsukuba
Principal Investigator	BOKU Taisuke Institute of Information Sciences and Electronics, Associate Professor, 電子・情報工学系, 助教授 (90209346)
Co-Investigator(Kenkyū-buntansha)	NAKAMURA Hiroshi University of Tokyo, Research Center for Advanced Science and Technology, Associate Professor, 先端科学技術センター, 助教授 (20212102)
Project Period (FY)	2000 – 2001
Keywords	hybrid programming / SMP cluster / MPI / OpenMP / parallelization paradigm
Research Abstract	In this research, we constructed a PC cluster connecting multiple SMP-based PC nodes with various network interfaces. Such an SMP cluster system has both shared memory and distributed memory architectures together, and there are several possibility for programming like message passing, shared memory and mixture of them. Our experimental cluster contains 4-way and 2-way SMP of Pentium-III processors and two types of interconnection network are available, Myrinet800 and Fast Ethernet. We have evaluated and analyzed the performance of hybrid programming to mixture both MPI and OpenMP, and message passing programming with MPI only targeting NAS Parallel Benchmarks as basic benchmarking and SPAM (Smoothed Particle Applied Mechanics) particle code for actual scientific program. Against to the preliminary estimation, MPI-only programming achieved better performance in most of these programs. To analyze these results, we applied the cache hit ratio measurement for the research on cache behavior according to the programming style. As a result, we have cleared the hybrid program with MPI for inter-node communication and OpenMP for intra-node multithreading often breaks well-tuned cache utilization on MPI-only program. This research concludes that the performance of hybrid program is strongly affected by data access pattern, and it is not always the best scheme to apply OpenMP multithreading after we finish to write the MPI-only program. In some cases, the advantage in communication time of direct access to shared memory on SMP is overcome by the disadvantage of such cache inefficiency. However, when the program naturally has load imbalancing with a certain size of granularity, there is a possibility of the performance of hybrid program overcomes the MPI-only one with dynamic load balancing feature. SPAM particle code is one of such applications.

Research Products

(12 results)

All Other

All Publications (12 results)

[Publications] T.Boku, et al.: "Implementation and performance evaluation of SPAM particle code with Open MP-MPI hybrid programming"Proceeding of European Workshop on Open MP 2001. (2001)
- Description
  「研究成果報告書概要(和文)」より
[Publications] T.Boku, et al.: "PIO: ParallelI/O System for Massively Parallel Processors"Proceedings of European High Performance Computing and Network Conference 2001. LNCS-2110. 383-392 (2001)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 吉川茂洋, 他: "SPAM粒子シュミレーションのハイブリッド並列化"情報処理学会研究報告. 2001-HPC-87-8. 43-48 (2001)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 吉川茂洋, 他: "SMP-PCクラスタにおけるSPAM粒子シュミレーションのハイブリッド並列化"並列処理シンポジウムJSPP2002. (採録決定). (2002)
- Description
  「研究成果報告書概要(和文)」より
[Publications] 中村宏, 他: "ハイパフォーマンスコンピューティング向けアーキテクチャSCIMA"情報処理学会論文誌ハイパフォーマンスコンピューティングシステム. Vol-41,No.SIG5. 15-27 (2000)
- Description
  「研究成果報告書概要(和文)」より
[Publications] M.Kondo, et al.: "SCIMA: A Novel Processor Architecture for High Performance Computing"Proceedings of HPC Asia 2000. 355-360 (2000)
- Description
  「研究成果報告書概要(和文)」より
[Publications] T. Boku, et. al.: "Implementation and performance evaluation of SPAM particle code with OpenMP-MPI hybrid programming"Proc. of European Workshop on OpenMP 2001. (2001)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] T. Boku, et. al.: "PIO : Parallel I/O System for Massively Parallel Processors"Proc. of European High Performance Computing and Network Conference 2001, LNCS-2110. 383-392 (2001)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] S. Yoshikawa, et. al.: "Hybrid parallelization of SPAM particle code (in Japanese)"Information Processing Society Japan, SIGHPC Report, 2001-HPC-87-8. 43-48 (2001)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] S. Yoshikawa, et. al.: "Hybrid parallelization of SPAM particle code on SMP-PC cluster (in Japanese)"Proc. of Parallel Processing Symposium JSPP2002. (to be published). (2002)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] H. Nakamura, et. al.: "High Performance Computing Architecture SCIMA (in Japanese)"Transaction of High Performance Computing System, Information Processing Society Japan. Vol. 41, No. SIG5 (HPS1). 15-27 (2000)
- Description
  「研究成果報告書概要(欧文)」より
[Publications] M. Kondo, et. al.: "SCIMA : A Novel Processor Architecture for High Performance Computing"Proc. of HPC Asia 2000. 355-360 (2000)
- Description
  「研究成果報告書概要(欧文)」より

2001 Fiscal Year Final Research Report Summary

Performance analysis on hybrid parallel programming on SMP cluster system

Principal Investigator

BOKU Taisuke Institute of Information Sciences and Electronics, Associate Professor, 電子・情報工学系, 助教授 (90209346)

Research Products

[Publications] T.Boku, et al.: "Implementation and performance evaluation of SPAM particle code with Open MP-MPI hybrid programming"Proceeding of European Workshop on Open MP 2001. (2001)

Description

[Publications] T.Boku, et al.: "PIO: ParallelI/O System for Massively Parallel Processors"Proceedings of European High Performance Computing and Network Conference 2001. LNCS-2110. 383-392 (2001)

Description

[Publications] 吉川 茂洋, 他: "SPAM粒子シュミレーションのハイブリッド並列化"情報処理学会研究報告. 2001-HPC-87-8. 43-48 (2001)

Description

[Publications] 吉川 茂洋, 他: "SMP-PCクラスタにおけるSPAM粒子シュミレーションのハイブリッド並列化"並列処理シンポジウムJSPP2002. (採録決定). (2002)

Description

[Publications] 中村 宏, 他: "ハイパフォーマンスコンピューティング向けアーキテクチャSCIMA"情報処理学会論文誌ハイパフォーマンスコンピューティングシステム. Vol-41,No.SIG5. 15-27 (2000)

Description

[Publications] M.Kondo, et al.: "SCIMA: A Novel Processor Architecture for High Performance Computing"Proceedings of HPC Asia 2000. 355-360 (2000)

Description

[Publications] T. Boku, et. al.: "Implementation and performance evaluation of SPAM particle code with OpenMP-MPI hybrid programming"Proc. of European Workshop on OpenMP 2001. (2001)

Description

[Publications] T. Boku, et. al.: "PIO : Parallel I/O System for Massively Parallel Processors"Proc. of European High Performance Computing and Network Conference 2001, LNCS-2110. 383-392 (2001)

Description

[Publications] S. Yoshikawa, et. al.: "Hybrid parallelization of SPAM particle code (in Japanese)"Information Processing Society Japan, SIGHPC Report, 2001-HPC-87-8. 43-48 (2001)

Description

[Publications] S. Yoshikawa, et. al.: "Hybrid parallelization of SPAM particle code on SMP-PC cluster (in Japanese)"Proc. of Parallel Processing Symposium JSPP2002. (to be published). (2002)

Description

[Publications] H. Nakamura, et. al.: "High Performance Computing Architecture SCIMA (in Japanese)"Transaction of High Performance Computing System, Information Processing Society Japan. Vol. 41, No. SIG5 (HPS1). 15-27 (2000)

Description

[Publications] M. Kondo, et. al.: "SCIMA : A Novel Processor Architecture for High Performance Computing"Proc. of HPC Asia 2000. 355-360 (2000)

Description

[Publications] 吉川茂洋, 他: "SPAM粒子シュミレーションのハイブリッド並列化"情報処理学会研究報告. 2001-HPC-87-8. 43-48 (2001)

[Publications] 吉川茂洋, 他: "SMP-PCクラスタにおけるSPAM粒子シュミレーションのハイブリッド並列化"並列処理シンポジウムJSPP2002. (採録決定). (2002)

[Publications] 中村宏, 他: "ハイパフォーマンスコンピューティング向けアーキテクチャSCIMA"情報処理学会論文誌ハイパフォーマンスコンピューティングシステム. Vol-41,No.SIG5. 15-27 (2000)