Research on Software Controlled Integrated Memory Architecture for Large Scientific Computing

Research Project

Project/Area Number	10680335
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	計算機科学
Research Institution	Research on Software Controlled Integrated Memory Architecture for Large Scientific Computing
Principal Investigator	NAKAMURA Hiroshi The University of Tokyo, Research Center for Advanced science and Technology, Associate Professor, 先端科学技術研究センター, 助教授 (20212102)
Co-Investigator(Kenkyū-buntansha)	NANYA Takashi The University of Tokyo, Research Center for Advanced Science and Technology, Professor, 先端科学技術研究センター, 教授 (80143684)
Project Period (FY)	1998 – 1999
Project Status	Completed (Fiscal Year 1999)
Budget Amount *help	¥3,400,000 (Direct Cost: ¥3,400,000) Fiscal Year 1999: ¥1,300,000 (Direct Cost: ¥1,300,000) Fiscal Year 1998: ¥2,100,000 (Direct Cost: ¥2,100,000)
Keywords	Processor Architecture / Memory Hierarchy / Scientific Computing / High Performance Computing / プロセッサアーキテクチュ
Research Abstract	Sufficient memory throughput is indispensable for high performance computing in large scale scientific computing. In this research, we propose a new architecture for providing sufficient memory throughput by integrating software controllable memory into processor chip. Since the integrated On-Chip Memory is explicitly addressed by software, only the required data is pre-transferred into the On-Chip Memory without flushing out other required data caused by unfortunate conflicts which occurs frequently in conventional cache. At this point, On-Chip Memory is better to exploit temporal locality than cache. We developed a clock-level simulator for the proposed architecture and evaluated it by using practical scientific applications. In the simulation, various configurations can be explored including the number of functional units, latency and issue rate for each operation, the structure of data cache and On-Chip Memory, and throughput and latency of off-chip memory. The evaluation results by using QCD(Quantum Chromo Dynamics) computation reveals that the proposed architecture decreases off-chip memory traffic compared with conventional cache-only architecture. When the latency of off-chip memory is 40 CPU cycles, the architecture achieves 2.7 times improvement in performance. The degree of the performance improvement increases for longer off-chip memory latency. Off-Chip Memory latency is expected to increase and Off-Chip Memory bandwidth is expected to decrease. Therefore, the results indicate that the effectiveness of the proposed architecture will continue to grow in the future.

Report

(3 results)

1999 Annual Research Report Final Research Report Summary
1998 Annual Research Report

Research Products
(21 results)

All Other

All Publications (21 results)

[Publications] P. Panda: "Augmenting Loop Tiling with Data Alignment for Improved Cache Performance"IEEE Transactions on Computers. Vol.48, No.2. 142-149 (1999)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] 大河原英喜: "ハイパフォーマンスコンピューティングに適したメモリ階層の検討"情報処理学会研究報告. ARC99-133. 55-60 (1999)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] 大河原英喜: "ハイパフォーマンスコンピューティングに適したメモリアーキテクチャの予備評価"情報処理学会研究報告. ARC2000-136. 13-18 (2000)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] S. Aoki: "Performance of lattice QCD programs on CP-PACS"Journal of Parallel Computing. Vol. 25. 1243-1255 (1999)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] K. Nakazawa: "CP-PACS: A Massively Parallel Processor at University of Tsukuba"Journal of Parallel Computing. Vol. 25. 1635-1661 (1999)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] M. Kondo: "SCIMA: A Novel Processor Architecture for High Performance Computing"Proc. of High Performance Computing Asia 2000. (印刷中). (2000)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] P. Panda: "Augmenting Loop Tiling with Data Alignment for Improved Cache Performance"IEEE Transactions on Computers. Vol. 48 No. 2. 142-149 (1999)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] H. Okawara: "Preliminary Evaluation of New Memory Hierarchy for High Performance Computing"Technical Report of IPSJ. ARC99-133. 55-60 (1999)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] H. Okawara: "Preliminary Performance Evaluation of New Memory Architecture for High Performance Computing"Technical Report of IPSJ. ARC2000-136. 13-18 (2000)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] S. Aoki: "Performance of lattice QCD programs on CP-PACS"Journal of Parallel Computing. Vol. 25. 1243-1255 (1999)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] K. Nakazawa: "CP-PACS : A Massively Parallel Processor at University of Tsukuba"Journal of Parallel Computing. Vol. 25. 1635-1661 (1999)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] M. Kondo: "SCIMA : A Novel Processor Architecture for High Performance Computing"Proc. of High Performance Computing Asia 2000. (to appear). (2000)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  1999 Final Research Report Summary
[Publications] 今井雅: "SDIモデルに基づいた非同期式パイプライン・データパスの論理合成"情報処理学会論文誌. 40・4. 1947-1956 (1999)
- Related Report
  1999 Annual Research Report
[Publications] 大河原英喜: "ハイパフォーマンスコンピューティングに適したメモリ階層の検討"情報処理学会研究報告 ARC. 133. 55-60 (1999)
- Related Report
  1999 Annual Research Report
[Publications] 大河原英喜: "ハイパフォーマンスコンピューティングに適したメモリアーキテクチャの予備評価"情報処理学会研究報告 ARC. 136. 13-18 (2000)
- Related Report
  1999 Annual Research Report
[Publications] S. Aoki: "Performance of lattice QCD programs on CP-PACS"Journal of Parallel Computing. 25. 1243-1255 (1999)
- Related Report
  1999 Annual Research Report
[Publications] K. Nakazawa: "CP-PACS : A Massively Parallel Processor at University of Tsukuba"Journal of Parallel Computing. 25. 1635-1661 (1999)
- Related Report
  1999 Annual Research Report
[Publications] M. Kondo: "SCIMA : A Novel Processor Architecture for High Performance Computing"High Performance computing Asia 2000. (採録決定). (2000)
- Related Report
  1999 Annual Research Report
[Publications] 板倉憲一: "超並列計算機CP-PACSにおけるNPB Kernel CGの評価" 情報処理学会論文誌. 39・6. 1757-1765 (1998)
- Related Report
  1998 Annual Research Report
[Publications] P.R.Panda: "Augmenting Loop Tiling with Data Alignment for Improved Cache Performance" IEEE Transactions on Computers. 48・2. (1999)
- Related Report
  1998 Annual Research Report
[Publications] 近藤正章: "オンチップメモリを用いたHPCプロセッサの検討" 情報処理学会研究報告. HPC-75. 85-90 (1999)
- Related Report
  1998 Annual Research Report

Research on Software Controlled Integrated Memory Architecture for Large Scientific Computing

Principal Investigator

NAKAMURA Hiroshi The University of Tokyo, Research Center for Advanced science and Technology, Associate Professor, 先端科学技術研究センター, 助教授 (20212102)

¥3,400,000 (Direct Cost: ¥3,400,000)

Report

Research Products

[Publications] P. Panda: "Augmenting Loop Tiling with Data Alignment for Improved Cache Performance"IEEE Transactions on Computers. Vol.48, No.2. 142-149 (1999)

Description

Related Report

[Publications] 大河原英喜: "ハイパフォーマンスコンピューティングに適したメモリ階層の検討"情報処理学会研究報告. ARC99-133. 55-60 (1999)

Description

Related Report

[Publications] 大河原英喜: "ハイパフォーマンスコンピューティングに適したメモリアーキテクチャの予備評価"情報処理学会研究報告. ARC2000-136. 13-18 (2000)

Description

Related Report

[Publications] S. Aoki: "Performance of lattice QCD programs on CP-PACS"Journal of Parallel Computing. Vol. 25. 1243-1255 (1999)

Description

Related Report

[Publications] K. Nakazawa: "CP-PACS: A Massively Parallel Processor at University of Tsukuba"Journal of Parallel Computing. Vol. 25. 1635-1661 (1999)

Description

Related Report

[Publications] M. Kondo: "SCIMA: A Novel Processor Architecture for High Performance Computing"Proc. of High Performance Computing Asia 2000. (印刷中). (2000)

Description

Related Report

[Publications] P. Panda: "Augmenting Loop Tiling with Data Alignment for Improved Cache Performance"IEEE Transactions on Computers. Vol. 48 No. 2. 142-149 (1999)

Description

Related Report

[Publications] H. Okawara: "Preliminary Evaluation of New Memory Hierarchy for High Performance Computing"Technical Report of IPSJ. ARC99-133. 55-60 (1999)

Description

Related Report

[Publications] H. Okawara: "Preliminary Performance Evaluation of New Memory Architecture for High Performance Computing"Technical Report of IPSJ. ARC2000-136. 13-18 (2000)

Description

Related Report

[Publications] S. Aoki: "Performance of lattice QCD programs on CP-PACS"Journal of Parallel Computing. Vol. 25. 1243-1255 (1999)

Description

Related Report

[Publications] K. Nakazawa: "CP-PACS : A Massively Parallel Processor at University of Tsukuba"Journal of Parallel Computing. Vol. 25. 1635-1661 (1999)

Description

Related Report

[Publications] M. Kondo: "SCIMA : A Novel Processor Architecture for High Performance Computing"Proc. of High Performance Computing Asia 2000. (to appear). (2000)

Description

Related Report

[Publications] 今井雅: "SDIモデルに基づいた非同期式パイプライン・データパスの論理合成"情報処理学会論文誌. 40・4. 1947-1956 (1999)

Related Report

[Publications] 大河原英喜: "ハイパフォーマンスコンピューティングに適したメモリ階層の検討"情報処理学会研究報告 ARC. 133. 55-60 (1999)

Related Report

[Publications] 大河原英喜: "ハイパフォーマンスコンピューティングに適したメモリアーキテクチャの予備評価"情報処理学会研究報告 ARC. 136. 13-18 (2000)

Related Report

[Publications] S. Aoki: "Performance of lattice QCD programs on CP-PACS"Journal of Parallel Computing. 25. 1243-1255 (1999)

Related Report

[Publications] K. Nakazawa: "CP-PACS : A Massively Parallel Processor at University of Tsukuba"Journal of Parallel Computing. 25. 1635-1661 (1999)

Related Report

[Publications] M. Kondo: "SCIMA : A Novel Processor Architecture for High Performance Computing"High Performance computing Asia 2000. (採録決定). (2000)

Related Report

[Publications] 板倉憲一: "超並列計算機CP-PACSにおけるNPB Kernel CGの評価" 情報処理学会論文誌. 39・6. 1757-1765 (1998)

Related Report

[Publications] P.R.Panda: "Augmenting Loop Tiling with Data Alignment for Improved Cache Performance" IEEE Transactions on Computers. 48・2. (1999)

Related Report

[Publications] 近藤正章: "オンチップメモリを用いたHPCプロセッサの検討" 情報処理学会研究報告. HPC-75. 85-90 (1999)

Related Report