2005 Fiscal Year Final Research Report Summary

Thread Partitioning and Speculative Execution for On-Chip Multiprocessor

Research Project

Project/Area Number	15500036
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	Computer system/Network
Research Institution	Nagoya University
Principal Investigator	SHIMADA Toshio Nagoya University, Graduate School of Engineering, Professor, 大学院・工学研究科, 教授 (60252251)
Project Period (FY)	2003 – 2005
Keywords	Chip Multiprocessor / Thread Partitioning / Value Prediction / Compiler / Multi-Threading / Speculative Execution
Research Abstract	On-chip multiprocessors can reduce the overhead related to inter-thread communication, and exploit thread-level parallelism in addition to instruction-level parallelism. This study focuses on these advantages to explore techniques that improve performance in non-numerical programs where only fine-grain threads are available. In this study, we propose thread partitioning techniques and speculative execution techniques for on-chip multiprocessors. Our contributions are as follows. 1. We introduce value prediction to mitigate the constraint of control and-data dependences between threads and evaluate it. Our evaluation results show that the speculative thread execution achieves performance improvements by 12.7% in SPECint2000 over conventional non-speculative thread execution. 2. We propose a scheme that reduces the required number of physical registers by sharing physical registers among threads and enables non-blocking communication on registers. Our evaluation results show that multithreaded execution with our scheme achieves higher performance than the single-threaded execution with 130 physical registers, and reduces the number of physical registers by 50%. 3. We propose a two-step physical register deallocation scheme that exploits potential ILP within a single-thread by suppressing the occurrence of stalls caused by physical register shortage. Our evaluation results show that this scheme achieves significant speedups of 32% on average in the typical case of 64 physical registers. 4. We explore what techniques are required to extract large amounts of TLP in programs. Our evaluation results show that the combination of speculative thread execution, speculative register communication, and basic block-level partitioning achieves a ten-times speedup than a single-thread.

Research Products
(18 results)

All 2005 2004 Other

All Journal Article (18 results)

[Journal Article] 単一チップ・マルチプロセッサSKYにおける投機的実行の性能評価2005
- Author(s)
  上村井明夫, 他3名
- Journal Title
  
  電子情報通信学会信学技法 CPSY2004 Vol. 104, No. 592
  
  Pages: 43-48
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] VT-CMOSキャッシュの性能低下をアドレス予測を用いて低減する先行起動機構2005
- Author(s)
  小林良太郎, 他3名
- Journal Title
  
  情報処理学会論文誌、コンピューティングシステム Vol. 46, No. SIG 3(ACS 8)
  
  Pages: 92-106
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Limit of Thread-Level Parallelism on Partitioning Levels and Speculations in Non-Numerical Programs2005
- Author(s)
  Akio Nakajima, 他3名
- Journal Title
  
  The 8th International Symposium on Low-Power and High-Speed Chips COOL Chips VIII
  
  Pages: 465-472
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] SMTプロセッサにおける物理レジスタ共有によるスレッド間通信機構2005
- Author(s)
  澁谷真帆, 他3名
- Journal Title
  
  2005年先進的計算基盤システムシンポジウム SACSIS 2005
  
  Pages: 407-414
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] スーパスカラ・プロセッサのための物理レジスタ2段階解放2005
- Author(s)
  山本哲弘, 他2名
- Journal Title
  
  情報処理学会研究報告 2005-ARC-164
  
  Pages: 7-12
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Performance Evaluation of Speculative Thread Execution in the Single-Chip Multiprocessor SKY2005
- Author(s)
  Akio Kamimurai
- Journal Title
  
  IEICE Technical Report CPSY2004 Vol.104, No.592
  
  Pages: 43-48
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] A Preactivating Mechanism for Suppressing the Performance Degradation in a VT-CMOS Cache using Address Prediction2005
- Author(s)
  Ryotaro Kobayashi
- Journal Title
  
  IPSJ Transactions on Advanced Computing Systems Vol.46, No.SIG 3(ACS 8)
  
  Pages: 92-106
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Limit of Thread-Level Parallelism on Partitioning Levels and Speculations in Non-Numerical Programs2005
- Author(s)
  Akio Nakajima
- Journal Title
  
  The 8th International Symposium on Low-Power and High-Speed Chips (COOL Chips VIII)
  
  Pages: 465-472
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] An Inter-thread Communication Mechanism with Physical Register Sharing for SMT Processors2005
- Author(s)
  Maho Shibuya
- Journal Title
  
  Symposium on Advanced Computing Systems and Infrastructures (SACSIS) 2005
  
  Pages: 407-414
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Two-Step Physical Register Deallocation for Superscalar Processors2005
- Author(s)
  Akihiro Yamamoto
- Journal Title
  
  IPSJ SIG Technical Reports 2005-ARC-164
  
  Pages: 7-12
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] 非数値計算プログラムにおけるスレッドレベル並列性の限界 : スレッド間メモリ曖昧性除去技術との関係2004
- Author(s)
  中嶋昭夫, 他3名
- Journal Title
  
  情報処理学会研究報告 2004-ARC-156
  
  Pages: 19-24
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] 頻出値を利用した物理レジスタの共有化手法2004
- Author(s)
  山本哲弘, 他5名
- Journal Title
  
  情報処理学会論文誌、コンピューティングシステム Vol. 45, No. SIG 11(ACS 7)
  
  Pages: 133-143
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] SKYのマルチスレッド・モデルを実現したSMTプロセッサにおける物理レジスタの共有化手法2004
- Author(s)
  澁谷真帆, 他3名
- Journal Title
  
  情報処理学会研究報告 2004-ARC-160
  
  Pages: 41-46
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Limits of Thread-Level Parallelism in Non-Numerical Programs : Relation with Inter-Thread Memory Disambiguation Techniques2004
- Author(s)
  Akio Nakajima
- Journal Title
  
  IPSJ SIG Technical Reports 2004-ARC-156
  
  Pages: 19-24
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] A Sharing Scheme of Physical Registers by Exploiting Frequent Values2004
- Author(s)
  Akihiro Yamamoto
- Journal Title
  
  IPSJ Transactions on Advanced Computing Systems Vol.45, No SIG 11 (ACS 7)
  
  Pages: 133-143
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] A Physical Register Sharing Technique for an SMT Processor Implementing the Multithread Model of the SKY2004
- Author(s)
  Maho Shibuya
- Journal Title
  
  IPSJ SIG Technical Reports 2004-ARC-160
  
  Pages: 41-46
- Description
  「研究成果報告書概要(欧文)」より
[Journal Article] Limits of Thread-Level Parallelism in Non-numerical Programs
- Author(s)
  Akio Nakajima, 他3名
- Journal Title
  
  情報処理学会論文誌、コンピューティングシステム (印刷中)
- Description
  「研究成果報告書概要(和文)」より
[Journal Article] Limits of Thread-Level Parallelism in Non-numerical Programs
- Author(s)
  Akio Nakajima
- Journal Title
  
  IPSJ Transactions on Advanced Computing Systems (Already Accepted)
- Description
  「研究成果報告書概要(欧文)」より

2005 Fiscal Year Final Research Report Summary

Thread Partitioning and Speculative Execution for On-Chip Multiprocessor

Principal Investigator

SHIMADA Toshio Nagoya University, Graduate School of Engineering, Professor, 大学院・工学研究科, 教授 (60252251)

Research Products

[Journal Article] 単一チップ・マルチプロセッサSKYにおける投機的実行の性能評価2005

Author(s)

Journal Title

Description

[Journal Article] VT-CMOSキャッシュの性能低下をアドレス予測を用いて低減する先行起動機構2005

Author(s)

Journal Title

Description

[Journal Article] Limit of Thread-Level Parallelism on Partitioning Levels and Speculations in Non-Numerical Programs2005

Author(s)

Journal Title

Description

[Journal Article] SMTプロセッサにおける物理レジスタ共有によるスレッド間通信機構2005

Author(s)

Journal Title

Description

[Journal Article] スーパスカラ・プロセッサのための物理レジスタ2段階解放2005

Author(s)

Journal Title

Description

[Journal Article] Performance Evaluation of Speculative Thread Execution in the Single-Chip Multiprocessor SKY2005

Author(s)

Journal Title

Description

[Journal Article] A Preactivating Mechanism for Suppressing the Performance Degradation in a VT-CMOS Cache using Address Prediction2005

Author(s)

Journal Title

Description

[Journal Article] Limit of Thread-Level Parallelism on Partitioning Levels and Speculations in Non-Numerical Programs2005

Author(s)

Journal Title

Description

[Journal Article] An Inter-thread Communication Mechanism with Physical Register Sharing for SMT Processors2005

Author(s)

Journal Title

Description

[Journal Article] Two-Step Physical Register Deallocation for Superscalar Processors2005

Author(s)

Journal Title

Description

[Journal Article] 非数値計算プログラムにおけるスレッドレベル並列性の限界 : スレッド間メモリ曖昧性除去技術との関係2004

Author(s)

Journal Title

Description

[Journal Article] 頻出値を利用した物理レジスタの共有化手法2004

Author(s)

Journal Title

Description

[Journal Article] SKYのマルチスレッド・モデルを実現したSMTプロセッサにおける物理レジスタの共有化手法2004

Author(s)

Journal Title

Description

[Journal Article] Limits of Thread-Level Parallelism in Non-Numerical Programs : Relation with Inter-Thread Memory Disambiguation Techniques2004

Author(s)

Journal Title

Description

[Journal Article] A Sharing Scheme of Physical Registers by Exploiting Frequent Values2004

Author(s)

Journal Title

Description

[Journal Article] A Physical Register Sharing Technique for an SMT Processor Implementing the Multithread Model of the SKY2004

Author(s)

Journal Title

Description

[Journal Article] Limits of Thread-Level Parallelism in Non-numerical Programs

Author(s)

Journal Title

Description

[Journal Article] Limits of Thread-Level Parallelism in Non-numerical Programs

Author(s)

Journal Title

Description