Research on a Very Large Data Path processor

Research Project

Project/Area Number	07458052
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	一般
Research Field	計算機科学
Research Institution	University of Tokyo
Principal Investigator	TANAKA Hidehiko The University of Tokyo, Graduate School of Engineering, 大学院・工学系研究科, 教授 (60011102)
Project Period (FY)	1995 – 1997
Project Status	Completed (Fiscal Year 1997)
Budget Amount *help	¥6,800,000 (Direct Cost: ¥6,800,000) Fiscal Year 1997: ¥1,000,000 (Direct Cost: ¥1,000,000) Fiscal Year 1996: ¥1,800,000 (Direct Cost: ¥1,800,000) Fiscal Year 1995: ¥4,000,000 (Direct Cost: ¥4,000,000)
Keywords	dataflow anaysis / VLDP / branch handling / speculative execution / dynamic prefetch / address prediction / data traffic / ILP / VLDP / 大規模データパスプロセッサ / 分岐予測 / 並列度 / タイム管理モジュール / 処理ステップオ-ガナイザ / マッピングモジュール / データフロー変換 / メモリアロケイタ
Research Abstract	In this research, we took a close look at parallelism extraction, handling of control dependencies, and enhanced memory system, which could be the key technologies of Very Large Data Path (VLDP) architecture. The outcome we gained are as follows. Firstly, we examined the characteristics of parallelism restricted by control dependencies and data dependencies : we found that the VLDP architecture can achieve dozens of times as much performance as conventional scalar processors can, when we have both the perfect branch predictor and the complete memory system. Secondly, we examined suitable branch handling for the VLDP architecture. This branch handling needs to treat simultaneously plural control flow and needs to fetch instructions speculatively across many branches. For this purpose, we proposed a new branch predictor named multi-level branch predictor and examined its behavior. Thirdly, we improved a memory system based on the cache mechanism by utilizing a newly proposed data prefetch algorithm that predicts a referenced address based on the evaluation of a linear function. We confirmed this new system lessens overhead of memory references. Moreover we examined the traffic between the memory and the datapath, which tends to be a bottleneck for high performance processors. We showed that this traffic is reduced to less than half of conventional scheme when the VLDP architecture utilizes the direct data transfer among the remarkable number of ALUs. From these results, technological issues of VLDP architecture are made clear and we confirmed the possibility of performance improvement.

Report

(4 results)

1997 Annual Research Report Final Research Report Summary
1996 Annual Research Report
1995 Annual Research Report