1997 Fiscal Year Final Research Report Summary
Research on a Very Large Data Path processor
Project/Area Number |
07458052
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
計算機科学
|
Research Institution | University of Tokyo |
Principal Investigator |
TANAKA Hidehiko The University of Tokyo, Graduate School of Engineering, 大学院・工学系研究科, 教授 (60011102)
|
Project Period (FY) |
1995 – 1997
|
Keywords | dataflow anaysis / VLDP / branch handling / speculative execution / dynamic prefetch / address prediction / data traffic / ILP |
Research Abstract |
In this research, we took a close look at parallelism extraction, handling of control dependencies, and enhanced memory system, which could be the key technologies of Very Large Data Path (VLDP) architecture. The outcome we gained are as follows. Firstly, we examined the characteristics of parallelism restricted by control dependencies and data dependencies : we found that the VLDP architecture can achieve dozens of times as much performance as conventional scalar processors can, when we have both the perfect branch predictor and the complete memory system. Secondly, we examined suitable branch handling for the VLDP architecture. This branch handling needs to treat simultaneously plural control flow and needs to fetch instructions speculatively across many branches. For this purpose, we proposed a new branch predictor named multi-level branch predictor and examined its behavior. Thirdly, we improved a memory system based on the cache mechanism by utilizing a newly proposed data prefetch algorithm that predicts a referenced address based on the evaluation of a linear function. We confirmed this new system lessens overhead of memory references. Moreover we examined the traffic between the memory and the datapath, which tends to be a bottleneck for high performance processors. We showed that this traffic is reduced to less than half of conventional scheme when the VLDP architecture utilizes the direct data transfer among the remarkable number of ALUs. From these results, technological issues of VLDP architecture are made clear and we confirmed the possibility of performance improvement.
|