Research Abstract |
In this research, we proposed a cascaded ALU architecture for high-performance and real-time processing. Conventional high-performance superscalar processors suffer from increasing wire delays brought by semiconductor progress because their performance is limited by wire delay in the critical path. The Cascade ALU architecture, in which ALUs are cascaded dynamically to solve RAW dependencies between instructions, solves this problem by making the ALU part critical path. Because ALU speed is not limited by wire delays, the architecture can enjoy any further progress in device speed for an enhancement in processor performance. We have evaluated the performance and area size of the proposed cascade ALU. The results show that the cascade ALU architecture has a good performance scalability and little area penalty compared with current synchronous processors. Since the delay of the Cascade ALU varies depending on executed instructions, asynchronous circuits are suitable for its implementation
… More
. Thus, we developed a CAD system for asynchronous VLSIs. This system, called AlNOS, accepts ordinary synchronous RTL descriptions in Verilog-HDL and generates asynchronous gate-level circuits based on SDI model. SDI is our novel delay model which assumes that the delay scaling variation between any two components is bounded. In the SDI model based design, high-speed operation can be achieved by utilizing delay information while preserving the robustness of circuits. We also proposed a new memory architecture dedicated for high-performance and real-time processing. The memory architecture adopts software controlled memory (SCM) on the processor chip in addition to ordinary cache memory. The SCM and cache can be reconfigured dynamically depending on the characteristics of running applications. Since software can directly specify the data transfer between off-chip memory and SCM, the worst-case performance is strictly guaranteed which is favorable for real-time processing. In order to realize automatic software control, a compilation algorithm is developed and implemented. Less
|