研究課題/領域番号 |
23K19979
|
研究機関 | 慶應義塾大学 |
研究代表者 |
魏 凱傑 慶應義塾大学, 理工学研究科(矢上), 特任助教 (90983867)
|
研究期間 (年度) |
2023-08-31 – 2025-03-31
|
キーワード | state vector, / HLS, / Qulacs, / Serial ATA, / FPGA, / quantum computer, / storage system, / simulation |
研究実績の概要 |
In Y2023, the applicant implements the initial design of Trefoil-Qulacs. First, I finished the high-level synthesis (HLS) designs of the Hadamard, Pauli-Z, Phase, Controlled-NOT, and Unitary Matrix using Vitis HLS 2022.2 based on Qulacs. In this stage, I emphasize the algorithm optimization using HLS techniques. To enhance the robustness of the Trefoil-Qulacs, I redesigned the referred quantum gate algorithm, which considers the SATA disk characteristics with the FPGA platform. The Trefoil can construct any quantum circuit with the combination of designed IPs. Furthermore, I verified the designed IPs on the Trefoil storage subsystem after the overall system's implementation using Vivado 2022.2. Since one IP instance has to correspond to a SATA disk, 32 IPs should be involved in the target design. There is a great challenge to on-chip memory resources. According to the evaluations using the physical board, the system can realize 43-qubit quantum circuit simulation using 32 8TB SATA disks, which can only achieved by supercomputers till now. However, the system needs more long-term simulations than supercomputer simulations due to the inefficient communication between SATA disks and FPGA. During the preparation for demonstrations at academic conferences (FPT, MCSoC) and Technique EXPO, I created two more IPs in the substitution of LiteX proposed in the research plan, which enables a more friendly environment for quantum circuit simulation using display. Regarding these two IPs, one is for communication with SATA disks, and the other is in the place of the CPU using Rasberry Pi.
|
現在までの達成度 (区分) |
現在までの達成度 (区分)
1: 当初の計画以上に進展している
理由
According to the research plan, the Trefoil storage subsystem was initially devised to function on XILINX Ultrascale XKU060 and establish a connection with 16 SATA disks. However, after its implementation, I realized that the system design needed to be more stable to handle the given workload, and the on-chip resources needed to be improved. In light of this, I have changed the platform, and after a thorough analysis, the FPGA chip was upgraded to XILINX Ultrscale+ XCKU15P. The new platform can connect with 32 SATA disks and has undergone rigorous testing to ensure its robustness and stability, thus ensuring optimal efficiency. Between July and October, I redesign the IPs by customizing the on-chip resources. To ensure the system was reliable and efficient, I moved away from relying solely on simulation results. Instead, I began implementing experiments on the actual platform. After finishing the initial implementation of the whole system on the new Trefoil storage subsystem, I presented several demonstrations at academic conferences, including FPT and MCSoC, to demonstrate the system's capabilities. During a press statement in December, I improved the system's reliability and provided a more user-friendly environment that enables easy access to the system. To effectively control and confirm the functionality of the Trefoil storage subsystem, a Raspberry Pi is connected to the FPGA's GPIO working as CPU part. Additionally, the proposed system can connect with a portable display. Users can interactively visualize the simulation results with a Python-based graphical interface.
|
今後の研究の推進方策 |
According to my two-year research plan, I put great effort into system optimization and research publication in Y2024. According to our analysis of the system time consumption, the data communication between SATA disks and FPGA dominates the simulation. Based on the current system's design. I have to exploit the possibility of pipelining the process of reading and computation to boost the target system's performance. Thus, I will spend three months (April~June) optimizing the design instead of focusing on the computation part. However, as described previously, each designed IP has to correspond to a SATA disk. Thus, 32 Qulacs IPs are involved in the system design. I have to control the resource utilization of each IP around 3% with ensuring the construction of a full-geared system connecting with 32 SATA disks, which challenges the limited on-chip resource. A dedicated resource allocation is necessary for Trefoil-Qulacs. I target a fully optimized system whose time consumption only concerns the communication between SATA disks and FPGA. On the other hand, regarding the system evaluation and verification, I will check the simulation results from the GUI interface created in Y2023 and the simulation in Vitis HLS 2022.2.2. As the final step, I will summarize the system in a journal paper, including detailed system designs and performance evaluations. Unlike the original proposal, the journal paper focuses on implementing the Qulacs-Trefoil design with the Trefoil storage subsystem connecting with 32 SATA disks. I will further improve the evaluation results during this process.
|
次年度使用額が生じた理由 |
For the last fiscal year (Y2023), since I have focused on the system design instead of the evaluation part, there is no cost for the machines for comparison with the counterparts. In Y2024, I expect to spend the research fund as follows, The G-Tune HP-Z Workstation works as the primary development environment for the target system using Vivado Design Suite. Besides, embedded GPU GeForce RTX 4080 is one of the counterparts for the target system, Trefoil-Qulacs. EK-U1-ZCU106-G-J is a Xilinx Zynq UltraScale+ MPSoC ZCU106 Evaluation Kit, providing abundant logic computation resources. Evaluating the HLS-based Qulacs design on different platforms can help determine the deficiencies and possibilities for the next generation's Trefoil-Qulacs. Apple MacBook Pro is used for research and presentation of research results. Research materials with related knowledge are needed to facilitate the study, which is the workspace for LiteX design for the Trefoil target board. Due to the heavy program, a high-spec PC device is required. On the other hand, I will spend the remaining part on travel expenses for academic conferences and journal paper publication, as explained in the research plan of Y2024.
|