2019 Fiscal Year Annual Research Report

ExaPath: Hierarchical Routing for Next-Gen Supercomputers and Beyond

Research Project

Project/Area Number 19H04119
Research Institution Institute of Physical and Chemical Research

Principal Investigator

Jens Domke  RIKEN, Center for Computational Science, Postdoctoral Researcher (70815480)

Project Period (FY) 2019-04-01 – 2024-03-31
Keywords HPC interconnects
Outline of Annual Research Achievements

We achieved a milestone in the area of HPC interconnects by developing a large-scale proof-of-concept of the HyperX interconnection network.
The HyperX topology was designed to reduce the point-to-point latency and cost of state-of-the-art fat-tree networks, and our real-life HyperX prototype demonstrated that HyperX topologies are a viable alternative to fat-trees even without adaptive routing. Our novel Pattern-Aware Routing for HyperX (PARX) circumvents the bottleneck that arises when static shortest-path routing is applied to a HyperX.
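To make the bottleneck concrete, here is a minimal Python sketch. It is not the PARX implementation. It assumes the standard HyperX definition, in which switches sit on a multi-dimensional grid and each switch is linked directly to every switch that differs from it in exactly one coordinate. The function names and the 4x3 example shape are hypothetical.

```python
from itertools import product


def hyperx_switches(shape):
    """Enumerate switch coordinates of a HyperX with the given per-dimension sizes."""
    return list(product(*(range(size) for size in shape)))


def min_hops(src, dst):
    """Shortest-path length: one hop per coordinate in which src and dst differ."""
    return sum(a != b for a, b in zip(src, dst))


def dimension_order_route(src, dst):
    """One static minimal route: fix the coordinates dimension by dimension."""
    path = [src]
    current = list(src)
    for dim, target in enumerate(dst):
        if current[dim] != target:
            current[dim] = target
            path.append(tuple(current))
    return path


def one_dimension_detours(src, dst, shape):
    """Two-hop alternatives for switches that differ in exactly one coordinate.

    A static shortest-path routing never uses these, so all traffic
    between src and dst shares the single direct link.
    """
    differing = [d for d, (a, b) in enumerate(zip(src, dst)) if a != b]
    if len(differing) != 1:
        raise ValueError("src and dst must differ in exactly one coordinate")
    dim = differing[0]
    detours = []
    for value in range(shape[dim]):
        if value not in (src[dim], dst[dim]):
            middle = list(src)
            middle[dim] = value
            detours.append([src, tuple(middle), dst])
    return detours


shape = (4, 3)  # hypothetical 2-D HyperX with 12 switches
print(len(hyperx_switches(shape)))
print(dimension_order_route((0, 0), (3, 2)))
print(one_dimension_detours((0, 0), (3, 0), shape))
```

In this sketch, switches (0, 0) and (3, 0) are one hop apart, so a static minimal routing sends all of their traffic over one link, while the two-hop detours through (1, 0) and (2, 0) stay idle. A routing scheme that can also use such non-minimal paths has more capacity available between the same pair.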
We published two papers, a short version at the 26th Symposium on High-Performance Interconnects (HOTI 26) and a full paper at the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '19), which introduced the novel routing algorithm, tailored to but not limited to the HyperX topology.
Furthermore, we collaborated with researchers at ETH Zurich to develop a routing algorithm for Slimfly networks. This resulted in a Bachelor's thesis titled "Design and Implementation of Multipath Switching in InfiniBand Slimfly Networks", which developed a multipath routing algorithm that builds on the underlying principles of PARX, is adapted to Slimfly, and can switch between the minimal path and a non-minimal one depending on the message size.
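The idea of choosing a path by message size can be sketched as follows. This is an illustration only, not the algorithm from the thesis: the threshold value, the function name, and the direction of the choice (small messages on the minimal path to keep latency low, large messages on the non-minimal path to spread load) are assumptions made for this example.

```python
def select_path(message_bytes, minimal_path, non_minimal_path, threshold_bytes=4096):
    """Pick a route based on message size.

    Assumption: messages below the hypothetical threshold take the minimal
    path; all others take the non-minimal path.
    """
    if message_bytes < 0:
        raise ValueError("message size must be non-negative")
    if message_bytes < threshold_bytes:
        return minimal_path
    return non_minimal_path


# Hypothetical switch names for illustration.
minimal = ["sw_a", "sw_b"]
non_minimal = ["sw_a", "sw_c", "sw_b"]

print(select_path(64, minimal, non_minimal))
print(select_path(1 << 20, minimal, non_minimal))
```

In a real InfiniBand deployment the path choice would be made through the fabric's own addressing and forwarding mechanisms rather than a Python function; the sketch only shows the decision rule.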
Last but not least, we disseminated our research findings through invited talks internally, via RIKEN R-CCS Cafe, and externally, via the High Performance Consortium for Advanced Scientific and Technical Computing (HP-CAST 32).

Current Status of Research Progress

3: Progress in research has been slightly delayed.

Reason

Minor administrative barriers prevented us from hiring a qualified undergraduate student from a domestic university to assist with the research, and hence the initial research plan was slightly delayed.

Strategy for Future Research Activity

Future research will largely follow the plan outlined in the project proposal. We will seek further international and domestic collaborations to develop an HPC routing library that can, ideally, be interfaced with the OpenFabrics Management Framework (OFMF) and other interconnect management frameworks. We also plan to develop new routing algorithms, and to assist through collaborations in the development of others, for current and future HPC installations.

  • Research Products

    (7 results)

All Journal Article (2 results) (of which Int'l Joint Research: 2 results,  Peer Reviewed: 2 results) Presentation (4 results) (of which Int'l Joint Research: 2 results,  Invited: 2 results) Remarks (1 results)

  • [Journal Article] HyperX Topology: First at-scale Implementation and Comparison to the Fat-Tree (2019)

    • Author(s)
      Domke Jens、Matsuoka Satoshi、Ivanov Ivan R.、Tsushima Yuki、Yuki Tomoya、Nomura Akihiro、Miura Shin'ichi、McDonald Nic、Floyd Dennis L.、Dube Nicolas
    • Journal Title

      Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

      Volume: SC'19 Pages: 40:1-40:23

    • DOI

      10.1145/3295500.3356140

    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] The First Supercomputer with HyperX Topology: A Viable Alternative to Fat-Trees? (2019)

    • Author(s)
      Domke Jens、Matsuoka Satoshi、Radanov Ivan、Tsushima Yuki、Yuki Tomoya、Nomura Akihiro、Miura Shin'ichi、McDonald Nic、Floyd Dennis Lee、Dube Nicolas
    • Journal Title

      2019 IEEE Symposium on High-Performance Interconnects (HOTI)

      Volume: HOTI'26 Pages: 4

    • DOI

      10.1109/HOTI.2019.00013

    • Peer Reviewed / Int'l Joint Research
  • [Presentation] HyperX Topology: First at-scale Implementation and Comparison to the Fat-Tree (2019)

    • Author(s)
      Domke Jens
    • Organizer
      International Conference for High Performance Computing, Networking, Storage and Analysis (SC'19)
    • Int'l Joint Research
  • [Presentation] The First Supercomputer with HyperX Topology: A Viable Alternative to Fat-Trees? (2019)

    • Author(s)
      Domke Jens
    • Organizer
      2019 IEEE Symposium on High-Performance Interconnects
    • Int'l Joint Research
  • [Presentation] The First Supercomputer with HyperX Topology: A Viable Alternative to Fat-Trees? (2019)

    • Author(s)
      Domke Jens
    • Organizer
      The 179th R-CCS Cafe
    • Invited
  • [Presentation] First At-Scale HyperX Implementation: A Compelling Alternative to Fat-Trees? (2019)

    • Author(s)
      Domke Jens
    • Organizer
      High Performance Consortium for Advanced Scientific and Technical Computing (HP-CAST 32)
    • Invited
  • [Remarks] TSUBAME2 HyperX experiment

    • URL

      https://gitlab.com/domke/t2hx

Published: 2021-12-27  
