• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Fault tolerant computing based on a multi-SPMD programming/execution environment

Research Project

Project/Area Number 26730064
Research Category

Grant-in-Aid for Young Scientists (B)

Allocation TypeMulti-year Fund
Research Field High performance computing
Research InstitutionInstitute of Physical and Chemical Research

Principal Investigator

Tsuji Miwako  国立研究開発法人理化学研究所, 計算科学研究機構, 研究員 (80466466)

Project Period (FY) 2014-04-01 – 2017-03-31
Project Status Completed (Fiscal Year 2016)
Budget Amount *help
¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Fiscal Year 2015: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000)
Fiscal Year 2014: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000)
Keywords耐故障性 / プログラミングモデル / ワークフロー / 国際情報交換(アメリカ) / 国際情報交換(フランス) / 国際情報交換(アメリカ) / 国際情報交換(フランス)
Outline of Final Research Achievements

In this research, we have supported fault tolerance features in a multi-SPMD programming/execution environment, where tasks in a workflow are executed in distributed parallel. The programming environment adopts multi-programming methodologies across multi-architectural levels, such as Numa-core groups in a node, nodes in a cluster, a cluster of clusters, to realize scalability. To achieve a fault tolerance and resilience mechanism without any modification of the application’s source code, we have developed middleware to detect errors in remote programs and extended workflow scheduler to realize fault resilience.

Report

(4 results)
  • 2016 Annual Research Report   Final Research Report ( PDF )
  • 2015 Research-status Report
  • 2014 Research-status Report
  • Research Products

    (7 results)

All 2017 2016 2015 2014

All Presentation (7 results) (of which Int'l Joint Research: 1 results,  Invited: 1 results)

  • [Presentation] 大規模システムにおける耐故障マルチ SPMD プログラミング開発実行環境の応用と評価2017

    • Author(s)
      辻美和子
    • Organizer
      第158回ハイパフォーマンスコンピューティング研究発表会
    • Place of Presentation
      大月ホテル和風館(静岡県・熱海市)
    • Year and Date
      2017-03-08
    • Related Report
      2016 Annual Research Report
  • [Presentation] Multi-SPMD Programming Paradigm for Extreme Computing2016

    • Author(s)
      Miwako Tsuji
    • Organizer
      The 6th AICS International Symposium
    • Place of Presentation
      AICS (兵庫県神戸市)
    • Year and Date
      2016-02-22
    • Related Report
      2015 Research-status Report
    • Invited
  • [Presentation] Fault tolerance features of a new multi-SPMD programming/execution environment2015

    • Author(s)
      Miwako Tsuji, Serge Petiton and Mitsuhisa Sato
    • Organizer
      First International Workshop on Extreme Scale Programming Models and Middleware
    • Place of Presentation
      Austin, TX, USA
    • Year and Date
      2015-11-14
    • Related Report
      2015 Research-status Report
    • Int'l Joint Research
  • [Presentation] Fault Tolerance features of YML-XMP2015

    • Author(s)
      Miwako Tsuji
    • Organizer
      Workshop on Langage and Programming Paradigm for Exascale Applications
    • Place of Presentation
      Houston, TX, USA
    • Year and Date
      2015-03-12 – 2015-03-13
    • Related Report
      2014 Research-status Report
  • [Presentation] マルチSPMDプログラミング開発実行環境における耐故障性実現に向けたワークフロースケジューリングの検討2015

    • Author(s)
      辻美和子,佐藤三久
    • Organizer
      情報処理学会研究報告
    • Place of Presentation
      大分県別府市
    • Year and Date
      2015-03-02 – 2015-03-03
    • Related Report
      2014 Research-status Report
  • [Presentation] マルチSPMD環境に向けたXMP/YMLの活用2014

    • Author(s)
      辻美和子
    • Organizer
      第2回XcalableMPワークショップ
    • Place of Presentation
      東京都千代田区
    • Year and Date
      2014-10-24
    • Related Report
      2014 Research-status Report
  • [Presentation] マルチSPMD環境における耐故障性実現に向けた OmniRPC-MPI の拡張2014

    • Author(s)
      辻美和子,佐藤三久
    • Organizer
      情報処理学会研究報告
    • Place of Presentation
      沖縄県沖縄市
    • Year and Date
      2014-10-02
    • Related Report
      2014 Research-status Report

URL: 

Published: 2014-04-04   Modified: 2018-03-22  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi