Project/Area Number | 23K24856 |
Project/Area Number (Other) | 22H03600 (2022-2023) |
Research Category | Grant-in-Aid for Scientific Research (B) |
Allocation Type | Multi-year Fund (2024), Single-year Grants (2022-2023) |
Section | General |
Review Section | Basic Section 60090: High performance computing-related |
Research Institution | Institute of Physical and Chemical Research |
Principal Investigator | WAHIB MOHAMED, RIKEN Center for Computational Science, Team Leader (00650037) |
Co-Investigator (Kenkyū-buntansha) | DROZD ALEKSANDR, RIKEN Center for Computational Science, Researcher (90740126) |
Project Period (FY) | 2022-04-01 – 2026-03-31 |
Project Status | Granted (Fiscal Year 2024) |
Budget Amount |
¥17,290,000 (Direct Cost: ¥13,300,000, Indirect Cost: ¥3,990,000)
Fiscal Year 2025: ¥4,030,000 (Direct Cost: ¥3,100,000, Indirect Cost: ¥930,000)
Fiscal Year 2024: ¥4,290,000 (Direct Cost: ¥3,300,000, Indirect Cost: ¥990,000)
Fiscal Year 2023: ¥4,290,000 (Direct Cost: ¥3,300,000, Indirect Cost: ¥990,000)
Fiscal Year 2022: ¥4,680,000 (Direct Cost: ¥3,600,000, Indirect Cost: ¥1,080,000)
|
Keywords | Code Generation / Neural Networks / Intelligent Programming / HPC / Compilers / Machine Learning / GPUs / Numerical Methods |
Outline of Research at the Start |
Develop a machine learning framework for automated code generation and optimization, replacing manual porting. Utilize diverse datasets to train models for various hardware architectures. Validate, refine, and deploy the framework within a year for wide accessibility and usability.
|
Outline of Annual Research Achievements |
In this fiscal year we developed an approach that automatically generates neural networks. Neural architecture search (NAS) is an effective approach for automating the design of deep neural networks. Evolutionary computation (EC) is commonly used in NAS due to its global optimization capability. However, the evaluation phase of architecture candidates in EC-based NAS is compute-intensive, which limits its application to many real-world problems. To overcome this challenge, we proposed a novel progressive evaluation strategy for the evaluation phase of convolutional neural network architecture search, in which the number of training epochs of network individuals is progressively increased. Our proposed algorithm reduces the computational cost of the evaluation phase and promotes population diversity and fairness by preserving promising networks based on their distribution. We evaluated the proposed progressive evaluation and sub-population preservation neural architecture search (PEP-NAS) algorithm on the CIFAR10, CIFAR100, and ImageNet benchmark datasets and compared it with 36 state-of-the-art algorithms, including manually designed networks, reinforcement learning (RL) algorithms, gradient-based algorithms, and other EC-based ones. The experimental results demonstrate that PEP-NAS effectively identifies networks with competitive accuracy while also markedly improving the efficiency of the search process.
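To make the strategy concrete, the sketch below shows one way progressive evaluation and sub-population preservation could be wired together in an EC-based NAS loop. It is an illustrative Python sketch only, not the PEP-NAS implementation: the helpers fitness_fn and mutate are hypothetical placeholders for the expensive training step and the EC mutation operator.

    import random

    def progressive_ec_nas(init_population, fitness_fn, generations=10,
                           base_epochs=1, epoch_step=2, keep_ratio=0.5):
        # EC-based NAS loop with progressive evaluation: the epoch budget
        # grows each generation, so early generations screen candidates
        # cheaply and only later survivors pay the full training cost.
        population = list(init_population)
        for gen in range(generations):
            epochs = base_epochs + gen * epoch_step  # progressively longer training
            scored = sorted(((fitness_fn(arch, epochs), arch) for arch in population),
                            key=lambda pair: pair[0], reverse=True)
            # Sub-population preservation: keep the top performers, but also
            # retain a random sample of the remainder so the population's
            # fitness distribution stays diverse and fair to late bloomers.
            n_keep = max(1, int(len(scored) * keep_ratio))
            elites = [arch for _, arch in scored[:n_keep]]
            rest = [arch for _, arch in scored[n_keep:]]
            preserved = random.sample(rest, k=min(len(rest), n_keep // 2))
            survivors = elites + preserved
            # Refill the population with mutated copies of survivors.
            population = survivors + [mutate(random.choice(survivors))
                                      for _ in range(len(population) - len(survivors))]
        return population

    def mutate(arch):
        # Placeholder: a real system would perturb layer types, widths,
        # or connections in the architecture encoding.
        return arch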
|
Current Status of Research Progress |
2: Research has progressed on the whole more than it was originally planned.
Reason
The project is progressing as expected, and we were able to publish several papers.
|
Strategy for Future Research Activity |
Our plan for the next fiscal year is to extend our approach for auto-generating neural networks by proposing a progressive neural predictor that uses score-based sampling to improve the performance of the surrogate model with limited training data. Unlike existing algorithms that rely on initial sample selection, our method uses an online approach to progressively select new samples for the surrogate model based on potential information from the previous search process. During the iterative process, the sampled scores are dynamically adjusted based on the prediction rankings in each round to keep track of good architectures, which gradually optimizes the surrogate model. In this way, the processes of training the predictor and searching for architectures are jointly combined to improve the efficiency of sample utilization. In addition, the surrogate model, at different degrees of training, is assigned a prediction confidence equal to its accuracy at the current stage.
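As a rough sketch of how such a score-based sampling loop could look, the Python below pairs a toy surrogate (a nearest-neighbour regressor standing in for the learned predictor) with round-by-round sampling of the predicted-best architectures. This is not the planned system itself: evaluate and featurize are hypothetical placeholders, and the stage-dependent confidence weighting described above is reduced to a comment.

    import random

    def dist(x, y):
        # Squared Euclidean distance between two feature vectors.
        return sum((a - b) ** 2 for a, b in zip(x, y))

    def progressive_predictor_search(candidates, evaluate, featurize,
                                     rounds=5, per_round=8):
        # candidates: hashable architecture encodings (e.g. tuples).
        # evaluate(arch): expensive ground-truth validation accuracy.
        # featurize(arch): numeric feature vector for the surrogate.
        labelled = {}  # arch -> measured accuracy

        def predict(arch):
            # Toy surrogate: accuracy of the nearest labelled architecture.
            if not labelled:
                return 0.0
            nearest = min(labelled,
                          key=lambda a: dist(featurize(a), featurize(arch)))
            return labelled[nearest]

        for _ in range(rounds):
            unlabelled = [a for a in candidates if a not in labelled]
            if not unlabelled:
                break
            # Score-based sampling: evaluate the architectures the surrogate
            # currently ranks highest, plus a few random picks so each round
            # also feeds the predictor diverse training data.
            ranked = sorted(unlabelled, key=predict, reverse=True)
            top, rest = ranked[:per_round // 2], ranked[per_round // 2:]
            batch = top + random.sample(rest, min(len(rest), per_round // 2))
            for arch in batch:
                labelled[arch] = evaluate(arch)
            # Each round's new labels refine the surrogate; a full version
            # would also weight predictions by a stage-dependent confidence.
        return max(labelled, key=labelled.get)

In this sketch the predictor is retrained implicitly (the nearest-neighbour lookup always uses all labels collected so far), which mirrors the idea of jointly combining predictor training with architecture search rather than fixing the training set up front.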
|