
Adaptive Auto-tuning Technology Aiming Complex Multicore and Multiprocessor Environments

Research Project

Project/Area Number 21300013
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation Type Single-year Grants
Section General
Research Field Computer system/Network
Research Institution The University of Electro-Communications

Principal Investigator

IMAMURA Toshiyuki  The University of Electro-Communications, Graduate School of Informatics and Engineering, Associate Professor (60361838)

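Co-Investigator (Kenkyū-buntansha) KATAGIRI Takahiro  The University of Tokyo, Information Technology Center, Associate Professor (40345434)
SUDA Reiji  The University of Tokyo, Graduate School of Information Science and Technology, Professor (40251392)
TAKAHASHI Daisuke  University of Tsukuba, Graduate School of Systems and Information Engineering, Associate Professor (00292714)
YAMAMOTO Yusaku  Kobe University, Graduate School of System Informatics, Professor (20362288)
NAKAJIMA Kengo  The University of Tokyo, Information Technology Center, Professor (20376528)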
Project Period (FY) 2009 – 2011
Project Status Completed (Fiscal Year 2011)
Budget Amount
¥17,550,000 (Direct Cost: ¥13,500,000, Indirect Cost: ¥4,050,000)
Fiscal Year 2011: ¥5,200,000 (Direct Cost: ¥4,000,000, Indirect Cost: ¥1,200,000)
Fiscal Year 2010: ¥5,330,000 (Direct Cost: ¥4,100,000, Indirect Cost: ¥1,230,000)
Fiscal Year 2009: ¥7,020,000 (Direct Cost: ¥5,400,000, Indirect Cost: ¥1,620,000)
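Keywords multicore / run-time optimization / dynamic load balancing / hybrid parallel programming model / GPGPU / QR decomposition / eigenvalue computation / high-performance BLAS / performance database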
Research Abstract

This research project set out to establish a key component of "automatic tuning technology" on multicore processors and multiple GPUs for a next-generation supercomputer system. We developed an "automatic tuning technique employing performance stabilization on GPUs" and an "automatic tuning technique that utilizes a performance database". Furthermore, through mathematical research on automatic tuning under dynamically changing environmental conditions, we derived a semi-optimal experimental design based on a Bayesian model. Automatic tuning (AT) was applied to eigenvalue solvers, the Fast Fourier Transform, and preconditioned iterative solvers on cluster systems and multicore computer systems.

Report

(4 results)
  • 2011 Annual Research Report   Final Research Report ( PDF )
  • 2010 Annual Research Report
  • 2009 Annual Research Report
  • Research Products

    (99 results)


All Journal Article (33 results) (of which Peer Reviewed: 21 results) Presentation (64 results) Book (2 results)

  • [Journal Article] CUDA環境下でのDGEMV関数の性能安定化・自動チューニングに関する考察2011

    • Author(s)
      今村俊幸
    • Journal Title

      情報処理学会論文誌コンピューティングシステム

      Volume: Vol.4, No.4 Pages: 158-168

    • NAID

      40019259197

    • Related Report
      2011 Annual Research Report 2011 Final Research Report
  • [Journal Article] 疎行列-ベクトル積におけるブロック化BSS法と高スレッド並列環境での性能評価2011

    • Author(s)
      片桐孝洋, 佐藤雅彦
    • Journal Title

      情報処理学会論文誌:ACS

      Volume: Vol.4, No.3 Pages: 1-8

    • NAID

      170000065500

    • Related Report
      2011 Final Research Report
  • [Journal Article] On Auto-tuned Pre/postprocessing for the Singular Value Decomposition of Dense Square Matrices2011

    • Author(s)
      Hiroki Toyokawa, Kinji Kimura, Yusaku Yamamoto, Masami Takata, Akira Ajisaka and Yoshimasa Nakamura
    • Journal Title

      情報処理学会論文誌コンピューティングシステム(ACS)

      Volume: Vol.4, No.3 Pages: 9-21

    • NAID

      130000654785

    • URL

      https://www.jstage.jst.go.jp/article/ipsjtrans/4/0/4_0_134/_pdf

    • Related Report
      2011 Final Research Report
  • [Journal Article] 動的計画法を用いたブロックハウスホルダーQR分解アルゴリズムの性能最適化2011

    • Author(s)
      深谷猛, 山本有作, 張紹良
    • Journal Title

      情報処理学会論文誌コンピューティングシステム(ACS)

      Volume: Vol.4, No.4 Pages: 146-157

    • NAID

      40019259189

    • Related Report
      2011 Final Research Report
  • [Journal Article] Development of a High Performance Eigensolver on the Peta-Scale Next Generation Supercomputer System2011

    • Author(s)
      Toshiyuki Imamura, Susumu Yamada, Masahiko Machida
    • Journal Title

      Progress in Nuclear Science and Technology, the Atomic Energy Society of Japan

      Volume: 2 Pages: 643-650

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Direct extension of the density-matrix renormalization group method toward two-dimensional large quantum lattices and related high-performance computing2011

    • Author(s)
      Susumu Yamada, Masahiko Okumura, Toshiyuki Imamura, Masahiko Machida
    • Journal Title

      Japan Journal of Industrial and Applied Mathematics, Area 3

      Volume: 28 Pages: 141-151

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 疎行列-ベクトル積におけるブロック化BSS法と高スレッド並列環境での性能評価2011

    • Author(s)
      片桐孝洋, 佐藤雅彦
    • Journal Title

      情報処理学会論文誌コンピューティングシステム

      Volume: 4 Pages: 1-8

    • NAID

      170000065500

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Cache Optimization of a Non-Orthogonal Joint Diagonalization Method2011

    • Author(s)
      Yusuke Hirota, Yusaku Yamamoto, Shao-Liang Zhang
    • Journal Title

      JSIAM Letters

      Volume: 3 Pages: 9-12

    • NAID

      130000433790

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] On Auto-tuned Pre/postprocessing for the Singular Value Decomposition of Dense Square Matrices2011

    • Author(s)
      Hiroki Toyokawa, Kinji Kimura, Yusaku Yamamoto, Masami Takata, Akira Ajisaka, Yoshimasa Nakamura
    • Journal Title

      情報処理学会論文誌コンピューティングシステム

      Volume: 4 Pages: 9-21

    • NAID

      130000654785

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Optimization of the Multishift QR Algorithm with Coprocessors for Non-Hermitian Eigenvalue Problems2011

    • Author(s)
      Takafumi Miyata, Yusaku Yamamoto, Takashi Uneyama, Yoshimasa Nakamura, Shao-Liang Zhang
    • Journal Title

      East Asian Journal on Applied Mathematics

      Volume: 1 Pages: 187-196

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Block IDR(s) Method for Nonsymmetric Linear Systems with Multiple Right-Hand Sides2011

    • Author(s)
      Lei Du, Tomohiro Sogabe, Bo Yu, Yusaku Yamamoto, Shao-Liang Zhang
    • Journal Title

      Computational and Applied Mathematics

      Volume: 235 Pages: 4095-4106

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Acceleration of Hessenberg Reduction for Nonsymmetric Eigenvalue Problems in a Hybrid CPU-GPU Computing Environment2011

    • Author(s)
      Jun-ichi Muramatsu, Takeshi Fukaya, Shao-Liang Zhang, Kinji Kimura, Yusaku Yamamoto
    • Journal Title

      International Journal of Networking and Computing

      Volume: 1 Pages: 132-143

    • NAID

      130005475258

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 動的計画法を用いたブロックハウスホルダQR分解アルゴリズムの性能最適化2011

    • Author(s)
      深谷猛, 山本有作, 張紹良
    • Journal Title

      情報処理学会論文誌コンピューティングシステム

      Volume: 4 Pages: 146-157

    • NAID

      40019259189

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A parallel algorithm for incremental orthogonalization based on the compact WY representation2011

    • Author(s)
      Yusaku Yamamoto, Yusuke Hirota
    • Journal Title

      JSIAM Letters

      Volume: 3 Pages: 89-92

    • NAID

      130002129403

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Parallel Multigrid Solvers using OpenMP/MPI Hybrid Programming Models on Multi-Core/Multi-Socket Clusters2011

    • Author(s)
      Nakajima, K
    • Journal Title

      Lecture Notes in Computer Science

      Volume: 6449 Pages: 185-199

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] An Implementation of Parallel 3-D FFT with 2-D Decomposition on a Massively Parallel Cluster of Multi-core Processors2011

    • Author(s)
      Daisuke Takahashi
    • Journal Title

      Lecture Notes in Computer Science

      Volume: 6067 Pages: 604-614

    • Related Report
      2011 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Bäcklund Transformation Between Two Integrable Discrete Hungry Systems2011

    • Author(s)
      Fukuda, A., Yamamoto, Y., Iwasaki, M., Ishiwata, E., Nakamura, Y.
    • Journal Title

      Physics Letters A

      Volume: 375(3) Pages: 303-308

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] GPGPUにおけるデータ転送とカーネル実行のヒューリスティックスケジューリング2011

    • Author(s)
      本間咲来, 須田礼仁
    • Journal Title

      情報処理学会研究報告

      Volume: HPC-129-22

    • Related Report
      2010 Annual Research Report
  • [Journal Article] Narrow-band reduction approach of a DRSM eigensolver on a multicore-based cluster system2010

    • Author(s)
      Imamura, T., Yamada, S., and Machida, M.
    • Journal Title

      Parallel Computing : From Multicores and GPU's to Petascale

      Volume: Vol.19 Pages: 91-98

    • Related Report
      2011 Final Research Report
  • [Journal Article] 超並列環境向きの固有値計算アルゴリズムと自動チューニング2010

    • Author(s)
      今村俊幸
    • Journal Title

      応用数理

      Volume: 20 Pages: 26-36

    • Related Report
      2011 Final Research Report 2010 Annual Research Report
  • [Journal Article] ペタフロップス環境における小規模行列用対称密行列固有値ソルバに向けて-逆変換の改良-2010

    • Author(s)
      片桐孝洋
    • Journal Title

      情報処理学会論文誌:ACS

      Volume: Vol.3, No.2 Pages: 1-8

    • NAID

      110007990286

    • Related Report
      2011 Final Research Report
  • [Journal Article] Parallel Multistage Preconditioners by Extended Hierarchical Interface Decomposition for Ill-Conditioned Problems2010

    • Author(s)
      Nakajima, K.
    • Journal Title

      From Multicores and GPU's to Petascale

      Volume: (IOS press) Pages: 99-106

    • Related Report
      2011 Final Research Report
  • [Journal Article] High-Performance Quantum Simulation for Coupled Josephson Junctions on the Earth Simulator : A challenge to Schroedinger Equation on 256^4 Grids2010

    • Author(s)
      Imamura, T., Kano, T., Yamada, S., Okumura, M., Machida, M.
    • Journal Title

      International Journal of High Performance Computing

      Volume: 24(3) Pages: 319-334

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Narrow-band reduction approach of a DRSM eigensolver on a multicore-based cluster system2010

    • Author(s)
      Imamura, T., Yamada, S., Machida, M.
    • Journal Title

      Parallel Computing : From Multicores and GPU's to Petascale

      Volume: 19 Pages: 91-98

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Differential qd Algorithm for Totally Nonnegative Hessenberg Matrices : Introduction of Origin Shifts and Relationship with the Discrete Hungry Lotka-Volterra System2010

    • Author(s)
      Yamamoto, Y., Fukaya T.
    • Journal Title

      JSIAM Letters

      Volume: 2 Pages: 69-72

    • NAID

      130000303923

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 密正方行列特異値分解における並列I-SVD法の特性を用いた後処理の高速化2010

    • Author(s)
      豊川博己, 山本有作, 木村欣司, 高田雅美, 中村佳正
    • Journal Title

      情報処理学会論文誌コンピューティングシステム(ACS)

      Volume: 30(2) Pages: 30-38

    • NAID

      110007990289

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] オフライン自動チューニングの数理手法2010

    • Author(s)
      須田礼仁
    • Journal Title

      情報処理学会研究報告

      Volume: HPC-125-3

    • NAID

      110007995482

    • Related Report
      2010 Annual Research Report
  • [Journal Article] High-Performance Quantum Simulation for Coupled Josephson Junctions on the Earth Simulator : A challenge to Schrodinger Equation on 256^4 Grids2009

    • Author(s)
      Imamura, T., Kano, T., Yamada, S., Okumura, M., Machida, M.
    • Journal Title

      International Journal of High Performance Computing (online)

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] MPI通信ライブラリの自動チューニング2009

    • Author(s)
      今村俊幸
    • Journal Title

      情報処理 50

      Pages: 523-526

    • Related Report
      2009 Annual Research Report
  • [Journal Article] ソフトウェア自動チューニングの数理2009

    • Author(s)
      須田礼仁
    • Journal Title

      情報処理 50

      Pages: 487-493

    • Related Report
      2009 Annual Research Report
  • [Journal Article] 正方行列向け特異値分解のCUDAによる高速化2009

    • Author(s)
      深谷猛, 山本有作, 畝山多加志, 中村佳正
    • Journal Title

      情報処理学会論文誌ACS 2

      Pages: 98-109

    • NAID

      110007990234

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Differential qd Algorithm for Totally Nonnegative Band Matrices : Convergence Properties and Error Analysis2009

    • Author(s)
      Y.Yamamoto, T.Fukaya
    • Journal Title

      JSIAM Letters 1

      Pages: 56-59

    • NAID

      130000133093

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 行列の指数関数に基づく連立線形常微分方程式の大粒度並列解法とその評価2009

    • Author(s)
      則竹渚宇, 今倉暁, 山本有作, 張紹良
    • Journal Title

      日本応用数理学会論文誌 19

      Pages: 293-312

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Presentation] 変動する条件に適応するオンライン自動チューニング2012

    • Author(s)
      須田礼仁
    • Organizer
      日本応用数理学会2011年度年会
    • Place of Presentation
      Doshisha University, Imadegawa Campus
    • Year and Date
      2012-09-15
    • Related Report
      2011 Annual Research Report
  • [Presentation] ASPEN-K2 : Automatic-tuning and Stabilization for the Performance of CUDA BLAS Level 2 Kernels2012

    • Author(s)
      T.Imamura
    • Organizer
      SIAM 15th Conference on Parallel Processing for Scientific Computing (PP12)
    • Place of Presentation
      Savannah, USA
    • Year and Date
      2012-02-15
    • Related Report
      2011 Annual Research Report
  • [Presentation] Dynamical Variation of Eigenvalue Problems in Density-Matrix Renormalization-Group Code2012

    • Author(s)
      S.Yamada, T.Imamura, M.Machida
    • Organizer
      SIAM 15th Conference on Parallel Processing for Scientific Computing (PP12)
    • Place of Presentation
      Savannah, USA
    • Year and Date
      2012-02-15
    • Related Report
      2011 Annual Research Report
  • [Presentation] Coarse Grid Solvers in Parallel Multigrid Methods using OpenMP/MPI Hybrid Programming Models2012

    • Author(s)
      Nakajima, K
    • Organizer
      15th SIAM Conference on Parallel Processing for Scientific Computing (PP12)
    • Place of Presentation
      Savannah, Georgia, USA
    • Year and Date
      2012-02-15
    • Related Report
      2011 Annual Research Report
  • [Presentation] New Strategy for Coarse Grid Solvers in Parallel Multigrid Methods using OpenMP/MPI Hybrid Programming Models2012

    • Author(s)
      Nakajima, K.
    • Organizer
      ACM PPoPP/PMAM
    • Place of Presentation
      New Orleans, LA, USA
    • Related Report
      2011 Annual Research Report
  • [Presentation] 京コンピュータに向けた密行列固有値ソルバーの開発について2012

    • Author(s)
      今村俊幸
    • Organizer
      2012年ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2012)
    • Place of Presentation
      Nagoya University, Toyoda Auditorium, Symposion Hall (invited talk)
    • Related Report
      2011 Annual Research Report
  • [Presentation] 自動チューニングによる通信最適化を施した固有値ソルバの開発について2012

    • Author(s)
      近藤大貴, 吉田剛啓, 田村遼也, 今村俊幸
    • Organizer
      第133回ハイパフォーマンスコンピューティング研究発表会
    • Place of Presentation
      Arima View Hotel Urara
    • Related Report
      2011 Annual Research Report
  • [Presentation] Towards Auto-tuning Description Language to Heterogeneous Computing Environment2011

    • Author(s)
      Takahiro Katagiri
    • Organizer
      Fifth International Workshop on High-level Parallel Programming and Applications (HLPP 2011)
    • Place of Presentation
      Tokyo
    • Year and Date
      2011-09-18
    • Related Report
      2011 Annual Research Report
  • [Presentation] GPGPUにおけるデータ転送とカーネル実行のヒューリスティックスケジューリング2011

    • Author(s)
      本間咲来, 須田礼仁
    • Organizer
      情報処理学会HPC研究会
    • Place of Presentation
      The University of Tokyo
    • Year and Date
      2011-05-11
    • Related Report
      2011 Annual Research Report
  • [Presentation] Performance Evaluation for a Dense Eigenvalue Solver for the Next-generation Petascale System2011

    • Author(s)
      Imamura, T., Pham, H.P., Yamada, S., Machida, M.
    • Organizer
      SIAM CSE2011
    • Place of Presentation
      Reno, USA
    • Year and Date
      2011-03-01
    • Related Report
      2010 Annual Research Report
  • [Presentation] Parallelization Design for Multi-core Platforms in Density Matrix Renormalization Group toward 2-D Quantum Strongly-correlated Systems2011

    • Author(s)
      Susumu Yamada, Toshiyuki Imamura, and Masahiko Machida
    • Organizer
      ACM/IEEE the International Conference for High Performance Computing
    • Place of Presentation
      USB-memory
    • Related Report
      2011 Final Research Report
  • [Presentation] Auto-tuning for BLAS-based Matrix Computations2011

    • Author(s)
      Takeshi Fukaya, Yusaku Yamamoto, and Shao-Liang Zhang
    • Organizer
      SIAM Conference on Computational Science and Engineering(CSE11)
    • Place of Presentation
      Nevada
    • Related Report
      2011 Final Research Report
  • [Presentation] Parallelization Design for Multi-core Platforms in Density Matrix Renormalization Group toward 2-D Quantum Strongly-correlated Systems2011

    • Author(s)
      Susumu Yamada, Toshiyuki Imamura, Masahiko Machida
    • Organizer
      ACM/IEEE the International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)
    • Place of Presentation
      Seattle, USA
    • Related Report
      2011 Annual Research Report
  • [Presentation] An Auto-tuning Method for Run-time Data Transformation for Sparse Matrix-Vector Multiplication2011

    • Author(s)
      Takahiro Katagiri, Masahiko Sato
    • Organizer
      第130回HPC研究会,2011年並列/分散/協調処理に関する『鹿児島』サマー・ワークショップ(SWoPP鹿児島2011)
    • Place of Presentation
      Kagoshima Kenmin Koryu Center
    • Related Report
      2011 Annual Research Report
  • [Presentation] Automatic Performance Tuning for the Blocked Householder QR Algorithm2011

    • Author(s)
      Takeshi Fukaya, Yusaku Yamamoto, Shao-Liang Zhang
    • Organizer
      The 7th East Asia SIAM Conference & RIMS Workshop on Methods in Industrial and Applied Mathematics
    • Place of Presentation
      Waseda University, Kitakyushu Campus, Kitakyushu, Fukuoka
    • Related Report
      2011 Annual Research Report
  • [Presentation] Infrastructure for Application Development on Heterogeneous Parallel Computers2011

    • Author(s)
      Nakajima, K., Hayashi, M., Ohshima, S
    • Organizer
      7th International Congress on Industrial and Applied Mathematics (ICIAM 2011)
    • Place of Presentation
      Vancouver, Canada
    • Related Report
      2011 Annual Research Report
  • [Presentation] 階層型領域間境界分割に基づくハイブリッド並列プログラミングモデル向け前処理手法2011

    • Author(s)
      中島研吾, 林雅江, 大島聡史
    • Organizer
      日本応用数理学会「行列・固有値問題の解法とその応用」研究部会(MEPA)
    • Place of Presentation
      Kagoshima Kenmin Koryu Center
    • Related Report
      2011 Annual Research Report
  • [Presentation] 疎行列・ベクトル積におけるブロック化BSS法と高スレッド並列環境での性能評価2011

    • Author(s)
      片桐孝洋、佐藤雅彦
    • Organizer
      2011年ハイパフォーマンスコンピューティングと計算科学論文集HPCS2011
    • Place of Presentation
      National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba
    • Related Report
      2010 Annual Research Report
  • [Presentation] eigen_sg:マルチコア+GPGPU環境における固有値ソルバ開発2011

    • Author(s)
      今村俊幸, 山田進, 町田昌彦
    • Organizer
      2011年ハイパフォーマンスコンピューティングと計算科学論文集HPCS2011
    • Place of Presentation
      National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba
    • Related Report
      2010 Annual Research Report
  • [Presentation] テラからペタスケール環境での密行列固有値ソルバの性能予測2010

    • Author(s)
      今村俊幸
    • Organizer
      第二回特異値・固有値ワークショップ
    • Place of Presentation
      University of Tsukuba (invited talk)
    • Year and Date
      2010-11-27
    • Related Report
      2010 Annual Research Report
  • [Presentation] Development of a High Performance Eigensolver beyond Peta-scale Supercomputer Systems2010

    • Author(s)
      Imamura, T., Yamada, S., Machida, M.
    • Organizer
      Minisymposium organized by JAEA at SC10
    • Place of Presentation
      New Orleans, USA (invited talk)
    • Year and Date
      2010-11-17
    • Related Report
      2010 Annual Research Report
  • [Presentation] 疎行列反復解法ライブラリにおける自動チューニング機能の開発2010

    • Author(s)
      片桐孝洋
    • Organizer
      京都大学数理解析研究所研究集会
    • Place of Presentation
      Research Institute for Mathematical Sciences, Kyoto University
    • Year and Date
      2010-10-19
    • Related Report
      2011 Final Research Report
  • [Presentation] 疎行列反復解法ライブラリにおける自動チューニング機能の開発2010

    • Author(s)
      片桐孝洋
    • Organizer
      京都大学数理解析研究所研究集会、科学技術計算アルゴリズムの数理的基盤と展開(代表者:大石進一(早稲田大学))
    • Place of Presentation
      Kyoto University (invited talk)
    • Year and Date
      2010-10-19
    • Related Report
      2010 Annual Research Report
  • [Presentation] マルチコア時代の並列前処理手法2010

    • Author(s)
      中島研吾
    • Organizer
      京都大学数理解析研究所研究集会、科学技術計算アルゴリズムの数理的基盤と展開(代表者:大石進一(早稲田大学))
    • Place of Presentation
      Kyoto University
    • Year and Date
      2010-10-19
    • Related Report
      2010 Annual Research Report
  • [Presentation] A Dynamic Programming Approach to Auto-Tuning the Blocking Strategy For the Householder QR Decomposition2010

    • Author(s)
      Takeshi Fukaya, Yusaku Yamamoto, and Shao-Liang Zhang
    • Organizer
      Workshop on Advanced Auto-tuning on Numerical Software(AANS2010)
    • Place of Presentation
      Tokyo
    • Year and Date
      2010-08-02
    • Related Report
      2011 Final Research Report
  • [Presentation] Automatic Tuning for Parallel 3-D FFTs2010

    • Author(s)
      Daisuke Takahashi
    • Organizer
      2010 SIAM Annual Meeting
    • Place of Presentation
      USA
    • Year and Date
      2010-07-16
    • Related Report
      2011 Final Research Report
  • [Presentation] Automatic Tuning for Parallel 3-D FFTs2010

    • Author(s)
      Takahashi, D.
    • Organizer
      2010 SIAM Annual Meeting
    • Place of Presentation
      Pittsburgh, USA
    • Year and Date
      2010-07-16
    • Related Report
      2010 Annual Research Report
  • [Presentation] Challenges of Run-time Auto-tuning for Sparse Iterative Solvers2010

    • Author(s)
      Takahiro Katagiri
    • Organizer
      Fifth International Workshop on Automatic Performance Tuning(iWAPT2010)
    • Place of Presentation
      Berkeley, California, USA
    • Year and Date
      2010-06-22
    • Related Report
      2011 Final Research Report
  • [Presentation] オフライン自動チューニングの数理手法2010

    • Author(s)
      須田礼仁
    • Organizer
      情報処理学会HPC研究会
    • Place of Presentation
      The University of Tokyo
    • Year and Date
      2010-06-17
    • Related Report
      2011 Final Research Report
  • [Presentation] Parallel Multigrid Solvers using OpenMP/MPI Hybrid Programming Models on Multi-Core/Multi-Socket Clusters2010

    • Author(s)
      中島研吾
    • Organizer
      SIAM 14th Conference on Parallel Processing for Scientific Computing (PP10), MS 55 : Joint JSIAM-SIAM Minisymposium : Parallel Programming Models and Algorithms for Multicore Clusters and GPGPUs-Part II of III
    • Place of Presentation
      Seattle, USA
    • Year and Date
      2010-02-26
    • Related Report
      2009 Annual Research Report
  • [Presentation] High Performance and High Scalable Eigenvalue Solver on a Peta-scale Computing Environment2010

    • Author(s)
      Imamura, T., Pham, H.P., Yamada, S., Machida, M.
    • Organizer
      SIAM 14th Conference on Parallel Processing for Scientific Computing (PP10), MS42 Joint JSIAM--SIAM Minisymposium : The State-of-the-art of Auto-tuning Technologies : Adaptation to Advanced Computer Environment and Numerical Libraries-Part II of II
    • Place of Presentation
      Seattle, USA
    • Year and Date
      2010-02-25
    • Related Report
      2009 Annual Research Report
  • [Presentation] Automatic Tuning for Parallel 3-D FFT with 2-D Decomposition2010

    • Author(s)
      高橋大介
    • Organizer
      SIAM 14th Conference on Parallel Processing for Scientific Computing (PP10), MS42 Joint JSIAM--SIAM Minisymposium : The State-of-the-art of Auto-tuning Technologies : Adaptation to Advanced Computer Environment and Numerical Libraries-Part II of II
    • Place of Presentation
      Seattle, USA
    • Year and Date
      2010-02-25
    • Related Report
      2009 Annual Research Report
  • [Presentation] Acceleration of two dimensional FMM using GPU2010

    • Author(s)
      Yoshida, S., and Imamura, T
    • Organizer
      International Workshop on Modern Science and Technology 2010 (IWMST 2010)
    • Place of Presentation
      Kitami, Japan
    • Related Report
      2011 Final Research Report
  • [Presentation] Performance Modeling of Multishift QR Algorithms for the Parallel Solution of Symmetric Tridiagonal Eigenvalue Problems2010

    • Author(s)
      Takafumi Miyata, Yusaku Yamamoto and Shao-Liang Zhang
    • Organizer
      M2A2 2010
    • Place of Presentation
      Busan
    • Related Report
      2011 Final Research Report
  • [Presentation] Parallel Multigrid Solvers using OpenMP/MPI Hybrid Programming Models on Multi-Core/Multi-Socket Clusters2010

    • Author(s)
      Nakajima, K.
    • Organizer
      2nd International Kyoto Forum on Krylov Subspace Methods(2010. 3)
    • Related Report
      2011 Final Research Report
  • [Presentation] Parallel Multigrid Solvers using OpenMP/MPI Hybrid Programming Models on Multi-Core/Multi-Socket Clusters2010

    • Author(s)
      Nakajima, K.
    • Organizer
      11th Copper Mountain Conference on Iterative Methods
    • Place of Presentation
      USA
    • Related Report
      2011 Final Research Report
  • [Presentation] Software Automatic Tuning : From Concepts to the State-of-the-Art Results2010

    • Author(s)
      R. Suda, K. Naono, K. Teranishi and J. Cavazos
    • Related Report
      2011 Final Research Report
  • [Presentation] An Implementation of Parallel 3-D FFT with 2-D Decomposition on a Massively Parallel Cluster of Multi-core Processors2010

    • Author(s)
      Daisuke Takahashi
    • Organizer
      Proc. 8th International Conference on Parallel Processing and Applied Mathematics(PPAM 2009)
    • Related Report
      2011 Final Research Report
  • [Presentation] 並列FFTにおける自動チューニング2010

    • Author(s)
      高橋大介
    • Organizer
      日本応用数理学会2010年度年会講演予稿集
    • Related Report
      2011 Final Research Report
  • [Presentation] A Massively Parallel Dense Symmetric Eigensolver with Communication Splitting Multicasting Algorithm2010

    • Author(s)
      Katagiri, T., Itoh, S.
    • Organizer
      High Performance Computing for Computational Science-VECPAR 2010
    • Place of Presentation
      Berkeley, USA
    • Related Report
      2010 Annual Research Report
  • [Presentation] Parallel Multigrid Solvers using OpenMP/MPI Hybrid Parallel Programming Models on Multi-Core/Multi-Socket Clusters2010

    • Author(s)
      Nakajima, K.
    • Organizer
      High Performance Computing for Computational Science-VECPAR 2010
    • Place of Presentation
      Berkeley, USA
    • Related Report
      2010 Annual Research Report
  • [Presentation] Infrastructure for Development of Codes in Scientific Computing on Post-Peta-Scale Systems2010

    • Author(s)
      Nakajima, K
    • Organizer
      Open Workshop by IPAB (Initiative for Parallel Bioinformatics)
    • Place of Presentation
      Nara, Japan
    • Related Report
      2010 Annual Research Report
  • [Presentation] ペタスケール計算環境に向けたFFTライブラリ2010

    • Author(s)
      高橋大介
    • Organizer
      計算工学講演会
    • Place of Presentation
      Kyushu University
    • Related Report
      2010 Annual Research Report
  • [Presentation] Development of a high performance eigensolver on the peta-scale next generation supercomputer system2010

    • Author(s)
      Imamura, T., Yamada, S., Machida, M
    • Organizer
      Joint International Conference on Supercomputing in Nuclear Applications and Monte Carlo 2010(SNA+MC2010)
    • Place of Presentation
      Tokyo, Japan
    • Related Report
      2010 Annual Research Report
  • [Presentation] Novel approach in a divide and conquer algorithm for eigenvalue problems of real symmetric band matrices2010

    • Author(s)
      Pham, H.P., Imamura, T., Yamada, S., Machida, M.
    • Organizer
      Joint International Conference on Supercomputing in Nuclear Applications and Monte Carlo 2010 (SNA+MC2010)
    • Place of Presentation
      Tokyo, Japan
    • Related Report
      2010 Annual Research Report
  • [Presentation] High Performance Computing of Density Matrix Renormalization Group Method for 2-Dimensional Model : Parallelization Strategy toward Peta Computing2010

    • Author(s)
      Yamada, S., Imamura, T., Okumura, M., Igarashi, R., Onishi, H., Machida, M.
    • Organizer
      Joint International Conference on Supercomputing in Nuclear Applications and Monte Carlo 2010(SNA+MC2010)
    • Place of Presentation
      Tokyo, Japan
    • Related Report
      2010 Annual Research Report
  • [Presentation] Acceleration of two dimensional FMM using GPU2010

    • Author(s)
      Yoshida, S., Imamura, T.
    • Organizer
      International Workshop on Modern Science and Technology 2010 (IWMST 2010)
    • Place of Presentation
      Kitami, Japan
    • Related Report
      2010 Annual Research Report
  • [Presentation] ペタスケール環境での高並列固有値ソルバの開発, 日本計算工学会計算工学講演会論文集2010

    • Author(s)
      今村俊幸
    • Organizer
      日本計算工学会
    • Place of Presentation
      Kyushu University
    • Related Report
      2010 Annual Research Report
  • [Presentation] 拡張階層型領域間境界分割に基づく悪条件問題向け並列前処理手法2010

    • Author(s)
      中島研吾
    • Organizer
      ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2010
    • Place of Presentation
      Kogakuin University
    • Related Report
      2009 Annual Research Report
  • [Presentation] GPUを利用した2次元FMMの高速化2010

    • Author(s)
      吉田晋三, 今村俊幸
    • Organizer
      日本応用数理学会 2010年 研究部会連合発表会
    • Place of Presentation
      University of Tsukuba
    • Related Report
      2009 Annual Research Report
  • [Presentation] ソフトウェア自動チューニング : パソコンからスパコンまでの先進最適化技術~数値計算ライブラリを中心に~2009

    • Author(s)
      片桐孝洋
    • Organizer
      情報処理学会東北支部、第350回研究講演会
    • Place of Presentation
      Akita Prefectural University, Honjo Campus
    • Year and Date
      2009-12-18
    • Related Report
      2009 Annual Research Report
  • [Presentation] Cell/B. E.による倍精度粒子法の高速化2009

    • Author(s)
      今村俊幸,木村光宏
    • Organizer
      FAISマルチコアワークショップ2009
    • Year and Date
      2009-10-29
    • Related Report
      2011 Final Research Report
  • [Presentation] Evaluation of Parallel Programming Models for Preconditioned Iterative Solvers on "T2K Open Supercomputer"2009

    • Author(s)
      Nakajima, K.
    • Organizer
      IEEE Proceedings of the 38th International Conference on Parallel Processing(ICPP-09)
    • Related Report
      2011 Final Research Report
  • [Presentation] Auto Tuning Method for Deciding Block Size Parameters in Dynamically Load-balanced BLAS2009

    • Author(s)
      Yuta Sawa, and Reiji Suda
    • Organizer
      Proceedings of 4th international Workshop on Automatic Performance Tuning
    • Related Report
      2011 Final Research Report
  • [Presentation] Flat MPI vs. Hybrid : Evaluation of Parallel Programming Models for Preconditioned Iterative Solvers on "T2K Open Supercomputer"2009

    • Author(s)
      中島研吾
    • Organizer
      the 38th International Conference on Parallel Processing (ICPP-09) (Second International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2))
    • Place of Presentation
      Vienna, Austria
    • Related Report
      2009 Annual Research Report
  • [Presentation] An Implementation of Parallel 3-D FFT with 2-D Decomposition on a Massively Parallel Cluster of Multi-Core Processors2009

    • Author(s)
      高橋大介
    • Organizer
      8th International Conference on Parallel Processing and Applied Mathematics (PPAM 2009)
    • Place of Presentation
      Wroclaw, Poland
    • Related Report
      2009 Annual Research Report
  • [Presentation] Narrow-band reduction approach of a DRSM eigensolver on a multicore-based cluster system2009

    • Author(s)
      Imamura, T., Yamada, S., Machida, M.
    • Organizer
      International Parallel Computing conference 2009
    • Place of Presentation
      Lyon, France
    • Related Report
      2009 Annual Research Report
  • [Presentation] ペタスケール環境における固有値ソルバについて2009

    • Author(s)
      今村俊幸
    • Organizer
      日本応用数理学会2009年度年会
    • Place of Presentation
      Osaka University, Toyonaka Campus
    • Related Report
      2009 Annual Research Report
  • [Presentation] 並列計算機におけるソフトウェア自動チューニングのための数理モデル2009

    • Author(s)
      須田礼仁
    • Organizer
      日本応用数理学会2009年度年会
    • Place of Presentation
      Osaka University, Toyonaka Campus
    • Related Report
      2009 Annual Research Report
  • [Presentation] マルチコア・超並列計算機時代の自動チューニング機能付き疎行列反復解法ソルバ2009

    • Author(s)
      片桐孝洋, 黒田久泰
    • Organizer
      日本応用数理学会2009年度年会
    • Place of Presentation
      Osaka University, Toyonaka Campus
    • Related Report
      2009 Annual Research Report
  • [Presentation] ペタスケール計算環境に向けたFFTライブラリ2009

    • Author(s)
      高橋大介
    • Organizer
      日本応用数理学会2009年度年会
    • Place of Presentation
      Osaka University, Toyonaka Campus
    • Related Report
      2009 Annual Research Report
  • [Presentation] 10万超コアを駆使する固有値ソルバについての検討2009

    • Author(s)
      今村俊幸
    • Organizer
      2009年並列/分散/協調処理に関する『仙台』サマー・ワークショップ (SWOPP2009)
    • Place of Presentation
      Sendai
    • Related Report
      2009 Annual Research Report
  • [Presentation] マルチコア環境における密および疎行列ソルバの自動チューニング機構の評価2009

    • Author(s)
      片桐孝洋, 黒田久泰
    • Organizer
      2009年並列/分散/協調処理に関する『仙台』サマー・ワークショップ (SWOPP2009)
    • Place of Presentation
      Sendai
    • Related Report
      2009 Annual Research Report
  • [Presentation] 大規模並列固有値計算の現状 (ペタスケール計算機環境への展望)2009

    • Author(s)
      今村俊幸, 山田進, 町田昌彦
    • Organizer
      日本学術会議「機械工学委員会, 土木工学・建築学委員会合同IUTAM分科会」
    • Place of Presentation
      Tokyo
    • Related Report
      2009 Annual Research Report
  • [Book] Autotuning Method for Deciding Block Size Parameters in Dynamically Load-balanced BLAS2010

    • Author(s)
      Y. Sawa and R. Suda
    • Publisher
      Springer
    • Related Report
      2011 Final Research Report
  • [Book] Software Automatic Tuning(Concepts and State-of-the-Art Results)2010

    • Author(s)
      Suda, R., Naono, K., Teranishi, K., Cavazos, J.
    • Total Pages
      377
    • Publisher
      Springer Verlag
    • Related Report
      2010 Annual Research Report


Published: 2009-04-01   Modified: 2016-04-21  
