Parallel Computation Theory for Memory Machine Models and Next Generation GPGPU Architecture

Research Project

Project/Area Number 26280002
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation Type Partial Multi-year Fund
Section General
Research Field Theory of informatics
Research Institution Hiroshima University

Principal Investigator

Nakano Koji  Hiroshima University, Graduate School of Engineering, Professor (30281075)

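Co-Investigator (Kenkyū-buntansha) Takafuji Daisuke  Hiroshima University, Graduate School of Engineering, Assistant Professor (00314732)
Ito Yasuaki  Hiroshima University, Graduate School of Engineering, Associate Professor (40397964)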
Project Period (FY) 2014-04-01 – 2019-03-31
Project Status Completed (Fiscal Year 2018)
Budget Amount
¥15,990,000 (Direct Cost: ¥12,300,000, Indirect Cost: ¥3,690,000)
Fiscal Year 2018: ¥3,120,000 (Direct Cost: ¥2,400,000, Indirect Cost: ¥720,000)
Fiscal Year 2017: ¥2,860,000 (Direct Cost: ¥2,200,000, Indirect Cost: ¥660,000)
Fiscal Year 2016: ¥3,120,000 (Direct Cost: ¥2,400,000, Indirect Cost: ¥720,000)
Fiscal Year 2015: ¥2,860,000 (Direct Cost: ¥2,200,000, Indirect Cost: ¥660,000)
Fiscal Year 2014: ¥4,030,000 (Direct Cost: ¥3,100,000, Indirect Cost: ¥930,000)
Keywords Parallel Algorithms / GPGPU / Memory Machine Models / Parallel Processing / Theoretical Computational Models / Parallel Computers / High-Performance Computing / Computer Systems / Computational Models
Outline of Final Research Achievements

The GPU (Graphics Processing Unit) is a specialized circuit designed to accelerate the computation needed to build and manipulate images. Recent GPUs are also designed for general-purpose computing and can perform computations in applications traditionally handled by the CPU. The main purpose of this research is to propose appropriate theoretical models for GPU computing, develop efficient parallel algorithms based on those models, and evaluate their performance. We developed three theoretical models, the Discrete Memory Machine, the Unified Memory Machine, and the Hierarchical Memory Machine, which capture the essence of memory access to the shared memory and the global memory of the GPU. Based on these models, we developed many efficient algorithms for the GPU. In particular, we developed a new technique called SKSS (Single Kernel Soft Synchronization) and showed that it can be applied to accelerate dynamic programming algorithms on the GPU.

Academic Significance and Societal Importance of the Research Achievements

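By proposing theoretical models of the GPU, we have provided a basis on which theoretical researchers can study algorithms for the GPU. Until now, implementing parallel algorithms for the GPU has been difficult for theoretical researchers of parallel algorithms, but these models make it easier to study parallel processing techniques on the GPU. In addition, based on these models, the principal investigator and colleagues proposed several concrete algorithmic techniques for the GPU, such as SKSS (Single Kernel Soft Synchronization), and demonstrated their effectiveness through implementation experiments on the GPU.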

Report

(6 results)
  • 2018 Annual Research Report / Final Research Report (PDF)
  • 2017 Annual Research Report
  • 2016 Annual Research Report
  • 2015 Annual Research Report
  • 2014 Annual Research Report
  • Research Products

    (51 results)

Int'l Joint Research (4 results), Journal Article (15 results, of which Int'l Joint Research: 3 results, Peer Reviewed: 15 results, Open Access: 3 results), Presentation (30 results, of which Int'l Joint Research: 22 results), Funded Workshop (2 results)

  • [Int'l Joint Research] University of Brasilia (Brazil)

    • Related Report
      2018 Annual Research Report
  • [Int'l Joint Research] University of Brasilia (Brazil)

    • Related Report
      2017 Annual Research Report
  • [Int'l Joint Research] University of Brasilia (Brazil)

    • Related Report
      2016 Annual Research Report
  • [Int'l Joint Research] University of Brasilia (Brazil)

    • Related Report
      2015 Annual Research Report
  • [Journal Article] Almost Optimal Column-wise Prefix-sum Computation on the GPU (2018)

    • Author(s)
      Tokura Hiroki、Fujita Toru、Nakano Koji、Ito Yasuaki
    • Journal Title

      The Journal of Supercomputing

      Volume: 74 Pages: 224-233

    • DOI

      10.1007/978-3-319-78054-2_21

    • ISBN
      9783319780535, 9783319780542
    • Related Report
      2018 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Accelerating digital halftoning using the local exhaustive search on the GPU (2017)

    • Author(s)
      Hiroaki Kouge, Takumi Honda, Toru Fujita, Yasuaki Ito, Koji Nakano, Jacir Luiz Bordim
    • Journal Title

      Concurrency and Computation: Practice and Experience

      Volume: 29 Issue: 2

    • DOI

      10.1002/cpe.3781

    • Related Report
      2017 Annual Research Report 2016 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] C2CU: a CUDA C program generator for bulk execution of a sequential algorithm (2017)

    • Author(s)
      Daisuke Takafuji, Koji Nakano, Yasuaki Ito, Jacir Luiz Bordim
    • Journal Title

      Concurrency and Computation: Practice and Experience

      Volume: 29 Issue: 17

    • DOI

      10.1002/cpe.4022

    • NAID

      110009971685

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Adaptive loss-less data compression method optimized for GPU decompression (2017)

    • Author(s)
      Shunji Funasaka, Koji Nakano, Yasuaki Ito
    • Journal Title

      Concurrency and Computation: Practice and Experience

      Volume: 29 Issue: 24

    • DOI

      10.1002/cpe.4283

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] An Efficient GPU Implementation of CKY Parsing Using the Bitwise Parallel Bulk Computation Technique (2017)

    • Author(s)
      Toru Fujita, Koji Nakano, Yasuaki Ito, Daisuke Takafuji
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E100.D Issue: 12 Pages: 2857-2865

    • DOI

      10.1587/transinf.2017PAP0018

    • NAID

      130006236562

    • ISSN
      0916-8532, 1745-1361
    • Related Report
      2017 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] GPU-accelerated Exhaustive Verification of the Collatz Conjecture (2017)

    • Author(s)
      Takumi Honda, Yasuaki Ito, Koji Nakano
    • Journal Title

      International Journal of Networking and Computing

      Volume: 7 Issue: 1 Pages: 69-85

    • DOI

      10.15803/ijnc.7.1_69

    • NAID

      130005331502

    • ISSN
      2185-2839, 2185-2847
    • Related Report
      2017 Annual Research Report 2016 Annual Research Report
    • Peer Reviewed
  • [Journal Article] An Efficient GPU Implementation of Bulk Computation of the Eigenvalue Problem for Many Small Real Non-symmetric Matrices (2017)

    • Author(s)
      Hiroki Tokura, Takumi Honda, Yasuaki Ito, Koji Nakano, Mitsuya Nishino, Yushiro Hirota, Masami Saeki
    • Journal Title

      International Journal of Networking and Computing

      Volume: 7 Issue: 2 Pages: 227-247

    • DOI

      10.15803/ijnc.7.2_227

    • NAID

      130006135389

    • ISSN
      2185-2839, 2185-2847
    • Related Report
      2017 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Fast Simulation of Conway's Game of Life using Bitwise Parallel Bulk Computation on a GPU (2016)

    • Author(s)
      Toru Fujita, Koji Nakano, and Yasuaki Ito
    • Journal Title

      International Journal of Foundations of Computer Science

      Volume: 27 Issue: 08 Pages: 981-1003

    • DOI

      10.1142/s0129054116500404

    • Related Report
      2016 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Memory-Access-Efficient Implementation for Computing the Approximate String Matching Algorithm on GPUs (2016)

    • Author(s)
      Lucas Saad Nogueira Nunes, Jacir Luiz Bordim, Koji Nakano, and Yasuaki Ito
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E99.D Issue: 12 Pages: 2995-3003

    • DOI

      10.1587/transinf.2016PAP0024

    • NAID

      130005171004

    • ISSN
      0916-8532, 1745-1361
    • Related Report
      2016 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] GPU-Accelerated Bulk Execution of Multiple-Length Multiplication with Warp-Synchronous Programming Technique (2016)

    • Author(s)
      Takumi Honda, Yasuaki Ito, and Koji Nakano
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E99.D Issue: 12 Pages: 3004-3012

    • DOI

      10.1587/transinf.2016PAP0027

    • NAID

      130005170988

    • ISSN
      0916-8532, 1745-1361
    • Related Report
      2016 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Bulk execution of Euclidean algorithms on the CUDA-enabled GPU (2016)

    • Author(s)
      Toru Fujita, Koji Nakano, Yasuaki Ito
    • Journal Title

      International Journal of Networking and Computing

      Volume: 6 Issue: 1 Pages: 42-63

    • DOI

      10.15803/ijnc.6.1_42

    • NAID

      130005123823

    • ISSN
      2185-2839, 2185-2847
    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] A character art generator using the local exhaustive search, with GPU acceleration (2016)

    • Author(s)
      Yuji Takeuchi, Koji Nakano, Daisuke Takafuji, Yasuaki Ito
    • Journal Title

      International Journal of Parallel, Emergent and Distributed Systems

      Volume: 31 Issue: 1 Pages: 3063-3071

    • DOI

      10.1080/17445760.2014.962026

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed
  • [Journal Article] An Optimal Implementation of the Approximate String Matching on the Hierarchical Memory Machine, with Performance Evaluation on the GPU (2014)

    • Author(s)
      Duhu MAN, Koji NAKANO, Yasuaki ITO
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E97.D Issue: 12 Pages: 3063-3071

    • DOI

      10.1587/transinf.2014PAP0011

    • NAID

      130004841764

    • ISSN
      0916-8532, 1745-1361
    • Related Report
      2014 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Offline Permutation on the CUDA-enabled GPU (2014)

    • Author(s)
      Akihiko KASAGI, Koji NAKANO, Yasuaki ITO
    • Journal Title

      IEICE Transactions on Information and Systems

      Volume: E97.D Issue: 12 Pages: 3052-3062

    • DOI

      10.1587/transinf.2014PAP0010

    • NAID

      130004841763

    • ISSN
      0916-8532, 1745-1361
    • Related Report
      2014 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Accelerating ant colony optimisation for the travelling salesman problem on the GPU (2014)

    • Author(s)
      Akihiro Uchida, Yasuaki Ito, Koji Nakano
    • Journal Title

      International Journal of Parallel, Emergent and Distributed Systems

      Volume: 29 Issue: 4 Pages: 401-420

    • DOI

      10.1080/17445760.2013.842568

    • Related Report
      2014 Annual Research Report
    • Peer Reviewed
  • [Presentation] An Optimal Parallel Algorithm for Computing the Summed Area Table on the GPU (2018)

    • Author(s)
      Yutaro Emoto, Shunji Funasaka, Hiroki Tokura, Takumi Honda, Koji Nakano, Yasuaki Ito
    • Organizer
      Workshop on Advances in Parallel and Distributed Computational Models
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Tile Art Image Generation Using Conditional Generative Adversarial Networks (2018)

    • Author(s)
      Naoki Matsumura, Hiroki Tokura, Yuki Kuroda, Yasuaki Ito, Koji Nakano
    • Organizer
      Sixth International Symposium on Computing and Networking Workshops
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A Prefix-Sum-Based Rabin-Karp Implementation for Multiple Pattern Matching on GPGPU (2018)

    • Author(s)
      Lucas Saad N. Nunes, Jacir Luiz Bordim, Yasuaki Ito, Koji Nakano
    • Organizer
      Sixth International Symposium on Computing and Networking
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Efficient Byte Stream Pattern Test using Bloom Filter with Rolling Hash Functions on the FPGA (2018)

    • Author(s)
      Takuma Wada, Naoki Matsumura, Koji Nakano, Yasuaki Ito
    • Organizer
      Sixth International Symposium on Computing and Networking
    • Related Report
      2018 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Single Kernel Soft Synchronization Technique for Task Arrays on CUDA-enabled GPUs, with Applications (2017)

    • Author(s)
      Shunji Funasaka, Koji Nakano, Yasuaki Ito
    • Organizer
      International Symposium on Computing and Networking
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A Square Pointillism Image Generation, and Its GPU Acceleration (2017)

    • Author(s)
      Hiroki Tokura, Yuki Kuroda, Yasuaki Ito, Koji Nakano
    • Organizer
      International Symposium on Computing and Networking
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU Implementations (2017)

    • Author(s)
      Takumi Honda, Shinnosuke Yamamoto, Hiroaki Honda, Koji Nakano, Yasuaki Ito
    • Organizer
      International Conference on Parallel Processing
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Accelerating the Smith-Waterman Algorithm Using Bitwise Parallel Bulk Computation Technique on GPU (2017)

    • Author(s)
      Takahiro Nishimura, Jacir Luiz Bordim, Yasuaki Ito, Koji Nakano
    • Organizer
      International Parallel and Distributed Processing Symposium Workshops
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Photomosaic Generation by Rearranging Subimages, with GPU Acceleration (2017)

    • Author(s)
      Yi Yang, Yasuaki Ito, Koji Nakano
    • Organizer
      International Parallel and Distributed Processing Symposium Workshops
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Almost Optimal Column-wise Prefix-sum Computation on the GPU (2017)

    • Author(s)
      Hiroki Tokura, Toru Fujita, Koji Nakano, Yasuaki Ito
    • Organizer
      International Conference on Parallel Processing and Applied Mathematics
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A GPU Implementation of Bulk Execution of the Dynamic Programming for the Optimal Polygon Triangulation (2017)

    • Author(s)
      Kohei Yamashita, Yasuaki Ito, Koji Nakano
    • Organizer
      International Conference on Parallel Processing and Applied Mathematics
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Light Loss-Less Data Compression, with GPU implementation (2016)

    • Author(s)
      Shunji Funasaka, Koji Nakano and Yasuaki Ito
    • Organizer
      International Conference on Algorithms and Architectures for Parallel Processing
    • Place of Presentation
      Spain
    • Year and Date
      2016-12-14
    • Related Report
      2016 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Accelerating Ant Colony Optimization for the Vertex Coloring Problem on the GPU (2016)

    • Author(s)
      Ryouhei Murooka, Yasuaki Ito and Koji Nakano
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Hiroshima, Japan
    • Year and Date
      2016-11-22
    • Related Report
      2016 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Efficient GPU implementations for the Conway's Game of Life (2015)

    • Author(s)
      Toru Fujita, Daigo Nishikori, Koji Nakano, Yasuaki Ito
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Sapporo, Japan
    • Year and Date
      2015-12-08
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A flexible-length-arithmetic processor based on FDFM approach in FPGAs (2015)

    • Author(s)
      Tatsuya Kawamoto, Yasuaki Ito, Koji Nakano
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Sapporo, Japan
    • Year and Date
      2015-12-08
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Parallelization Techniques for Error Diffusion with GPU Implementations (2015)

    • Author(s)
      Akihiko Kasagi, Koji Nakano, Yasuaki Ito
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Sapporo, Japan
    • Year and Date
      2015-12-08
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A Fast Approximate String Matching Algorithm on GPU (2015)

    • Author(s)
      Lucas Saad, Jacir Bordim, Koji Nakano, Yasuaki Ito
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Sapporo, Japan
    • Year and Date
      2015-12-08
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A Warp-synchronous Implementation for Multiple-length Multiplication on the GPU (2015)

    • Author(s)
      Takumi Honda, Yasuaki Ito, Koji Nakano
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Sapporo, Japan
    • Year and Date
      2015-12-08
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Fast LZW compression using a GPU (2015)

    • Author(s)
      Shunji Funasaka, Koji Nakano, Yasuaki Ito
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Sapporo, Japan
    • Year and Date
      2015-12-08
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] A Parallel Algorithm for LZW Decompression, with GPU Implementation (2015)

    • Author(s)
      Shunji Funasaka, Koji Nakano, Yasuaki Ito
    • Organizer
      International Conference on Parallel Processing and Applied Mathematics
    • Place of Presentation
      Krakow, Poland
    • Year and Date
      2015-09-06
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] GPU-Accelerated Digital Halftoning by the Local Exhaustive Search (2015)

    • Author(s)
      Hiroaki Kouge, Yasuaki Ito, Koji Nakano
    • Organizer
      International Symposium on Parallel and Distributed Computing
    • Place of Presentation
      Limassol, Cyprus
    • Year and Date
      2015-06-29
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Bulk GCD Computation Using a GPU to Break Weak RSA Keys (2015)

    • Author(s)
      Toru Fujita, Koji Nakano and Yasuaki Ito
    • Organizer
      Workshop on Advances in Parallel and Distributed Computational Models
    • Place of Presentation
      Hyderabad, India
    • Year and Date
      2015-05-25
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Optimality of Fundamental Parallel Algorithms on the Hierarchical Memory Machine, with GPU implementation (2015)

    • Author(s)
      Koji Nakano and Yasuaki Ito
    • Organizer
      International Conference on Parallel, Distributed and Network-Based Processing
    • Place of Presentation
      Turku, Finland
    • Year and Date
      2015-03-04 – 2015-03-06
    • Related Report
      2014 Annual Research Report
  • [Presentation] A Time Optimal Parallel Algorithm for the Dynamic Programming on the Hierarchical Memory Machine (2014)

    • Author(s)
      Koji Nakano
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Shizuoka, Japan
    • Year and Date
      2014-12-10 – 2014-12-12
    • Related Report
      2014 Annual Research Report
  • [Presentation] Thorough Evaluation of GPU Shared Memory Load and Store Instructions (2014)

    • Author(s)
      Satoshi Okamoto, Yasuaki Ito, Koji Nakano, Jacir L. Bordim
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Shizuoka, Japan
    • Year and Date
      2014-12-10 – 2014-12-12
    • Related Report
      2014 Annual Research Report
  • [Presentation] Parallel Algorithms for the Summed Area Table on the Asynchronous Hierarchical Memory Machine, with GPU implementations (2014)

    • Author(s)
      Akihiko Kasagi, Koji Nakano, Yasuaki Ito
    • Organizer
      International Symposium on Computing and Networking
    • Place of Presentation
      Minneapolis, USA
    • Year and Date
      2014-09-09 – 2014-09-12
    • Related Report
      2014 Annual Research Report
  • [Presentation] Random Address Permute Shift Technique for the Shared Memory on GPUs (2014)

    • Author(s)
      Koji Nakano, Susumu Matsumae, Yasuaki Ito
    • Organizer
      International Conference on Parallel Processing
    • Place of Presentation
      Minneapolis, USA
    • Year and Date
      2014-09-09 – 2014-09-12
    • Related Report
      2014 Annual Research Report
  • [Presentation] A GPU Implementation of Clipping-Free Halftoning using the Direct Binary Search (2014)

    • Author(s)
      Hiroaki Kouge, Yasuaki Ito and Koji Nakano
    • Organizer
      International Conference on Algorithms and Architectures for Parallel Processing
    • Place of Presentation
      Dalian, China
    • Year and Date
      2014-08-24 – 2014-08-27
    • Related Report
      2014 Annual Research Report
  • [Presentation] A CUDA C Program Generator for Bulk Execution of a Sequential Algorithm (2014)

    • Author(s)
      Daisuke Takafuji, Koji Nakano and Yasuaki Ito
    • Organizer
      International Conference on Algorithms and Architectures for Parallel Processing
    • Place of Presentation
      Dalian, China
    • Year and Date
      2014-08-24 – 2014-08-27
    • Related Report
      2014 Annual Research Report
  • [Presentation] Bulk Execution of Oblivious Algorithms on the Unified Memory Machine, with GPU Implementation (2014)

    • Author(s)
      Kazuya Tani, Daisuke Takafuji, Koji Nakano, Yasuaki Ito
    • Organizer
      International Parallel and Distributed Processing Symposium Workshops
    • Place of Presentation
      Phoenix, USA
    • Year and Date
      2014-05-19 – 2014-05-23
    • Related Report
      2014 Annual Research Report
  • [Funded Workshop] Sixth International Symposium on Computing and Networking (2018)

    • Related Report
      2018 Annual Research Report
  • [Funded Workshop] International Symposium on Networking and Computing (2015)

    • Place of Presentation
      Sapporo, Japan
    • Year and Date
      2015-12-08
    • Related Report
      2015 Annual Research Report

Published: 2014-04-04   Modified: 2020-03-30  