• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

大量データ処理のための領域効率の良いアルゴリズム

Research Project

Project/Area Number 16092222
Research Category

Grant-in-Aid for Scientific Research on Priority Areas

Allocation TypeSingle-year Grants
Review Section Science and Engineering
Research InstitutionKyushu University

Principal Investigator

定兼 邦彦  Kyushu University, 大学院・システム情報科学研究院, 准教授 (20323090)

Co-Investigator(Kenkyū-buntansha) 山下 雅史  九州大学, 大学院・システム情報科学研究院, 教授 (00135419)
小野 廣隆  九州大学, 大学院・システム情報科学研究院, 助教 (00346826)
Project Period (FY) 2004 – 2007
Project Status Completed (Fiscal Year 2007)
Budget Amount *help
¥13,000,000 (Direct Cost: ¥13,000,000)
Fiscal Year 2007: ¥3,200,000 (Direct Cost: ¥3,200,000)
Fiscal Year 2006: ¥3,400,000 (Direct Cost: ¥3,400,000)
Fiscal Year 2005: ¥3,500,000 (Direct Cost: ¥3,500,000)
Fiscal Year 2004: ¥2,900,000 (Direct Cost: ¥2,900,000)
Keywords領域計算量 / グラフ探索 / 簡潔データ構造 / データ圧縮 / エントロピー / スケールフリーグラフ / 逆探索 / 情報検索 / データ構造 / 接尾辞配列 / 圧縮接尾辞配列 / 文字列検索 / 文字列圧縮 / 省スペース / オンラインアルゴリズム / Web検索 / 並列化
Research Abstract

本研究では大量データ処理のための領域効率の良いアルゴリズムの開発を行った.大量データ処理を行う際の問題点としては,処理時間だけではなく,データの格納やアルゴリズムの動作に必要な記憶領域の大きさがある.よって本研究では領域効率の良いアルゴリズムの開発に重点を置き,以下のアブローチを行った.(1)データを圧縮する:データ自身やデータ構造を圧縮して保存する簡潔データ構造を開発し,少ないメモリで大量のデータを扱えるようにした.(2)データの一部分のみをメモリに格納するアルゴリズム:全体のデータ量は多いが,同時にはその限られた部分のデータのみを使用するようなアルゴリズムならば限られたメモリでも動作する.本研究では特に,グラフの探索を行う省スペースアルゴリズムを開発した.その結果,大量データ処理のための基礎となるアルゴリズムとデータ構造を開発できた.
全てのデータは文字列とみなす事ができる.その文字列の長さをn,文字のアルファベットサイズをσとすると,データの圧縮前のサイズはnlog_2 σビットとなる.通常のデータ圧縮法ではこの文字列をnH_k+o(n log σ)ビットに圧縮するが,その一部分のみアクセスしたい場合でも圧縮されたデータ全体を復元する必要がある.本研究の圧縮法では,圧縮された文字列の任意の部分を高速に復元できる.具体的には,文字列中の位置が指定されたとき,その位置から始まる部分文字列で長さがlog nビットのものを定数時間で復元することができる.圧縮サイズはnH_k+o(n log σ)ビットであり従来法と漸近的に等しい.

Report

(4 results)
  • 2007 Annual Research Report
  • 2006 Annual Research Report
  • 2005 Annual Research Report
  • 2004 Annual Research Report
  • Research Products

    (23 results)

All 2008 2007 2006 2005 2004 Other

All Journal Article (21 results) (of which Peer Reviewed: 3 results) Presentation (1 results) Remarks (1 results)

  • [Journal Article] Compressed indexes for dynamic text collections2008

    • Author(s)
      H-L, Chan・W-K, Hon・T-W, Lam・K., Sadakane
    • Journal Title

      ACM Transactions on Algorithms 3(2)(in press)

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Compressed Dynamic Tries with Applications to LZ-Compression in Sublinear Time and Space2007

    • Author(s)
      Jesper, Jansson・Kunihiko, Sadakane・Wing-Kin, Sung:
    • Journal Title

      International Conference on the Foundations of Software Technology and Theoretical Computer Science LNCS 4855

      Pages: 424-435

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Space and Time Efficient Algorithm for Constructing Compressed Suffix Arrays2007

    • Author(s)
      W-K., Hon・T-W, Lam・K., Sadakane・W-K, Sung・S-M, Yiu:
    • Journal Title

      Algorithmica 48(1)

      Pages: 23-36

    • Related Report
      2007 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Ultra-succinct Representation of Ordered Trees2007

    • Author(s)
      Jesper Jansson, Kunihiko Sadakane, Wing-Kin Sung
    • Journal Title

      Proceedings of ACM-SIAM Symposium on Discrete Algorithms

      Pages: 575-584

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Forest Search : A Paradigm for Faster Exploration of Scale-Free Networks2006

    • Author(s)
      Y.Kurumida, H.Ono, K.Sadakane, M.Yamashita
    • Journal Title

      Proc. of the 4th International Symposium on Parallel and Distributed Processing and Applications LNCS 4330

      Pages: 39-50

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Reducing the space requirement of LZ-index2006

    • Author(s)
      D.Arroyuelo, G.Navarro, K.Sadakane
    • Journal Title

      Proceedings of Combinatorial Pattern Matching LNCS 4009

      Pages: 307-318

    • Related Report
      2006 Annual Research Report
  • [Journal Article] A Probabilistic Model of the DNA Conformational Change2006

    • Author(s)
      M.Shiozaki, H.Ono, K.Sadakane, M.Yamashita
    • Journal Title

      Proceedings of DNA LNCS 4287

      Pages: 274-285

    • NAID

      120006655004

    • Related Report
      2006 Annual Research Report
  • [Journal Article] DNA Sequence Design by Dynamic Neighborhood Searches2006

    • Author(s)
      S.Kawashimo, H.Ono, K.Sadakane, M.Yamashita
    • Journal Title

      Proceedings of DNA LNCS 4287

      Pages: 157-171

    • NAID

      110004741455

    • Related Report
      2006 Annual Research Report
  • [Journal Article] A generic search strategy for large-scale real-world networks2006

    • Author(s)
      Y.Kurumida, T.Ogata, H.Ono, K.Sadakane, M.Yamashita
    • Journal Title

      Proc. INFOSCALE, ACM International Conference Proceeding Series Vol. 152

      Pages: 53-53

    • NAID

      120006654471

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Rooted Maximum Agreement Supertrees2005

    • Author(s)
      J.Jansson, H.-K.Ng, K.Sadakane, W.-K.Sung
    • Journal Title

      Algorithmica Vol.43, No.4

      Pages: 293-307

    • Related Report
      2005 Annual Research Report
  • [Journal Article] Energy-Optimal Online Algorithms for Broadcasting in Wireless Networks2005

    • Author(s)
      S.Kutten, H.Ono, D.Peleg, K.Sadakane, M.Yamashita
    • Journal Title

      IEEE The Second Annual Conference on Wireless On demand Network Systems and Services

      Pages: 125-130

    • Related Report
      2005 Annual Research Report
  • [Journal Article] A Local Search Based Barrier Height Estimation Algorithm for DNA Molecular Transitions2005

    • Author(s)
      T.Takeda, H.Ono, K.Sadakane, M.Yamashita
    • Journal Title

      Preproceedings of DNA Computing, 11th International Workshop on DNA Based Computers

      Pages: 354-365

    • Related Report
      2005 Annual Research Report
  • [Journal Article] Energy-Optimal Online Algorithms for Broadcasting in Wireless Networks2005

    • Author(s)
      S.Kutten, H.Ono, D.Peleg, K.Sadakane, M.Yamashita
    • Journal Title

      Second Annual Conference on Wireless On-demand Network Systems and Services, Proceedings, IEEE Computer Society

      Pages: 125-160

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Dynamic Dictionary Matching and Compressed Suffix Trees2005

    • Author(s)
      Ho-Leung Chan, Wing-Kai Hon, Tak-Wah Lam, Kunihiko Sadakane
    • Journal Title

      ACM-SIAM Symposium on Discrete Algorithms PR118

      Pages: 13-22

    • Related Report
      2004 Annual Research Report
  • [Journal Article] A Web Scoring Method for Local Web Search Engines2004

    • Author(s)
      Yohei Ikawa, Kunihiko Sadakane
    • Journal Title

      Database Systems for Advances Applications, 9th International Comference, DASFAA 2004, Lecture notes in Computer Science 2973

      Pages: 606-617

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Neighborhood Composition : A Parallelization of Local Search Algorithms2004

    • Author(s)
      Yuichi Handa, Hirotaka Ono, Kunihiko Sadakane, Masafumi Yamashita
    • Journal Title

      Recent Advances in Parallel Virtual Machine and Message Passing Interface, Proceedings. Lecture Notes in Computer Science 3241

      Pages: 155-163

    • NAID

      130004607933

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Advantages of Backward Searching - Efficient Secondary Memory and Distributed Implementation of Compressed Suffix Arrays2004

    • Author(s)
      Veli Makinen, Gonzalo Navarro, Kunihiko Sadakane
    • Journal Title

      15th International Symposium on Algorithms and Computation. Lecture Notes in Computer Science 3241

      Pages: 681-692

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Rooted Maximum Agreement Supertrees2004

    • Author(s)
      Jesper Jansson, Joseph H.-K.Ng, Kunihiko Sadakane, Wing-Kin Sung
    • Journal Title

      Theoretical Informatics, 6th Latin American Symposium, Proceedings. Lecture Notes in Computer Science 2976

      Pages: 499-508

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Succinct Data Structures for Flexible Text Retrieval Systems

    • Author(s)
      K.Sadakane
    • Journal Title

      Journal of Discrete Algorithms (accepted)

    • Related Report
      2005 Annual Research Report
  • [Journal Article] Compressed Indexes for Dynamic Text Collections

    • Author(s)
      H.-L.Chan, W.-K.Hon, T.-W.Lam, K.Sadakane
    • Journal Title

      ACM Transactions on Algorithms (accepted)

    • Related Report
      2005 Annual Research Report
  • [Journal Article] A Space and Time Efficient Algorithm for Constructing Compressed Suffix Arrays

    • Author(s)
      W.-K.Hon, T.-W.Lam, K.Sadakane, W.-K.Sung, S.M.Yiu
    • Journal Title

      Algorithmica (accepted)

    • Related Report
      2005 Annual Research Report
  • [Presentation] Ultra-succinct representation of ordered trees2007

    • Author(s)
      Kunihiko, Sadakane
    • Organizer
      ACM-SIAM Symposium on Discrete Algorithms
    • Place of Presentation
      アメリカ, ニューオーリンズ
    • Year and Date
      2007-01-08
    • Related Report
      2007 Annual Research Report
  • [Remarks]

    • URL

      http://tcslab.csce.kyushu-u.ac.jp/~sada/

    • Related Report
      2007 Annual Research Report

URL: 

Published: 2004-04-01   Modified: 2018-03-28  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi