• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

文字列圧縮と組合せ論による大規模データ管理・処理技法の開発

Research Project

Project/Area Number 18F18120
Research Category

Grant-in-Aid for JSPS Fellows

Allocation TypeSingle-year Grants
Section外国
Research Field Theory of informatics
Research InstitutionTokyo Medical and Dental University (2020)
Kyushu University (2018-2019)

Principal Investigator

稲永 俊介 (2018-2019)  九州大学, システム情報科学研究院, 准教授 (60448404)

Co-Investigator(Kenkyū-buntansha) Koeppl Dominik  東京医科歯科大学, M&Dデータ科学センター, 助教 (50897395)
KOEPPL DOMINIK  九州大学, システム情報科学研究院, 外国人特別研究員
Project Period (FY) 2018-10-12 – 2021-03-31
Project Status Completed (Fiscal Year 2020)
Budget Amount *help
¥1,400,000 (Direct Cost: ¥1,400,000)
Fiscal Year 2020: ¥400,000 (Direct Cost: ¥400,000)
Fiscal Year 2019: ¥600,000 (Direct Cost: ¥600,000)
Fiscal Year 2018: ¥400,000 (Direct Cost: ¥400,000)
Keywordsdata structures / algorithms / lossless compression / hashing / アルゴリズム / データ構造 / 文字列データ処理 / tries / text indexing
Outline of Annual Research Achievements

The focus of this research was set on (a) practical and dynamic trie data structures, (b) the computation of the grammar compression Re-Pair in small space, and (c) advancements for the bijective Burrows-Wheeler transform (BBWT), a variant of the Burrows-Wheeler transform (BWT) well received in theory as well as in practice for indexing string data.
(a) We have devised a novel approach for compact hashing, which is the most memory-efficient approach in practice when working with a huge number of integer keys of a bounded domain. Based on this approach, we have proposed dynamic trie data structures working with path-decomposition or with trie compaction.
(b) Re-Pair, a grammar with high compression ratios, is difficult to compute within limited amount of memory. Here, we could find a quadratic time algorithm computing Re-Pair with almost no additional space. We also devised an index data structure build upon a grammar representing the Lyndon tree. This index exploits several properties of the Lyndon words to improve the running time of the currently fastest grammar index from a quadratic factor on the pattern length to a linear one.
(c) Finally, we could build an indexing data structure on top of the BBWT, compute the BBWT in-place or transform the BWT into the BBWT, and finally build the BBWT in linear time.
Asides from that, we could find space-efficient factorization algorithms for the non-overlapping LZ77 factorization and the LZ78 substring compression problem. These algorithms work in near-linear time with space asymptotic to the input text length in bits.

Research Progress Status

令和2年度が最終年度であるため、記入しない。

Strategy for Future Research Activity

令和2年度が最終年度であるため、記入しない。

Report

(3 results)
  • 2020 Annual Research Report
  • 2019 Annual Research Report
  • 2018 Annual Research Report
  • Research Products

    (43 results)

All 2021 2020 2019 Other

All Int'l Joint Research (11 results) Journal Article (18 results) (of which Int'l Joint Research: 17 results,  Peer Reviewed: 18 results,  Open Access: 9 results) Presentation (12 results) (of which Int'l Joint Research: 8 results,  Invited: 1 results) Remarks (2 results)

  • [Int'l Joint Research] TU Dortmund/German Aerospace Center/Universitaet Stuttgart(ドイツ)

    • Related Report
      2020 Annual Research Report
  • [Int'l Joint Research] Dalhousie University(カナダ)

    • Related Report
      2020 Annual Research Report
  • [Int'l Joint Research] University of Leicester/Aberystwyth University(英国)

    • Related Report
      2020 Annual Research Report
  • [Int'l Joint Research] University of Helsinki(フィンランド)

    • Related Report
      2020 Annual Research Report
  • [Int'l Joint Research] University of Chile(チリ)

    • Related Report
      2020 Annual Research Report
  • [Int'l Joint Research] TU Dortmund/Goethe University Frankfurt(ドイツ)

    • Related Report
      2019 Annual Research Report
  • [Int'l Joint Research] Helsinki University(フィンランド)

    • Related Report
      2019 Annual Research Report
  • [Int'l Joint Research] Nicolaus Copernicus University(ポーランド)

    • Related Report
      2019 Annual Research Report
  • [Int'l Joint Research] University of Leicester(英国)

    • Related Report
      2019 Annual Research Report
  • [Int'l Joint Research] Nicolaus Copernicus University(ポーランド)

    • Related Report
      2018 Annual Research Report
  • [Int'l Joint Research] University of Helsinki(フィンランド)

    • Related Report
      2018 Annual Research Report
  • [Journal Article] PHONI: Streamed Matching Statistics with Multi-Genome References2021

    • Author(s)
      Christina Boucher and Travis Gagie and Tomohiro I and Dominik Koeppl and Ben Langmead and Giovanni Manzini and Gonzalo Navarro and Alejandro Pacheco and Massimiliano Rossi
    • Journal Title

      Proc. DCC

      Volume: -

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Non-Overlapping LZ77 Factorization and LZ78 Substring Compression Queries with Suffix Trees2021

    • Author(s)
      Dominik Koeppl
    • Journal Title

      Algorithms

      Volume: 14(2) Issue: 2 Pages: 44-44

    • DOI

      10.3390/a14020044

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Fast and Simple Compact Hashing via Bucketing2020

    • Author(s)
      Dominik Koeppl and Simon J. Puglisi and Rajeev Raman
    • Journal Title

      Proc. SEA in LIPIcs

      Volume: 160

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Re-Pair in Small Space2020

    • Author(s)
      Dominik Koeppl and Tomohiro I and Isamu Furuya and Yoshimasa Takabatake and Kensuke Sakai and Keisuke Goto
    • Journal Title

      Proc. PSC

      Volume: -

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Re-Pair in Small Space (Poster)2020

    • Author(s)
      Dominik Koeppl and Tomohiro I and Isamu Furuya and Yoshimasa Takabatake and Kensuke Sakai and Keisuke Goto
    • Journal Title

      Proc. DCC

      Volume: - Pages: 377-377

    • DOI

      10.1109/dcc47342.2020.00092

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] c-Trie++: A Dynamic Trie Tailored for Fast Prefix Searches2020

    • Author(s)
      Tsuruta Kazuya, Koppl Dominik, Kanda Shunsuke, Nakashima Yuto, Inenaga Shunsuke, Bannai Hideo, Takeda Masayuki
    • Journal Title

      Proceedings of Data Compression Conference 2020

      Volume: 1 Pages: 243-252

    • DOI

      10.1109/dcc47342.2020.00032

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Computational Aspects of Ordered Integer Partition with Bounds2020

    • Author(s)
      Roland Glueck and Dominik Koeppl
    • Journal Title

      Algorithmica

      Volume: 82 Issue: 10 Pages: 2955-2984

    • DOI

      10.1007/s00453-020-00713-7

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] In-Place Bijective Burrows--Wheeler Transforms2020

    • Author(s)
      Dominik Koeppl and Daiki Hashimoto and Diptarama Hendrian and Ayumi Shinohara
    • Journal Title

      Proc. CPM in LIPIcs

      Volume: 161

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Deterministic Sparse Suffix Sorting in the Restore Model2020

    • Author(s)
      Johannes Fischer, Tomohiro I, Dominik Koeppl
    • Journal Title

      ACM Transactions on Algorithms

      Volume: 16(4) Issue: 4 Pages: 1-53

    • DOI

      10.1145/3398681

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Grammar-compressed Self-index with Lyndon Words2020

    • Author(s)
      Kazuya Tsuruta and Dominik Koeppl and Yuto Nakashima and Shunsuke Inenaga and Hideo Bannai and Masayuki Takeda
    • Journal Title

      IPSJ TOM

      Volume: 13

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Dynamic Path-Decomposed Tries2020

    • Author(s)
      Shunsuke Kanda and Dominik Koeppl and Yasuo Tabei and Kazuhiro Morita and Masao Fuketa
    • Journal Title

      ACM JEA

      Volume: 25 Pages: 1-28

    • DOI

      10.1145/3418033

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Space-efficient algorithms for computing minimal/shortest unique substrings2020

    • Author(s)
      Mieno Takuya, Koppl Dominik, Nakashima Yuto, Inenaga Shunsuke, Bannai Hideo, Takeda Masayuki
    • Journal Title

      Theoretical Computer Science

      Volume: 845 Pages: 230-242

    • DOI

      10.1016/j.tcs.2020.09.017

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] On Arithmetically Progressed Suffix Arrays2020

    • Author(s)
      Jacqueline W. Daykin and Dominik Koeppl and David Kuebel and Florian Stober
    • Journal Title

      Proc. PSC

      Volume: -

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Re-Pair in Small Space2020

    • Author(s)
      Dominik Koeppl, Tomohiro I, Isamu Furuya, Yoshimasa Takabatake, Kensuke Sakai, Keisuke Goto,
    • Journal Title

      Algorithms

      Volume: 14(1) Issue: 1 Pages: 1-20

    • DOI

      10.3390/a14010005

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Indexing the Bijective BWT2019

    • Author(s)
      Hideo Bannai, Juha Karkkainen, Dominik Koeppl, Marcin Piatkowski
    • Journal Title

      Proceedings of the 30th Annual Symposium on Combinatorial Pattern Matching (CPM 2019)

      Volume: LIPIcs 128

    • DOI

      10.4230/LIPICS.CPM.2019.17

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Bidirectional Text Compression in External Memory2019

    • Author(s)
      Patrick Dinklage, Jonas Ellert, Johannes Fischer, Dominik Koeppl, Manuel Penschuck
    • Journal Title

      Proceedings of the 27th Annual European Symposium on Algorithms - ESA 2019

      Volume: 144 in LIPIcs series

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Compact data structure for shortest unique substring queries2019

    • Author(s)
      Takuya Mieno, Dominik Koeppl, Yuto Nakashima, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda
    • Journal Title

      Proceedings of 26th International Symposium on String Processing and Information Retrieval, Lecture Notes in Computer Science

      Volume: 11811 Pages: 107-123

    • DOI

      10.1007/978-3-030-32686-9_8

    • ISBN
      9783030326852, 9783030326869
    • Related Report
      2019 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Indexing the Bijective BWT2019

    • Author(s)
      Hideo Bannai, Juha Karkkainen, Dominik Koeppl and Marcin Piatkowski
    • Journal Title

      Proc. 30th Annual Symposium on Combinatorial Pattern Matching (CPM 2019)

      Volume: to appear

    • Related Report
      2018 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Presentation] PHONI: Streamed Matching Statistics with Multi-Genome References2021

    • Author(s)
      Dominik Koeppl
    • Organizer
      DCC
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Fast and Simple Compact Hashing via Bucketing2020

    • Author(s)
      Dominik Koeppl
    • Organizer
      SEA
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Re-Pair in Small Space2020

    • Author(s)
      Dominik Koeppl
    • Organizer
      PSC
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] c-Trie++: A Dynamic Trie Tailored for Fast Prefix Searches2020

    • Author(s)
      Kazuya Tsuruta and Dominik Koeppl
    • Organizer
      DCC
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] In-Place Bijective Burrows--Wheeler Transforms2020

    • Author(s)
      Dominik Koeppl
    • Organizer
      CPM
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Constructing the Bijective BWT2020

    • Author(s)
      Dominik Koeppl
    • Organizer
      The 28th London Stringology Days & London Algorithmic Workshop - LAWS&LSD 2020
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] In-Place Bijective Burrows Wheeler Transformations2020

    • Author(s)
      Dominik Koeppl, Daiki Hashimoto, Diptarama Hendrian and Ayumi Shinohara
    • Organizer
      Data Structures in Bioinformatics workshop - DSB2020
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Constructing the Bijective BWT2019

    • Author(s)
      Hideo Bannai, Juha Karkkainen, Dominik Koeppl, Marcin Piatkowski
    • Organizer
      175th アルゴリズム研究会, 2019
    • Related Report
      2019 Annual Research Report
  • [Presentation] Dominik Koeppl, 井 智弘, 古谷 勇, 高畠 嘉将, 酒井 健輔, 後藤 啓介2019

    • Author(s)
      Re-Pair In-Place
    • Organizer
      LA Symposium Summer 2019
    • Related Report
      2019 Annual Research Report
  • [Presentation] Separate Chaining Meets Compact Hashing2019

    • Author(s)
      Dominik Koeppl
    • Organizer
      173th アルゴリズム研究会
    • Related Report
      2019 Annual Research Report
  • [Presentation] Dynamic Trie Tailored for Fast Prefix Searches2019

    • Author(s)
      鶴田 和弥, Dominik Koeppl, 神田 峻介, 中島 祐人, 稲永 俊介, 坂内 英夫, 竹田 正幸
    • Organizer
      LA Symposium Summer 2019
    • Related Report
      2019 Annual Research Report
  • [Presentation] Searching Patterns in the Bijective BWT2019

    • Author(s)
      Dominik Koeppl
    • Organizer
      Dagstuhl Seminar 19241 "25 Years of the Burrows-Wheeler Transform
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Remarks] personal homepage

    • URL

      https://dkppl.de/

    • Related Report
      2020 Annual Research Report
  • [Remarks] Homepage of Dominik Koeppl

    • URL

      https://dkppl.de/

    • Related Report
      2019 Annual Research Report

URL: 

Published: 2018-10-15   Modified: 2024-03-26  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi