• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2019 Fiscal Year Annual Research Report

文字列圧縮と組合せ論による大規模データ管理・処理技法の開発

Research Project

Project/Area Number 18F18120
Research InstitutionKyushu University

Principal Investigator

稲永 俊介  九州大学, システム情報科学研究院, 准教授 (60448404)

Co-Investigator(Kenkyū-buntansha) KOEPPL DOMINIK  九州大学, システム情報科学研究院, 外国人特別研究員
Project Period (FY) 2018-10-12 – 2021-03-31
Keywordsdata structures / algorithms / lossless compression / text indexing
Outline of Annual Research Achievements

One of the major steps towards practically improved data structures was an in-depth analysis of hash tables. Here, we have worked with Shunsuke Kanda and Katsuya Tsuruta on different trie data structures employing hash tables in a clever way to speed up queries, or slim down their space usage. On a more general topic, I (Koeppl) could devise together with Rajeev Raman and Simon Puglisi two compact hash tables, which are optimized for fast construction while using less memory than any other known hash table. These hash tables help to improve associative containers in situations where insertion of big data is the most vital operation. The work with Shunsuke Kanda et al. has been sent to a journal, the work with Katsuya Tsuruta et al. got accepted at DCC'2020, and the work with Rajeev Raman and Simon Puglisi got accepted at SEA'2020.

I (Koeppl) set another research focus on the bijective Burrows-Wheeler transform (BBWT) [Gil and Scott, arXiv 2012]. Here, we devised a self-index on the BBWT, resulting into a conference paper at CPM'2019. Next, we found a connection between the BBWT and suffix sorting, resulting into a linear-time construction algorithm. We published this result on arXiv, and plan to submit the results combined with practical evaluations. To further understand the relation between the BBWT and BWT, together with researcher of Prof. Ayumi Shinohara's laboratory at Touhoku University, we studied conversions between these two transformations, and got the discoveries of this study accepted at CPM'2020.

Current Status of Research Progress
Current Status of Research Progress

2: Research has progressed on the whole more than it was originally planned.

Reason

It is hard to judge whether the current status is delayed or in schedule. Most recent results have been accepted at conferences (twice in DCC 2020, once in CPM 2020, and once in SEA 2020), but there are not yet any proceedings available. I do not think that any of the journal articles I submitted with my colleagues during the JSPS program will get published before the scholarship ends, as the journal publication process in theoretical computer science, especially in renominated journals like Algorithmica or TCS, takes unfortunately very long time. The current results also spark new research questions, which I probably cannot completely answer during the JSPS program.
Overall, I am satisfied with the current research status, and I am confident that the achievements during the two years program will be considered as worthwhile.

Strategy for Future Research Activity

For the following period of six months, I have two projects in mind.
The first is to analyze different tools to speed up and slim down the Lempel-Ziv 78 factorization for which we have elaborated the main tools such as a compact hash table (i.e., the SEA'2020 publication). The plan is to elaborate an exhaustive study submit-able to a journal.
The second is to find new possibilities in indexing integer and real matrices within compressed space. The aim is to augment the computed grammar with an indexing data structure for accelerating common matrix operations such as multiplication. There are currently no sophisticated approaches in how to exploit two-dimensional data by means of a grammar sufficiently. The first objective would be to propose an approach that exploits the shape of the two-dimensional data in such a way that the grammar is much smaller than a string grammar built on the serialization of a matrix. The second objective would be to propose an indexing data structure for common matrix operations that needs less space than the plain matrix while performing an operation faster.
Another line of research in this topic is to study ways of computing already proposed grammars in less time, ideally in optimal time in the word-packing model.

  • Research Products

    (15 results)

All 2020 2019 Other

All Int'l Joint Research (4 results) Journal Article (3 results) (of which Int'l Joint Research: 2 results,  Peer Reviewed: 3 results,  Open Access: 2 results) Presentation (7 results) (of which Int'l Joint Research: 3 results,  Invited: 1 results) Remarks (1 results)

  • [Int'l Joint Research] TU Dortmund/Goethe University Frankfurt(ドイツ)

    • Country Name
      GERMANY
    • Counterpart Institution
      TU Dortmund/Goethe University Frankfurt
  • [Int'l Joint Research] Helsinki University(フィンランド)

    • Country Name
      FINLAND
    • Counterpart Institution
      Helsinki University
  • [Int'l Joint Research] Nicolaus Copernicus University(ポーランド)

    • Country Name
      POLAND
    • Counterpart Institution
      Nicolaus Copernicus University
  • [Int'l Joint Research] University of Leicester(英国)

    • Country Name
      UNITED KINGDOM
    • Counterpart Institution
      University of Leicester
  • [Journal Article] Indexing the Bijective BWT2019

    • Author(s)
      Hideo Bannai, Juha Karkkainen, Dominik Koeppl, Marcin Piatkowski
    • Journal Title

      Proceedings of the 30th Annual Symposium on Combinatorial Pattern Matching - CPM 2019

      Volume: 128 in LIPIcs series Pages: 17:1-17:14

    • DOI

      https://doi.org/10.4230/LIPIcs.CPM.2019.17

    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Bidirectional Text Compression in External Memory2019

    • Author(s)
      Patrick Dinklage, Jonas Ellert, Johannes Fischer, Dominik Koeppl, Manuel Penschuck
    • Journal Title

      Proceedings of the 27th Annual European Symposium on Algorithms - ESA 2019

      Volume: 144 in LIPIcs series Pages: 41:1-41:16

    • DOI

      https://doi.org/10.4230/LIPIcs.ESA.2019.41

    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Compact Data Structures for Shortest Unique Substring Queries2019

    • Author(s)
      Takuya Mieno, Dominik Koeppl, Yuto Nakashima, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda
    • Journal Title

      Proceedings of the 26th International Symposium on String Processing and Information Retrieval - SPIRE 2019

      Volume: 11811 in LNCS Pages: 107-123

    • DOI

      https://doi.org/10.1007/978-3-030-32686-9_8

    • Peer Reviewed
  • [Presentation] Constructing the Bijective BWT2020

    • Author(s)
      Dominik Koeppl
    • Organizer
      The 28th London Stringology Days & London Algorithmic Workshop - LAWS&LSD 2020
    • Int'l Joint Research
  • [Presentation] In-Place Bijective Burrows Wheeler Transformations2020

    • Author(s)
      Dominik Koeppl, Daiki Hashimoto, Diptarama Hendrian and Ayumi Shinohara
    • Organizer
      Data Structures in Bioinformatics workshop - DSB2020
    • Int'l Joint Research
  • [Presentation] Constructing the Bijective BWT2019

    • Author(s)
      Hideo Bannai, Juha Karkkainen, Dominik Koeppl, Marcin Piatkowski
    • Organizer
      175th アルゴリズム研究会, 2019
  • [Presentation] Dominik Koeppl, 井 智弘, 古谷 勇, 高畠 嘉将, 酒井 健輔, 後藤 啓介2019

    • Author(s)
      Re-Pair In-Place
    • Organizer
      LA Symposium Summer 2019
  • [Presentation] Separate Chaining Meets Compact Hashing2019

    • Author(s)
      Dominik Koeppl
    • Organizer
      173th アルゴリズム研究会
  • [Presentation] Dynamic Trie Tailored for Fast Prefix Searches2019

    • Author(s)
      鶴田 和弥, Dominik Koeppl, 神田 峻介, 中島 祐人, 稲永 俊介, 坂内 英夫, 竹田 正幸
    • Organizer
      LA Symposium Summer 2019
  • [Presentation] Searching Patterns in the Bijective BWT2019

    • Author(s)
      Dominik Koeppl
    • Organizer
      Dagstuhl Seminar 19241 "25 Years of the Burrows-Wheeler Transform
    • Int'l Joint Research / Invited
  • [Remarks] Homepage of Dominik Koeppl

    • URL

      https://dkppl.de/

URL: 

Published: 2021-01-27  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi