Resource-Constraint Privacy-Aware Data Structures Tackling Problems in Bioinformatics

Publicly Offered Research

Project Area Creation and Organization of Innovative Algorithmic Foundations for Leading Social Innovations
Project/Area Number 21H05847
Research Category

Grant-in-Aid for Transformative Research Areas (A)

Allocation Type Single-year Grants
Review Section Transformative Research Areas, Section (IV)
Research Institution Tokyo Medical and Dental University

Principal Investigator

Koeppl Dominik  Tokyo Medical and Dental University, M&D Data Science Center, Assistant Professor (50897395)

Project Period (FY) 2021-09-10 – 2023-03-31
Project Status Completed (Fiscal Year 2022)
Budget Amount
¥5,200,000 (Direct Cost: ¥4,000,000, Indirect Cost: ¥1,200,000)
Fiscal Year 2022: ¥2,600,000 (Direct Cost: ¥2,000,000, Indirect Cost: ¥600,000)
Fiscal Year 2021: ¥2,600,000 (Direct Cost: ¥2,000,000, Indirect Cost: ¥600,000)
Keywords data compression / genetic data indexes / resource constraints / text indexing / matching statistics / parameterized matching / suffix array access / privacy-aware computing / factorization algorithms / LZ78 compression / lexicographic parse / sparse suffix sorting / grammar compression / compressed data / memory-efficiency / hashing / biological data indexing / space-efficiency / privacy-aware / lossless compression / compressed indexing
Outline of Research at the Start

Recent advances in technology have made it possible to collect vast amounts of biological data that are valuable for studying genetic diseases and devising individually targeted therapies. Unfortunately, while the collection of such data has gained considerable momentum, we are unaware of solutions that can process the collected data efficiently while supporting biologically important queries under the restriction that privacy is respected. Such a solution could make it possible to gain insights into diseases and into side effects of medical treatments caused by genetic variations.

Outline of Annual Research Achievements

To make the indexing of biological data practical, we presented two new approaches at SPIRE'22. The first augments the r-index to speed up random access to the suffix array. Such access is usually implemented by applying the Phi array sequentially, where the applications rely on predecessor queries, and this has proven slow in practice. We slightly improved the access time by simulating the predecessor queries with a walk on a labelled graph, which lets us omit some of them. The second approach concerns parameterized pattern matching, an extension of classic pattern matching. Here, we proposed the first efficient algorithm for computing the parameterized Burrows-Wheeler transform online.
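As a rough illustration of the sequential Phi-array access mentioned above, the following Python sketch recovers a suffix array entry from a sparse set of samples by repeated application of Phi, where Phi[SA[i]] = SA[i-1]. It is a simplified sketch, not the data structure from the SPIRE'22 paper: the function names, the naive suffix sorting, the sampling rate, and the explicit (uncompressed) storage of Phi are all assumptions made for readability. An actual r-index evaluates Phi with predecessor queries over run boundaries instead of storing it as a plain array.

```python
def suffix_array(text):
    # Naive construction by sorting all suffixes; suitable for tiny inputs only.
    return sorted(range(len(text)), key=lambda i: text[i:])

def phi_array(sa):
    # phi[SA[i]] = SA[i-1] for i > 0; the lexicographically smallest
    # suffix has no predecessor, so its entry stays None.
    phi = [None] * len(sa)
    for i in range(1, len(sa)):
        phi[sa[i]] = sa[i - 1]
    return phi

def access_sa(i, samples, phi):
    # samples maps some ranks j to SA[j] and must contain the last rank.
    # Start at the nearest sampled rank j >= i and apply phi (j - i) times.
    j = min(r for r in samples if r >= i)
    value = samples[j]
    for _ in range(j - i):
        value = phi[value]
    return value

text = "banana$"
sa = suffix_array(text)
phi = phi_array(sa)
# Hypothetical sampling: every third rank plus the last rank.
samples = {j: sa[j] for j in range(0, len(sa), 3)}
samples[len(sa) - 1] = sa[-1]
print(sa)                          # [6, 5, 3, 1, 0, 4, 2]
print(access_sa(4, samples, phi))  # 0
```

Each step of the loop corresponds to one Phi application, which is why long walks between samples become the cost that the approach above tries to reduce.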
For computing matching statistics, we improved the practical running time by augmenting the r-index with two helper data structures: a grammar supporting longest common extension (LCE) queries, and the thresholds array. Bannai et al. [TCS'20] showed how to compute matching statistics with the r-index. We provided two successive improvements: first with a software called PHONI two years ago, and recently with a practical refinement that skips some LCE queries by storing additional LCE values at the thresholds. The small increase in space is justified by a marked improvement in query time, since the LCE queries answered by the grammar tend to be the bottleneck of the whole algorithm.
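To make the notion of matching statistics concrete, the sketch below computes them directly from the definition: for each pattern position i, the length of the longest prefix of pattern[i:] that occurs somewhere in the text, together with one position where it occurs. This is a quadratic-time reference implementation only, not PHONI and not the r-index-based method. The function names are hypothetical, and the character-by-character `lce` helper merely stands in for the grammar-based LCE queries described above.

```python
def lce(a, i, b, j):
    # Longest common extension: length of the longest common prefix
    # of a[i:] and b[j:], computed by plain character comparison.
    n = 0
    while i + n < len(a) and j + n < len(b) and a[i + n] == b[j + n]:
        n += 1
    return n

def matching_statistics(text, pattern):
    # ms[i] = (length, position): the longest prefix of pattern[i:] that
    # occurs in text, and the leftmost text position achieving that length
    # (position is -1 if not even the first character occurs).
    ms = []
    for i in range(len(pattern)):
        best_len, best_pos = 0, -1
        for j in range(len(text)):
            length = lce(pattern, i, text, j)
            if length > best_len:
                best_len, best_pos = length, j
        ms.append((best_len, best_pos))
    return ms

result = matching_statistics("banana", "nabana")
print([length for length, _ in result])  # [2, 1, 4, 3, 2, 1]
```

Every inner iteration issues one LCE query, which illustrates why avoiding LCE queries, as in the refinement described above, can reduce the query time.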

Research Progress Status

Not entered because fiscal year 2022 (Reiwa 4) is the final fiscal year.

Strategy for Future Research Activity

Not entered because fiscal year 2022 (Reiwa 4) is the final fiscal year.

Report

(2 results)
  • 2022 Annual Research Report
  • 2021 Annual Research Report
Research Products

(39 results)

Int'l Joint Research (8 results) / Journal Article (22 results, of which Int'l Joint Research: 22 results, Peer Reviewed: 22 results, Open Access: 12 results) / Presentation (7 results, of which Int'l Joint Research: 1 result) / Remarks (2 results)

  • [Int'l Joint Research] University of Florida/Illumina Inc (USA)

    • Related Report
      2022 Annual Research Report
  • [Int'l Joint Research] University of Pisa (Italy)

    • Related Report
      2022 Annual Research Report
  • [Int'l Joint Research] Lodz University of Technology/University of Piemonte Orientale (Poland)

    • Related Report
      2022 Annual Research Report
  • [Int'l Joint Research] Nicolaus Copernicus University (Poland)

    • Related Report
      2021 Annual Research Report
  • [Int'l Joint Research] University of Glasgow/University of Leicester (UK)

    • Related Report
      2021 Annual Research Report
  • [Int'l Joint Research] Millennium Institute/Tecnica Federico Santa Maria/University of Chile (Chile)

    • Related Report
      2021 Annual Research Report
  • [Int'l Joint Research] Baker Heart and Diabetes Institute (Australia)

    • Related Report
      2021 Annual Research Report
  • [Int'l Joint Research] National Tsing Hua University (Taiwan)

    • Related Report
      2021 Annual Research Report
  • [Journal Article] Dynamic Skyline Computation with LSD Trees (2023)

    • Author(s)
      Dominik Koeppl
    • Journal Title

      Analytics

      Volume: 2 Issue: 1 Pages: 146-162

    • DOI

      10.3390/analytics2010009

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Space-efficient Huffman codes revisited (2023)

    • Author(s)
      Szymon Grabowski and Dominik Koeppl
    • Journal Title

      Information Processing Letters

      Volume: 179 Pages: 1-8

    • DOI

      10.1016/j.ipl.2022.106274

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Graph Compression for Adjacency-Matrix Multiplication (2022)

    • Author(s)
      Alexandre P. Francisco and Travis Gagie and Dominik Koeppl and Susana Ladra and Gonzalo Navarro
    • Journal Title

      SN Computer Science

      Volume: 3 Issue: 3 Pages: 1-8

    • DOI

      10.1007/s42979-022-01084-2

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Computing Longest (Common) Lyndon Subsequences (2022)

    • Author(s)
      Hideo Bannai, Tomohiro I, Tomasz Kociumaka, Dominik Koeppl, Simon J. Puglisi
    • Journal Title

      Proc. 33rd International Workshop on Combinatorial Algorithms (IWOCA) 2022

      Volume: - Pages: 128-142

    • DOI

      10.1007/978-3-031-06678-8_10

    • ISBN
      9783031066771, 9783031066788
    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Space-Efficient B Trees via Load-Balancing (2022)

    • Author(s)
      Tomohiro I, Dominik Koeppl
    • Journal Title

      Proc. 33rd International Workshop on Combinatorial Algorithms (IWOCA) 2022

      Volume: - Pages: 327-340

    • DOI

      10.1007/978-3-031-06678-8_24

    • ISBN
      9783031066771, 9783031066788
    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Linking Off-Road Points to Routing Networks (2022)

    • Author(s)
      Dominik Koeppl
    • Journal Title

      Algorithms

      Volume: 15 Issue: 5 Pages: 1-15

    • DOI

      10.3390/a15050163

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Fast and Simple Compact Hashing via Bucketing (2022)

    • Author(s)
      Dominik Koeppl and Simon J. Puglisi and Rajeev Raman
    • Journal Title

      Algorithmica

      Volume: 84 Issue: 9 Pages: 2735-2766

    • DOI

      10.1007/s00453-022-00996-y

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Computing the Parameterized Burrows-Wheeler Transform Online (2022)

    • Author(s)
      Daiki Hashimoto and Diptarama Hendrian and Dominik Koeppl and Ryo Yoshinaka and Ayumi Shinohara
    • Journal Title

      Proceedings of SPIRE

      Volume: 13617 Pages: 70-85

    • DOI

      10.1007/978-3-031-20643-6_6

    • ISBN
      9783031206429, 9783031206436
    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Accessing the Suffix Array via $\phi^{-1}$-Forest (2022)

    • Author(s)
      Christina Boucher and Dominik Koeppl and Herman Perera and Massimiliano Rossi
    • Journal Title

      Proceedings of SPIRE

      Volume: 13617 Pages: 86-98

    • DOI

      10.1007/978-3-031-20643-6_7

    • ISBN
      9783031206429, 9783031206436
    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Computing NP-hard Repetitiveness Measures via MAX-SAT (2022)

    • Author(s)
      Hideo Bannai and Keisuke Goto and Masakazu Ishihata and Shunsuke Kanda and Dominik Koeppl and Takaaki Nishimoto
    • Journal Title

      Proceedings of ESA

      Volume: 244

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices (2022)

    • Author(s)
      Paolo Ferragina and Giovanni Manzini and Travis Gagie and Dominik Koeppl and Gonzalo Navarro and Manuel Striani and Francesco Tosoni
    • Journal Title

      Proc. VLDB

      Volume: 15 Issue: 10 Pages: 2175-2187

    • DOI

      10.14778/3547305.3547321

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] FM-Indexing Grammars Induced by Suffix Sorting for Long Patterns (2022)

    • Author(s)
      Jin Jie Deng and Wing-Kai Hon and Dominik Koeppl and Kunihiko Sadakane
    • Journal Title

      Proc. DCC

      Volume: 2022 Pages: 63-72

    • DOI

      10.1109/dcc52660.2022.00014

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] HOLZ: High-Order Entropy Encoding of Lempel-Ziv Factor Distances (2022)

    • Author(s)
      Dominik Koeppl and Gonzalo Navarro and Nicola Prezza
    • Journal Title

      Proc. DCC

      Volume: 2022 Pages: 83-92

    • DOI

      10.1109/dcc52660.2022.00016

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Computing Lexicographic Parsings (2022)

    • Author(s)
      Koeppl Dominik
    • Journal Title

      Proc. DCC

      Volume: 2022 Pages: 232-241

    • DOI

      10.1109/dcc52660.2022.00031

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Inferring Spatial Distance Rankings with Partial Knowledge on Routing Networks (2022)

    • Author(s)
      Koeppl Dominik
    • Journal Title

      Information

      Volume: 13 Issue: 4 Pages: 168-168

    • DOI

      10.3390/info13040168

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] c-trie++: A dynamic trie tailored for fast prefix searches (2021)

    • Author(s)
      Kazuya Tsuruta, Dominik Koeppl, Shunsuke Kanda, Yuto Nakashima, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda
    • Journal Title

      Information and Computation

      Volume: - Pages: 104794-104794

    • DOI

      10.1016/j.ic.2021.104794

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Reversed Lempel-Ziv Factorization with Suffix Trees (2021)

    • Author(s)
      Koeppl Dominik
    • Journal Title

      Algorithms

      Volume: 14 Issue: 6 Pages: 161-161

    • DOI

      10.3390/a14060161

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Constructing the Bijective and the Extended Burrows-Wheeler Transform in Linear Time (2021)

    • Author(s)
      Hideo Bannai and Juha Kaerkkaeinen and Dominik Koeppl and Marcin Piątkowski
    • Journal Title

      Proceedings of CPM

      Volume: 191

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Extracting the Sparse Longest Common Prefix Array from the Suffix Binary Search Tree (2021)

    • Author(s)
      I Tomohiro, Irving Robert, Koeppl Dominik, Love Lorna
    • Journal Title

      Proc. SPIRE

      Volume: 12944 Pages: 143-150

    • DOI

      10.1007/978-3-030-86692-1_12

    • ISBN
      9783030866914, 9783030866921
    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Grammar Index by Induced Suffix Sorting (2021)

    • Author(s)
      Tooru Akagi, Dominik Koeppl, Yuto Nakashima, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda
    • Journal Title

      Proceedings of 28th International Symposium on String Processing and Information Retrieval

      Volume: 12944 Pages: 85-99

    • DOI

      10.1007/978-3-030-86692-1_8

    • ISBN
      9783030866914, 9783030866921
    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] A Separation of $\gamma$ and $b$ via Thue-Morse Words (2021)

    • Author(s)
      Bannai Hideo, Funakoshi Mitsuru, I Tomohiro, Koeppl Dominik, Mieno Takuya, Nishimoto Takaaki
    • Journal Title

      Proceedings of the 28th International Symposium on String Processing and Information Retrieval (SPIRE 2021)

      Volume: LNCS 12944 Pages: 167-178

    • DOI

      10.1007/978-3-030-86692-1_14

    • ISBN
      9783030866914, 9783030866921
    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Engineering Practical Lempel-Ziv Tries (2021)

    • Author(s)
      Diego Arroyuelo and Rodrigo Cánovas and Johannes Fischer and Dominik Koeppl and Marvin Loebel and Gonzalo Navarro and Rajeev Raman
    • Journal Title

      ACM JEA

      Volume: 26 Pages: 1-47

    • DOI

      10.1145/3481638

    • Related Report
      2021 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Presentation] A Data Structure Simulating the Suffix Array in the r-Index (2023)

    • Author(s)
      Christina Boucher and Dominik Koeppl and Herman Perera and Massimiliano Rossi
    • Organizer
      Local Proceedings of the LA Symposium Winter 2022
    • Related Report
      2022 Annual Research Report
  • [Presentation] Ratio of lex-parse Sizes Depending on the Alphabet Order (2023)

    • Author(s)
      Yuto Nakashima and Dominik Koeppl and Mitsuru Funakoshi and Shunsuke Inenaga
    • Organizer
      Local Proceedings of the 191st Algorithms Research Meeting (アルゴリズム研究会)
    • Related Report
      2022 Annual Research Report
  • [Presentation] Computing Variants of LZ77 and the LPF Array Based on Suffix Trees (2022)

    • Author(s)
      Dominik Koeppl
    • Organizer
      Local Proceedings of the Computation Research Meeting (コンピュテーション研究会)
    • Related Report
      2022 Annual Research Report
  • [Presentation] A Code Representing Lempel-Ziv Factor Distances with High-Order Entropy (2022)

    • Author(s)
      Dominik Koeppl and Gonzalo Navarro and Nicola Prezza
    • Organizer
      Local Proceedings of the 190th Algorithms Research Meeting (アルゴリズム研究会)
    • Related Report
      2022 Annual Research Report
  • [Presentation] Fast Computation of NP-hard Compression Measures Using SAT Solvers (2022)

    • Author(s)
      Hideo Bannai and Keisuke Goto and Masakazu Ishihata and Shunsuke Kanda and Dominik Koeppl and Takaaki Nishimoto
    • Organizer
      JSAI Technical Report, SIG on Fundamental Problems in Artificial Intelligence (人工知能学会研究会資料 人工知能基本問題研究会)
    • Related Report
      2021 Annual Research Report
  • [Presentation] A Space-Efficient Algorithm for Constructing the Lexicographic Parse (2021)

    • Author(s)
      Dominik Koeppl
    • Organizer
      Local Proceedings of the Computation Research Meeting (コンピュテーション研究会)
    • Related Report
      2021 Annual Research Report
  • [Presentation] Computation of Variations of the LZ77 factorization and the LPF Array with Suffix Trees (2021)

    • Author(s)
      Dominik Koeppl
    • Organizer
      WCTA
    • Related Report
      2021 Annual Research Report
    • Int'l Joint Research
  • [Remarks] Private Homepage

    • URL

      https://dkppl.de/

    • Related Report
      2022 Annual Research Report
  • [Remarks] Personal Homepage

    • URL

      https://dkppl.de/

    • Related Report
      2021 Annual Research Report

Published: 2021-10-22   Modified: 2023-12-25  
