2022 年度実績報告書

Resource-Constraint Privacy-Aware Data Structures Tackling Problems in Bioinformatics

公募研究

研究領域	社会変革の源泉となる革新的アルゴリズム基盤の創出と体系化
研究課題/領域番号	21H05847
研究機関	東京医科歯科大学
研究代表者	Koeppl Dominik 東京医科歯科大学, M&Dデータ科学センター, 助教 (50897395)
研究期間 (年度)	2021-09-10 – 2023-03-31
キーワード	data compression / genetic data indexes / resource constraints / text indexing / matching statistics / parameterized matching / suffix array access
研究実績の概要	For indexing biological data meaningful, we presented at SPIRE'22 two new approaches: The first is an augmentation of the r-index for improving the time for random accesses in the suffix array.　This is usually done by a sequential application of the Phi-Array. This method has been experienced as slow in practice. We therefore could slightly improve the time by simulating the predecessor queries with a walk on a labelled graph, on which we can omit some of the predecessor queries. The second is for parameterized pattern matching, which is an extension of classic pattern matching. Here, we proposed the first efficient algorithm for computing the parameterized Burrows-Wheeler transform online. When it comes to computing matching statistics, we could practically improve the time for the computation with the r-index augmented with some helper data structures, in detail: a grammar with longest common extension (LCE) query support, and the thresholds array. While Bannai et al. [TCS'20] showed how to compute matching statistics with the r-index, we provided two successive improvements with a software called PHONI two years ago, and with a recent practical improvement by skipping some LCE queries by storing additional LCE values of the thresholds. We can justify this small space increase with a remarkable improvement in the query time since the LCE queries answered by the used grammar tend to be the bottleneck of the whole algorithm.
現在までの達成度 (段落)	令和4年度が最終年度であるため、記入しない。
今後の研究の推進方策	令和4年度が最終年度であるため、記入しない。

研究成果
(20件)

すべて 2023 2022 その他

すべて国際共同研究 (3件) 雑誌論文 (12件) (うち国際共著 12件、査読あり 12件、オープンアクセス 7件) 学会発表 (4件) 備考 (1件)

[国際共同研究] University of Florida/Illumina Inc(米国)
- 国名
  米国
- 外国機関名
  University of Florida/Illumina Inc
[国際共同研究] University of Pisa(イタリア)
- 国名
  イタリア
- 外国機関名
  University of Pisa
[国際共同研究] Lodz University of Technology/University of Piemonte Orientale(ポーランド)
- 国名
  ポーランド
- 外国機関名
  Lodz University of Technology/University of Piemonte Orientale
[雑誌論文] Dynamic Skyline Computation with LSD Trees2023
- 著者名/発表者名
  Dominik Koeppl
- 雑誌名
  
  Analytics
  
  巻: 2 ページ: 146-162
- DOI
  10.3390/analytics2010009
- 査読あり / オープンアクセス / 国際共著
[雑誌論文] Space-efficient Huffman codes revisited2023
- 著者名/発表者名
  Szymon Grabowski and Dominik Koeppl
- 雑誌名
  
  Information Processing Letters
  
  巻: 179 ページ: 1-8
- DOI
  10.1016/j.ipl.2022.106274
- 査読あり / オープンアクセス / 国際共著
[雑誌論文] c-trie++: A dynamic trie tailored for fast prefix searches2022
- 著者名/発表者名
  Kazuya Tsuruta and Dominik Koeppl and Shunsuke Kanda and Yuto Nakashima and Shunsuke Inenaga and Hideo Bannai and Masayuki Takeda
- 雑誌名
  
  Inf. Comput.
  
  巻: 285 Part B ページ: 1-22
- DOI
  10.1016/j.ic.2021.104794
- 査読あり / オープンアクセス / 国際共著
[雑誌論文] Graph Compression for Adjacency-Matrix Multiplication2022
- 著者名/発表者名
  Alexandre P. Francisco and Travis Gagie and Dominik Koeppl and Susana Ladra and Gonzalo Navarro
- 雑誌名
  
  SN Computer Science
  
  巻: 3 ページ: 1-8
- DOI
  10.1007/s42979-022-01084-2
- 査読あり / オープンアクセス / 国際共著
[雑誌論文] Computing Longest (Common) Lyndon Subsequences2022
- 著者名/発表者名
  Hideo Bannai and Tomohiro I and Tomasz Kociumaka and Dominik Koeppl and Simon J. Puglisi
- 雑誌名
  
  Proceedings of IWOCA
  
  巻: 13270 ページ: 128-142
- DOI
  10.1007/978-3-031-06678-8_10
- 査読あり / 国際共著
[雑誌論文] Space-Efficient B Trees via Load-Balancing2022
- 著者名/発表者名
  Tomohiro I and Dominik Koeppl
- 雑誌名
  
  Proceedings of IWOCA
  
  巻: 13270 ページ: 327-340
- DOI
  10.1007/978-3-031-06678-8_24
- 査読あり / 国際共著
[雑誌論文] Linking Off-Road Points to Routing Networks2022
- 著者名/発表者名
  Dominik Koeppl
- 雑誌名
  
  Algorithms
  
  巻: 15(5) ページ: 1-15
- DOI
  10.3390/a15050163
- 査読あり / オープンアクセス / 国際共著
[雑誌論文] Fast and Simple Compact Hashing via Bucketing2022
- 著者名/発表者名
  Dominik Koeppl and Simon J. Puglisi and Rajeev Raman
- 雑誌名
  
  Algorithmica
  
  巻: 84 ページ: 2735-2766
- DOI
  10.1007/s00453-022-00996-y
- 査読あり / オープンアクセス / 国際共著
[雑誌論文] Computing the Parameterized Burrows-Wheeler Transform Online2022
- 著者名/発表者名
  Daiki Hashimoto and Diptarama Hendrian and Dominik Koeppl and Ryo Yoshinaka and Ayumi Shinohara
- 雑誌名
  
  Proceedings of SPIRE
  
  巻: 13617 ページ: 70-85
- DOI
  10.1007/978-3-031-20643-6_6
- 査読あり / 国際共著
[雑誌論文] Accessing the Suffix Array via $\phi^-1$-Forest2022
- 著者名/発表者名
  Christina Boucher and Dominik Koeppl and Herman Perera and Massimiliano Rossi
- 雑誌名
  
  Proceedings of SPIRE
  
  巻: 13617 ページ: 86-98
- DOI
  10.1007/978-3-031-20643-6_7
- 査読あり / 国際共著
[雑誌論文] Computing NP-hard Repetitiveness Measures via MAX-SAT2022
- 著者名/発表者名
  Hideo Bannai and Keisuke Goto and Masakazu Ishihata and Shunsuke Kanda and Dominik Koeppl and Takaaki Nishimoto
- 雑誌名
  
  Proceedings of ESA
  
  巻: 244 ページ: 12:1-12:16
- DOI
  10.4230/LIPIcs.ESA.2022.12
- 査読あり / オープンアクセス / 国際共著
[雑誌論文] Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices2022
- 著者名/発表者名
  Paolo Ferragina and Giovanni Manzini and Travis Gagie and Dominik Koeppl and Gonzalo Navarro and Manuel Striani and Francesco Tosoni
- 雑誌名
  
  Proc. VLDB
  
  巻: 15 ページ: 2175-2187
- DOI
  10.14778/3547305.3547321
- 査読あり / 国際共著
[学会発表] r インデックスにおける接尾辞配列を模倣するデータ構造2023
- 著者名/発表者名
  Christina Boucher and Dominik Koeppl and Herman Perera and Massimiliano Rossi
- 学会等名
  Local Proceedings of the LA Symposium Winter 2022
[学会発表] アルファベット順による lex-parse サイズ比2023
- 著者名/発表者名
  中島祐人 and クップルドミニク and 舩越満 and 稲永俊介
- 学会等名
  Local Proceedings of the 191th アルゴリズム研究会
[学会発表] 接尾辞木に基づくLZ77とLPF配列の変種の計算2022
- 著者名/発表者名
  クップルドミニク
- 学会等名
  Local Proceedings of コンピュテーション研究会
[学会発表] Lempel-Ziv 項の距離を高次情報量で表現する符号2022
- 著者名/発表者名
  Dominik Koeppl and Gonzalo Navarro and Nicola Prezza
- 学会等名
  Local Proceedings of the 190th アルゴリズム研究会
[備考] Private Homepage
- URL
  https://dkppl.de/

2022 年度 実績報告書

Resource-Constraint Privacy-Aware Data Structures Tackling Problems in Bioinformatics

研究代表者

Koeppl Dominik 東京医科歯科大学, M&Dデータ科学センター, 助教 (50897395)

研究成果

[国際共同研究] University of Florida/Illumina Inc(米国)

国名

外国機関名

[国際共同研究] University of Pisa(イタリア)

国名

外国機関名

[国際共同研究] Lodz University of Technology/University of Piemonte Orientale(ポーランド)

国名

外国機関名

[雑誌論文] Dynamic Skyline Computation with LSD Trees2023

著者名/発表者名

雑誌名

DOI

[雑誌論文] Space-efficient Huffman codes revisited2023

著者名/発表者名

雑誌名

DOI

[雑誌論文] c-trie++: A dynamic trie tailored for fast prefix searches2022

著者名/発表者名

雑誌名

DOI

[雑誌論文] Graph Compression for Adjacency-Matrix Multiplication2022

著者名/発表者名

雑誌名

DOI

[雑誌論文] Computing Longest (Common) Lyndon Subsequences2022

著者名/発表者名

雑誌名

DOI

[雑誌論文] Space-Efficient B Trees via Load-Balancing2022

著者名/発表者名

雑誌名

DOI

[雑誌論文] Linking Off-Road Points to Routing Networks2022

著者名/発表者名

雑誌名

DOI

[雑誌論文] Fast and Simple Compact Hashing via Bucketing2022

著者名/発表者名

雑誌名

DOI

[雑誌論文] Computing the Parameterized Burrows-Wheeler Transform Online2022

著者名/発表者名

雑誌名

DOI

[雑誌論文] Accessing the Suffix Array via $\phi^-1$-Forest2022

著者名/発表者名

雑誌名

DOI

[雑誌論文] Computing NP-hard Repetitiveness Measures via MAX-SAT2022

著者名/発表者名

雑誌名

DOI

[雑誌論文] Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices2022

著者名/発表者名

雑誌名

DOI

[学会発表] r インデックスにおける接尾辞配列を模倣するデータ構造2023

著者名/発表者名

学会等名

[学会発表] アルファベット順による lex-parse サイズ比2023

著者名/発表者名

学会等名

[学会発表] 接尾辞木に基づくLZ77とLPF配列の変種の計算2022

著者名/発表者名

学会等名

[学会発表] Lempel-Ziv 項の距離を高次情報量で表現する符号2022

著者名/発表者名

学会等名

[備考] Private Homepage

URL

2022 年度実績報告書