Resource-Constraint Privacy-Aware Data Structures Tackling Problems in Bioinformatics

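Publicly Offered Research

Research Area	Creation and systematization of innovative algorithmic foundations as a source of social transformation (社会変革の源泉となる革新的アルゴリズム基盤の創出と体系化)
Project/Area Number	21H05847
Research Category	Grant-in-Aid for Transformative Research Areas (A)
Allocation Type	Single-year Grants
Review Section	Transformative Research Areas, Section (IV)
Research Institution	Tokyo Medical and Dental University
Principal Investigator	Koeppl Dominik, Tokyo Medical and Dental University, M&D Data Science Center, Assistant Professor (50897395)
Project Period (FY)	2021-09-10 – 2023-03-31
Project Status	Completed (Fiscal Year 2022)
Budget Amount	¥5,200,000 (Direct Cost: ¥4,000,000, Indirect Cost: ¥1,200,000) Fiscal Year 2022: ¥2,600,000 (Direct Cost: ¥2,000,000, Indirect Cost: ¥600,000) Fiscal Year 2021: ¥2,600,000 (Direct Cost: ¥2,000,000, Indirect Cost: ¥600,000)
Keywords	data compression / genetic data indexes / resource constraints / text indexing / matching statistics / parameterized matching / suffix array access / privacy-aware computing / factorization algorithms / LZ78 compression / lexicographic parse / sparse suffix sorting / grammar compression / compressed data / memory-efficiency / hashing / biological data indexing / space-efficiency / privacy-aware / lossless compression / compressed indexing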
Outline of Research at the Start	Recent advances in technology have made it possible to collect vast amounts of biological data that are valuable for studying genetic diseases and devising individually targeted therapies. Unfortunately, while the collection of such data has gained considerable momentum, we are unaware of solutions that can cope with the collected data efficiently while supporting biologically important queries under the restriction that privacy is respected. Such a solution could make it possible to discover insights into diseases and into side effects of medical treatments caused by genetic variations.
Outline of Annual Research Achievements	To index biological data in a meaningful way, we presented two new approaches at SPIRE'22. The first is an augmentation of the r-index that improves the time for random access to the suffix array. Such access is usually realized by sequentially applying the Phi array, which has proven slow in practice. We slightly improved the time by simulating the predecessor queries with a walk on a labelled graph, on which some of the predecessor queries can be omitted. The second concerns parameterized pattern matching, an extension of classic pattern matching, for which we proposed the first efficient algorithm that computes the parameterized Burrows-Wheeler transform online. For computing matching statistics, we improved the practical running time by using the r-index augmented with two helper data structures: a grammar supporting longest common extension (LCE) queries, and the thresholds array. While Bannai et al. [TCS'20] showed how to compute matching statistics with the r-index, we provided two successive improvements: a software tool called PHONI two years ago, and a recent practical improvement that skips some LCE queries by storing additional LCE values of the thresholds. The small increase in space is justified by a remarkable improvement in query time, since the LCE queries answered by the grammar tend to be the bottleneck of the whole algorithm.
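Current Status of Research Progress (Paragraph)	Not entered because fiscal year 2022 (Reiwa 4) is the final year.
Strategy for Future Research Activity	Not entered because fiscal year 2022 (Reiwa 4) is the final year.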
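
The following is a minimal illustrative sketch, not the project's r-index-based implementation, of two notions mentioned in the outline of research achievements: recovering a suffix array entry from a known sample by repeatedly applying the Phi array (where Phi[SA[i]] = SA[i-1]), and the definition of matching statistics. It assumes plain, uncompressed arrays and brute-force search; all function names are hypothetical and chosen for this example only.

```python
from typing import List, Optional


def suffix_array(text: str) -> List[int]:
    """Naive suffix array: starting positions sorted by their suffixes."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def phi_array(sa: List[int]) -> List[Optional[int]]:
    """Phi maps a text position SA[i] to SA[i-1]; undefined (None) for SA[0]."""
    phi: List[Optional[int]] = [None] * len(sa)
    for rank in range(1, len(sa)):
        phi[sa[rank]] = sa[rank - 1]
    return phi


def access_via_phi(phi, sample_rank: int, sample_value: int, rank: int) -> int:
    """Recover SA[rank] from a known sample SA[sample_rank] = sample_value
    by applying Phi (sample_rank - rank) times in sequence."""
    if rank > sample_rank:
        raise ValueError("the sample must lie at or after the requested rank")
    value = sample_value
    for _ in range(sample_rank - rank):
        value = phi[value]
    return value


def matching_statistics(text: str, pattern: str) -> List[int]:
    """MS[i] = length of the longest prefix of pattern[i:] occurring in text
    (brute force; an assumption made for clarity, not for efficiency)."""
    ms = []
    for i in range(len(pattern)):
        length = 0
        while i + length < len(pattern) and pattern[i:i + length + 1] in text:
            length += 1
        ms.append(length)
    return ms


if __name__ == "__main__":
    sa = suffix_array("banana$")
    phi = phi_array(sa)
    # Start from the last suffix array entry as the only sample.
    print(access_via_phi(phi, len(sa) - 1, sa[-1], 2), sa[2])
    print(matching_statistics("banana", "anab"))
```

In this sketch the number of Phi steps equals the rank distance to the sample, which hints at why a purely sequential application can be slow when samples are far apart.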

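Report

(2 results)

2022 Annual Research Report
2021 Annual Research Report

Research Products
(39 results)

All: Int'l Joint Research (8 results), Journal Article (22 results) (of which Int'l Joint Research: 22, Peer Reviewed: 22, Open Access: 12), Presentation (7 results) (of which Int'l Conference: 1), Remarks (2 results)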

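[Int'l Joint Research] University of Florida/Illumina Inc (USA)
- Related Report
  2022 Annual Research Report
[Int'l Joint Research] University of Pisa (Italy)
- Related Report
  2022 Annual Research Report
[Int'l Joint Research] Lodz University of Technology/University of Piemonte Orientale (Poland)
- Related Report
  2022 Annual Research Report
[Int'l Joint Research] Nicolaus Copernicus University (Poland)
- Related Report
  2021 Annual Research Report
[Int'l Joint Research] University of Glasgow/University of Leicester (United Kingdom)
- Related Report
  2021 Annual Research Report
[Int'l Joint Research] Millennium Institute/Tecnica Federico Santa Maria/University of Chile (Chile)
- Related Report
  2021 Annual Research Report
[Int'l Joint Research] Baker Heart and Diabetes Institute (Australia)
- Related Report
  2021 Annual Research Report
[Int'l Joint Research] National Tsing Hua University (Taiwan)
- Related Report
  2021 Annual Research Report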
[Journal Article] Dynamic Skyline Computation with LSD Trees (2023)
- Author(s)/Presenter(s)
  Dominik Koeppl
- Journal Title
  Analytics
  Volume: 2 Issue: 1 Pages: 146-162
- DOI
  10.3390/analytics2010009
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Space-efficient Huffman codes revisited (2023)
- Author(s)/Presenter(s)
  Szymon Grabowski and Dominik Koeppl
- Journal Title
  Information Processing Letters
  Volume: 179 Pages: 1-8
- DOI
  10.1016/j.ipl.2022.106274
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Graph Compression for Adjacency-Matrix Multiplication (2022)
- Author(s)/Presenter(s)
  Alexandre P. Francisco and Travis Gagie and Dominik Koeppl and Susana Ladra and Gonzalo Navarro
- Journal Title
  SN Computer Science
  Volume: 3 Issue: 3 Pages: 1-8
- DOI
  10.1007/s42979-022-01084-2
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Computing Longest (Common) Lyndon Subsequences (2022)
- Author(s)/Presenter(s)
  Hideo Bannai, Tomohiro I, Tomasz Kociumaka, Dominik Koeppl, Simon J. Puglisi
- Journal Title
  Proc. 33rd International Workshop on Combinatorial Algorithms (IWOCA) 2022
  Volume: - Pages: 128-142
- DOI
  10.1007/978-3-031-06678-8_10
- ISBN
  9783031066771, 9783031066788
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Space-Efficient B Trees via Load-Balancing (2022)
- Author(s)/Presenter(s)
  Tomohiro I, Dominik Koeppl
- Journal Title
  Proc. 33rd International Workshop on Combinatorial Algorithms (IWOCA) 2022
  Volume: - Pages: 327-340
- DOI
  10.1007/978-3-031-06678-8_24
- ISBN
  9783031066771, 9783031066788
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Linking Off-Road Points to Routing Networks (2022)
- Author(s)/Presenter(s)
  Dominik Koeppl
- Journal Title
  Algorithms
  Volume: 15 Issue: 5 Pages: 1-15
- DOI
  10.3390/a15050163
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Fast and Simple Compact Hashing via Bucketing (2022)
- Author(s)/Presenter(s)
  Dominik Koeppl and Simon J. Puglisi and Rajeev Raman
- Journal Title
  Algorithmica
  Volume: 84 Issue: 9 Pages: 2735-2766
- DOI
  10.1007/s00453-022-00996-y
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Computing the Parameterized Burrows-Wheeler Transform Online (2022)
- Author(s)/Presenter(s)
  Daiki Hashimoto and Diptarama Hendrian and Dominik Koeppl and Ryo Yoshinaka and Ayumi Shinohara
- Journal Title
  Proceedings of SPIRE
  Volume: 13617 Pages: 70-85
- DOI
  10.1007/978-3-031-20643-6_6
- ISBN
  9783031206429, 9783031206436
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Accessing the Suffix Array via $\phi^{-1}$-Forest (2022)
- Author(s)/Presenter(s)
  Christina Boucher and Dominik Koeppl and Herman Perera and Massimiliano Rossi
- Journal Title
  Proceedings of SPIRE
  Volume: 13617 Pages: 86-98
- DOI
  10.1007/978-3-031-20643-6_7
- ISBN
  9783031206429, 9783031206436
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Computing NP-hard Repetitiveness Measures via MAX-SAT (2022)
- Author(s)/Presenter(s)
  Hideo Bannai and Keisuke Goto and Masakazu Ishihata and Shunsuke Kanda and Dominik Koeppl and Takaaki Nishimoto
- Journal Title
  Proceedings of ESA
  Volume: 244
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices (2022)
- Author(s)/Presenter(s)
  Paolo Ferragina and Giovanni Manzini and Travis Gagie and Dominik Koeppl and Gonzalo Navarro and Manuel Striani and Francesco Tosoni
- Journal Title
  Proc. VLDB
  Volume: 15 Issue: 10 Pages: 2175-2187
- DOI
  10.14778/3547305.3547321
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] FM-Indexing Grammars Induced by Suffix Sorting for Long Patterns (2022)
- Author(s)/Presenter(s)
  Jin Jie Deng and Wing-Kai Hon and Dominik Koeppl and Kunihiko Sadakane
- Journal Title
  Proc. DCC
  Volume: 83--92 Pages: 63-72
- DOI
  10.1109/dcc52660.2022.00014
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] HOLZ: High-Order Entropy Encoding of Lempel-Ziv Factor Distances (2022)
- Author(s)/Presenter(s)
  Dominik Koeppl and Gonzalo Navarro and Nicola Prezza
- Journal Title
  Proc. DCC
  Volume: 2022 Pages: 83-92
- DOI
  10.1109/dcc52660.2022.00016
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Computing Lexicographic Parsings (2022)
- Author(s)/Presenter(s)
  Koeppl Dominik
- Journal Title
  Proc. DCC
  Volume: 2022 Pages: 232-241
- DOI
  10.1109/dcc52660.2022.00031
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Inferring Spatial Distance Rankings with Partial Knowledge on Routing Networks (2022)
- Author(s)/Presenter(s)
  Koeppl Dominik
- Journal Title
  Information
  Volume: 13 Issue: 4 Pages: 168-168
- DOI
  10.3390/info13040168
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] c-trie++: A dynamic trie tailored for fast prefix searches (2021)
- Author(s)/Presenter(s)
  Kazuya Tsuruta, Dominik Koeppl, Shunsuke Kanda, Yuto Nakashima, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda
- Journal Title
  Information and Computation
  Volume: - Pages: 104794-104794
- DOI
  10.1016/j.ic.2021.104794
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Reversed Lempel-Ziv Factorization with Suffix Trees (2021)
- Author(s)/Presenter(s)
  Koeppl Dominik
- Journal Title
  Algorithms
  Volume: 14 Issue: 6 Pages: 161-161
- DOI
  10.3390/a14060161
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Constructing the Bijective and the Extended Burrows-Wheeler Transform in Linear Time (2021)
- Author(s)/Presenter(s)
  Hideo Bannai and Juha Kaerkkaeinen and Dominik Koeppl and Marcin Piątkowski
- Journal Title
  Proceedings of CPM
  Volume: 191
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Extracting the Sparse Longest Common Prefix Array from the Suffix Binary Search Tree (2021)
- Author(s)/Presenter(s)
  I Tomohiro, Irving Robert, Koeppl Dominik, Love Lorna
- Journal Title
  Proc. SPIRE
  Volume: 12944 Pages: 143-150
- DOI
  10.1007/978-3-030-86692-1_12
- ISBN
  9783030866914, 9783030866921
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Grammar Index by Induced Suffix Sorting (2021)
- Author(s)/Presenter(s)
  Tooru Akagi, Dominik Koeppl, Yuto Nakashima, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda
- Journal Title
  Proceedings of 28th International Symposium on String Processing and Information Retrieval
  Volume: 12944 Pages: 85-99
- DOI
  10.1007/978-3-030-86692-1_8
- ISBN
  9783030866914, 9783030866921
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] A Separation of $\gamma$ and $b$ via Thue-Morse Words (2021)
- Author(s)/Presenter(s)
  Bannai Hideo, Funakoshi Mitsuru, I Tomohiro, Koeppl Dominik, Mieno Takuya, Nishimoto Takaaki
- Journal Title
  Proceedings of the 28th International Symposium on String Processing and Information Retrieval (SPIRE 2021)
  Volume: LNCS 12944 Pages: 167-178
- DOI
  10.1007/978-3-030-86692-1_14
- ISBN
  9783030866914, 9783030866921
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Engineering Practical Lempel-Ziv Tries (2021)
- Author(s)/Presenter(s)
  Diego Arroyuelo and Rodrigo Cánovas and Johannes Fischer and Dominik Koeppl and Marvin Loebel and Gonzalo Navarro and Rajeev Raman
- Journal Title
  ACM JEA
  Volume: 26 Pages: 1-47
- DOI
  10.1145/3481638
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
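[Presentation] A data structure simulating the suffix array in the r-index (2023)
- Author(s)/Presenter(s)
  Christina Boucher and Dominik Koeppl and Herman Perera and Massimiliano Rossi
- Organizer
  Local Proceedings of the LA Symposium Winter 2022
- Related Report
  2022 Annual Research Report
[Presentation] Ratio of lex-parse sizes depending on the alphabet order (2023)
- Author(s)/Presenter(s)
  Yuto Nakashima and Dominik Koeppl and Mitsuru Funakoshi and Shunsuke Inenaga
- Organizer
  Local Proceedings of the 191st Algorithms Research Meeting (アルゴリズム研究会)
- Related Report
  2022 Annual Research Report
[Presentation] Computing variants of LZ77 and the LPF array based on suffix trees (2022)
- Author(s)/Presenter(s)
  Dominik Koeppl
- Organizer
  Local Proceedings of the Computation Research Meeting (コンピュテーション研究会)
- Related Report
  2022 Annual Research Report
[Presentation] A code representing the distances of Lempel-Ziv factors with high-order entropy (2022)
- Author(s)/Presenter(s)
  Dominik Koeppl and Gonzalo Navarro and Nicola Prezza
- Organizer
  Local Proceedings of the 190th Algorithms Research Meeting (アルゴリズム研究会)
- Related Report
  2022 Annual Research Report
[Presentation] Fast computation of NP-hard compression measures using SAT solvers (2022)
- Author(s)/Presenter(s)
  Hideo Bannai and Keisuke Goto and Masakazu Ishihata and Shunsuke Kanda and Dominik Koeppl and Takaaki Nishimoto
- Organizer
  Japanese Society for Artificial Intelligence technical report, research group on fundamental problems in artificial intelligence (人工知能学会研究会資料人工知能基本問題研究会)
- Related Report
  2021 Annual Research Report
[Presentation] A space-efficient algorithm for constructing the lexicographic parse (2021)
- Author(s)/Presenter(s)
  Dominik Koeppl
- Organizer
  Local Proceedings of the Computation Research Meeting (コンピュテーション研究会)
- Related Report
  2021 Annual Research Report
[Presentation] Computation of Variations of the LZ77 factorization and the LPF Array with Suffix Trees (2021)
- Author(s)/Presenter(s)
  Dominik Koeppl
- Organizer
  WCTA
- Related Report
  2021 Annual Research Report
- Int'l Conference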
[Remarks] Private Homepage
- URL
  https://dkppl.de/
- Related Report
  2022 Annual Research Report
[Remarks] Personal Homepage
- URL
  https://dkppl.de/
- Related Report
  2021 Annual Research Report
