
String Indexing Based on Space-Optimal Grammar Compression and Its Application to Knowledge Discovery from Stream Data

Research Project

Project/Area Number 18K18111
Research Category

Grant-in-Aid for Early-Career Scientists

Allocation Type Multi-year Fund
Review Section Basic Section 61030: Intelligent informatics-related
Research Institution Kyushu Institute of Technology

Principal Investigator

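Takabatake Yoshimasa  Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Specially Appointed Assistant Professor (20807010)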

Project Period (FY) 2018-04-01 – 2021-03-31
Project Status Completed (Fiscal Year 2020)
Budget Amount
¥3,900,000 (Direct Cost: ¥3,000,000, Indirect Cost: ¥900,000)
Fiscal Year 2020: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
Fiscal Year 2019: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
Fiscal Year 2018: ¥1,560,000 (Direct Cost: ¥1,200,000, Indirect Cost: ¥360,000)
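Keywords data compression / compressed index / compressed information processing / grammar compression / BWT / string search / random access / compressed search / privacy-preserving computation / edit distance with moves / text data compression / online algorithm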
Outline of Final Research Achievements

Collections of highly repetitive text now exceed a terabyte and continue to grow. In this research, we developed grammar compression methods and Online Run-Length BWTs (ORLBWTs) that compress such large streaming data at high speed while working in compressed space. We also developed various information-processing techniques that operate directly on the compressed data. Although we could not develop a grammar-based compressed index supporting real-time keyword search on large streaming data, we significantly improved the construction time of ORLBWTs, and this work led to an ORLBWT-based compressed index that supports real-time search on large streaming data [Bannai et al. TCS2020].

Academic Significance and Societal Importance of the Research Achievements

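The grammar compression methods and the Online Run-Length BWT (ORLBWT) developed in this project make it possible to compress data exceeding a terabyte with less memory and at higher speed. In addition, a compressed index that applies the developed ORLBWT and supports real-time keyword search makes it possible to extract information efficiently from huge streaming data. Applying the compressed information-processing techniques developed here is also expected to enable real-time knowledge discovery from huge streaming data.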

Report

(4 results)
  • 2020 Annual Research Report   Final Research Report ( PDF )
  • 2019 Research-status Report
  • 2018 Research-status Report
  • Research Products

    (16 results)


All Int'l Joint Research (4 results) Journal Article (2 results) (of which Int'l Joint Research: 1 results,  Peer Reviewed: 2 results,  Open Access: 2 results) Presentation (9 results) (of which Int'l Joint Research: 9 results) Remarks (1 results)

  • [Int'l Joint Research] Dalhousie University (Canada)

    • Related Report
      2020 Annual Research Report
  • [Int'l Joint Research] University of Piemonte Orientale (Italy)

    • Related Report
      2020 Annual Research Report
  • [Int'l Joint Research] University of Chile (Chile)

    • Related Report
      2020 Annual Research Report
  • [Int'l Joint Research] University of Siegen (Germany)

    • Related Report
      2020 Annual Research Report
  • [Journal Article] Re-Pair in Small Space (2020)

    • Author(s)
      Dominik Köppl, Tomohiro I, Isamu Furuya, Yoshimasa Takabatake, Kensuke Sakai, Keisuke Goto
    • Journal Title

      Algorithms

      Volume: 14 Issue: 1 Pages: 1-20

    • DOI

      10.3390/a14010005

    • Related Report
      2020 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] A faster implementation of online RLBWT and its application to LZ77 parsing (2018)

    • Author(s)
      Ohno Tatsuya, Sakai Kensuke, Takabatake Yoshimasa, I Tomohiro, Sakamoto Hiroshi
    • Journal Title

      Journal of Discrete Algorithms

      Volume: 52-53 Pages: 18-28

    • DOI

      10.1016/j.jda.2018.11.002

    • Related Report
      2018 Research-status Report
    • Peer Reviewed / Open Access
  • [Presentation] Practical Random Access to SLP-Compressed Texts (2020)

    • Author(s)
      Travis Gagie, Tomohiro I, Giovanni Manzini, Gonzalo Navarro, Hiroshi Sakamoto, Louisa Seelbach Benkner, Yoshimasa Takabatake
    • Organizer
      The 27th International Symposium on String Processing and Information Retrieval (SPIRE)
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Re-Pair in Small Space (2020)

    • Author(s)
      Dominik Köppl, Tomohiro I, Isamu Furuya, Yoshimasa Takabatake, Kensuke Sakai, Keisuke Goto
    • Organizer
      Prague Stringology Conference (PSC) 2020
    • Related Report
      2020 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Faster Privacy-Preserving Computation of Edit Distance with Moves (2020)

    • Author(s)
      Yohei Yoshimoto, Masaharu Kataoka, Yoshimasa Takabatake, Tomohiro I, Kilho Shin, Hiroshi Sakamoto
    • Organizer
      The 14th International Workshop on Algorithms and Computation
    • Related Report
      2019 Research-status Report
    • Int'l Joint Research
  • [Presentation] Re-Pair in Small Space (2020)

    • Author(s)
      Dominik Köppl, Tomohiro I, Isamu Furuya, Yoshimasa Takabatake, Kensuke Sakai, Keisuke Goto
    • Organizer
      Data Compression Conference
    • Related Report
      2019 Research-status Report
    • Int'l Joint Research
  • [Presentation] Rpair: Rescaling RePair with Rsync (2019)

    • Author(s)
      Travis Gagie, Tomohiro I, Giovanni Manzini, Gonzalo Navarro, Hiroshi Sakamoto, Yoshimasa Takabatake
    • Organizer
      The 26th International Symposium on String Processing and Information Retrieval
    • Related Report
      2019 Research-status Report
    • Int'l Joint Research
  • [Presentation] RePair in Compressed Space and Time (2019)

    • Author(s)
      Kensuke Sakai, Tatsuya Ohno, Keisuke Goto, Yoshimasa Takabatake, Tomohiro I, Hiroshi Sakamoto
    • Organizer
      Data Compression Conference
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Privacy-Preserving String Edit Distance with Moves (2018)

    • Author(s)
      Shunta Nakagawa, Tokio Sakamoto, Yoshimasa Takabatake, Tomohiro I, Kilho Shin, Hiroshi Sakamoto
    • Organizer
      The 11th International Conference on Similarity Search and Applications
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] LZ-ABT: A Practical Algorithm for α-Balanced Grammar Compression (2018)

    • Author(s)
      Tatsuya Ohno, Keisuke Goto, Yoshimasa Takabatake, Tomohiro I, Hiroshi Sakamoto
    • Organizer
      The 29th International Workshop on Combinatorial Algorithms
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Improved Grammar Compression in Constant Space (2018)

    • Author(s)
      Reimi Tanaka, Yoshimasa Takabatake, Tomohiro I, Hiroshi Sakamoto
    • Organizer
      The 14th International Conference on Grammatical Inference
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Remarks] My (Yoshimasa Takabatake's) home page

    • URL

      http://www.donald.ai.kyutech.ac.jp/~takabatake/

    • Related Report
      2020 Annual Research Report

Published: 2018-04-23   Modified: 2022-01-27  
