• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2012 Fiscal Year Final Research Report

Pattern Discovery and Data Classification Based on String Compression

Research Project

  • PDF
Project/Area Number 22680014
Research Category

Grant-in-Aid for Young Scientists (A)

Allocation TypeSingle-year Grants
Research Field Intelligent informatics
Research InstitutionKyushu University

Principal Investigator

BANNAI Hideo  九州大学, システム情報研究院, 准教授 (20323644)

Project Period (FY) 2010 – 2012
Keywords圧縮文字列処理 / パターン発見
Research Abstract

Compressed string processing is an approach that aims to process a compressed representation of the string without explicitly decompressing it. In this study, we investigated the application of this approach to the problem of string pattern discovery and string data classification, and developed various efficient algorithms. Especially for the q-gram frequencies problem, we succeeded in developing a practically efficient algorithm that can be faster than directly processing the uncompressed text, showing, the effectiveness of the approach to the string pattern discovery and string classification problems

  • Research Products

    (21 results)

All 2013 2012 2011 2010

All Journal Article (12 results) Presentation (9 results)

  • [Journal Article] Simpler and Faster Lempel Ziv Factorization, Proc.2013

    • Author(s)
      Keisuke Goto and Hideo Bannai
    • Journal Title

      Data Compression Conference 2013 (DCC 2013)

      Pages: 133-142

  • [Journal Article] From Run Length Encoding to LZ78 and Back Again2013

    • Author(s)
      Yuya Tamakoshi, Tomohiro I, Shunsuke Inenaga, Hideo Bannai, and Masayuki Takeda
    • Journal Title

      Proc. Data Compression Conference 2013 (DCC 2013)

      Pages: 143-152

  • [Journal Article] Computing convolution on grammar-compressed text2013

    • Author(s)
      Toshiya Tanaka, Tomohiro I, Shunsuke Inenaga, Hideo Bannai, and Masayuki Takeda
    • Journal Title

      Proc. Data Compression Conference 2013 (DCC 2013)

      Pages: 451-460

  • [Journal Article] Fast q-gram mining on SLP compressed strings2013

    • Author(s)
      Keisuke Goto, Hideo Bannai, Shunsuke Inenaga, and Masayuki Takeda
    • Journal Title

      Journal of Discrete Algorithms

      Volume: 18 Pages: 89-99

  • [Journal Article] Efficient LZ78 factorization of grammar compressed text, Proc2012

    • Author(s)
      Hideo Bannai, Shunsuke Inenaga, and Masayuki Takeda
    • Journal Title

      19th International Symposium on String Processing and Information Retrieval (SPIRE 2012)

      Volume: 7608 Pages: 86-98

  • [Journal Article] An Efficient Algorithm to Test Square-Freeness of Strings Compressed by Straight-Line Programs2012

    • Author(s)
      Hideo Bannai, Travis Gagie, Tomohiro I, Shunsuke Inenaga, Gad M. Landau, and Moshe Lewenstein
    • Journal Title

      Information Processing Letters

      Volume: 112(19) Pages: 711-714

  • [Journal Article] Speeding up q-gram mining on grammar-based compressed texts2012

    • Author(s)
      Keisuke Goto, Hideo Bannai, Shunsuke Inenaga, Masayuki Takeda
    • Journal Title

      Proc. 23rd Annual Symposium on Combinatorial Pattern Matching (CPM 2012)

      Volume: 7354 Pages: 220-231

  • [Journal Article] Computing q-gram Non-overlapping Frequencies on SLP Compressed Texts2012

    • Author(s)
      Keisuke Goto, Hideo Bannai, Shunsuke Inenaga, Masayuki Takeda
    • Journal Title

      Proc. 38th International Conference on Current Trends in Theory and Practice of Computer Science (SOFSEM 2012)

      Volume: 7147 Pages: 301-312

  • [Journal Article] Finding Characteristic Substrings from Compressed Texts2012

    • Author(s)
      Shunsuke Inenaga, Hideo Bannai
    • Journal Title

      International Journal of Foundations of Computer Science

      Volume: 23(2) Pages: 261-280

  • [Journal Article] Fast q-gram Mining on SLP Compressed Strings2011

    • Author(s)
      Keisuke Goto, Hideo Bannai, Shunsuke Inenaga, Masayuki Takeda
    • Journal Title

      Proc. 18th International Symposium on String Processing and Information Retrieval (SPIRE 2011)

      Volume: 7024 Pages: 278-289

  • [Journal Article] Faster Subsequence and Don't-Care Pattern Matching on Compressed Texts2011

    • Author(s)
      Takanori Yamamoto, Hideo Bannai, Shunsuke Inenaga, Masayuki Takeda
    • Journal Title

      Proc. 22nd Annual Symposium on Combinatorial Pattern Matching (CPM 2011)

      Volume: 6661 Pages: 309-322

  • [Journal Article] Sparse Substring Pattern Set Discovery using Linear Programming Boosting2010

    • Author(s)
      Kazuaki Kashihara, Kohei Hatano, Hideo Bannai, Masayuki Takeda
    • Journal Title

      Proc. 13th International Conference on Discovery Science (DS 2010)

      Volume: 6332 Pages: 132-143

  • [Presentation] Simpler and Faster Lempel Ziv Factorization2013

    • Author(s)
      Keisuke Goto and Hideo Bannai
    • Organizer
      Data Compression Conference 2013 (DCC 2013)
    • Place of Presentation
      Snowbird, USA.
    • Year and Date
      20130300
  • [Presentation] From Run Length Encoding to LZ78 and Back Again2013

    • Author(s)
      Yuya Tamakoshi, Tomohiro I, Shunsuke Inenaga, Hideo Bannai, and Masayuki Takeda
    • Organizer
      Data Compression Conference 2013 (DCC 2013)
    • Place of Presentation
      Snowbird, USA.
    • Year and Date
      20130300
  • [Presentation] Computing convolution on grammar-compressed text2013

    • Author(s)
      Toshiya Tanaka, Tomohiro I, Shunsuke Inenaga, Hideo Bannai, and Masayuki Takeda
    • Organizer
      Data Compression Conference 2013 (DCC 2013)
    • Place of Presentation
      Snowbird, USA.
    • Year and Date
      20130300
  • [Presentation] Efficient LZ78 factorization of grammar compressed text2012

    • Author(s)
      Hideo Bannai, Shunsuke Inenaga, and Masayuki Takeda
    • Organizer
      19th International Symposium on String Processing and Information Retrieval (SPIRE 2012)
    • Place of Presentation
      Cartagena, Colombia
    • Year and Date
      20121000
  • [Presentation] Speeding up q-gram mining on grammar-based compressed texts2012

    • Author(s)
      Keisuke Goto, Hideo Bannai, Shunsuke Inenaga, Masayuki Takeda
    • Organizer
      23rd Annual Symposium on Combinatorial Pattern Matching (CPM 2012)
    • Place of Presentation
      Helsinki, Finland
    • Year and Date
      20120700
  • [Presentation] Computing q-gram Non-overlapping Frequencies on SLP Compressed Texts2012

    • Author(s)
      Keisuke Goto, Hideo Bannai, Shunsuke Inenaga, Masayuki Takeda
    • Organizer
      38th International Conference on Current Trends in Theory and Practice of Computer Science (SOFSEM 2012)
    • Place of Presentation
      spindleruv Mlyn, Czech Republic.
    • Year and Date
      20120100
  • [Presentation] Fast q-gram Mining on SLP Compressed Strings2011

    • Author(s)
      Keisuke Goto, Hideo Bannai, Shunsuke Inenaga, Masayuki Takeda
    • Organizer
      18th International Symposium on String Processing and Information Retrieval (SPIRE 2011)
    • Place of Presentation
      Pisa, Italy.
    • Year and Date
      20111000
  • [Presentation] Faster Subsequence and Don't-Care Pattern Matching on Compressed Texts2011

    • Author(s)
      Takanori Yamamoto, Hideo Bannai, Shunsuke Inenaga, Masayuki Takeda
    • Organizer
      22nd Annual Symposium on Combinatorial Pattern Matching (CPM 2011)
    • Place of Presentation
      Palermo, Italy.
    • Year and Date
      20110600
  • [Presentation] Sparse Substring Pattern Set Discovery using Linear Programming Boosting2010

    • Author(s)
      Kazuaki Kashihara, Kohei Hatano, Hideo Bannai, Masayuki Takeda
    • Organizer
      13th International Conference on Discovery Science (DS 2010)
    • Place of Presentation
      Canberra, Australia.
    • Year and Date
      20101000

URL: 

Published: 2014-08-29  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi