
1997 Fiscal Year Final Research Report Summary

Speedup of Text Database by Data Compression

Research Project

Project/Area Number 07558159
Research Category

Grant-in-Aid for Scientific Research (A)

Allocation Type Single-year Grants
Section Developmental Research
Research Field Computer Science
Research Institution Kyushu Institute of Technology

Principal Investigator

SHINOHARA Takeshi  Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Department of Artificial Intelligence, Professor (60154225)

Co-Investigator (Kenkyū-buntansha) FUKAMACHI Shuichi  Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Department of Artificial Intelligence, Research Associate (30274559)
SHIMOZONO Shinichi  Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Department of Artificial Intelligence, Associate Professor (70243988)
ISHIZAKA Hiroki  Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Department of Artificial Intelligence, Associate Professor (70260726)
Project Period (FY) 1995 – 1997
Keywords Information Retrieval / Sequential Pattern Matching / Data Compression / Text Database
Research Abstract

The objective of this research is to establish a method for speeding up sequential pattern matching by means of data compression and to demonstrate its effectiveness for text databases.
We design a pattern matching machine that searches text compressed with Huffman codes without decoding it. In experiments with this algorithm on English text, the text size and the search response time were reduced to 60% and 70% of their original values, respectively, although the gain depends on the characteristics of the data.
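The idea of searching Huffman-coded text without decoding can be illustrated with a minimal Python sketch. It is not the project's pattern matching machine: it simply encodes the pattern with the same code and compares bits at each codeword boundary, so that a bit-level match that starts in the middle of a codeword is never reported. The function names (`huffman_code`, `encode`, `search_compressed`) and the use of `'0'`/`'1'` strings in place of packed bits are assumptions made for readability.

```python
import heapq
from collections import Counter

def huffman_code(text):
    """Build a prefix code {symbol: bit string} from symbol frequencies."""
    freq = Counter(text)
    if len(freq) == 1:                      # degenerate one-symbol alphabet
        return {next(iter(freq)): "0"}
    # Heap entries: (weight, tiebreaker, {symbol: partial code}).
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(sorted(freq.items()))]
    heapq.heapify(heap)
    tiebreak = len(heap)
    while len(heap) > 1:
        w1, _, c1 = heapq.heappop(heap)
        w2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + b for s, b in c1.items()}
        merged.update({s: "1" + b for s, b in c2.items()})
        heapq.heappush(heap, (w1 + w2, tiebreak, merged))
        tiebreak += 1
    return heap[0][2]

def encode(text, code):
    """Concatenate the codewords of all symbols (hypothetical helper)."""
    return "".join(code[ch] for ch in text)

def search_compressed(bits, pattern, code):
    """Return symbol indices of pattern occurrences, reading only the bits."""
    target = encode(pattern, code)          # pattern symbols must be in code
    codewords = set(code.values())
    hits = []
    pos = 0      # bit offset of the current codeword boundary
    index = 0    # number of symbols that precede that boundary
    while pos < len(bits):
        # A match is only valid when it starts on a codeword boundary.
        if bits.startswith(target, pos):
            hits.append(index)
        # Skip one codeword; the prefix property makes this unambiguous.
        end = pos + 1
        while end < len(bits) and bits[pos:end] not in codewords:
            end += 1
        pos = end
        index += 1
    return hits
```

A call such as `search_compressed(encode(text, code), "abra", code)` with `code = huffman_code(text)` returns the same positions as a search over the plain text, while the scan itself touches only the shorter compressed bit sequence. An automaton-based design would fold the boundary tracking and the comparison into a single pass over the bits.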
We also design a similar technique for a new compression scheme called Byte-Pair-Encoding (BPE for short). This technique compresses English text to around 50% and reduces search time to 60%. Because BPE is basically a fixed-length code, text compressed by BPE can be distributed efficiently among processors in a parallel environment.
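The fixed-length property of BPE can be sketched in Python as follows. This is a generic byte pair encoder written for illustration, not the project's implementation: it repeatedly replaces the most frequent adjacent byte pair with a byte value that does not occur in the input. The names `bpe_compress`, `bpe_expand`, and `split_chunks`, and the `max_merges` limit, are assumptions.

```python
from collections import Counter

def bpe_compress(data: bytes, max_merges: int = 64):
    """Return (compressed bytes, table) where table maps new byte -> pair."""
    seq = list(data)
    present = set(seq)
    unused = [b for b in range(256) if b not in present]
    table = {}
    for _ in range(max_merges):
        if not unused or len(seq) < 2:
            break
        (a, b), count = Counter(zip(seq, seq[1:])).most_common(1)[0]
        if count < 2:                 # no pair repeats, nothing to gain
            break
        new = unused.pop()
        table[new] = (a, b)
        out, i = [], 0
        while i < len(seq):           # left-to-right pair replacement
            if i + 1 < len(seq) and seq[i] == a and seq[i + 1] == b:
                out.append(new)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
    return bytes(seq), table

def bpe_expand(data: bytes, table) -> bytes:
    """Expand every byte back to the original bytes it stands for."""
    out = bytearray()
    stack = list(reversed(data))
    while stack:
        b = stack.pop()
        if b in table:                # substituted byte: push its pair
            left, right = table[b]
            stack.append(right)
            stack.append(left)
        else:
            out.append(b)
    return bytes(out)

def split_chunks(data: bytes, n: int):
    """Cut the compressed text into at most n pieces at byte offsets."""
    size = max(1, -(-len(data) // n))
    return [data[i:i + size] for i in range(0, len(data), size)]
```

Every token in the compressed text is exactly one byte, so any byte offset is a valid token boundary: each chunk returned by `split_chunks` can be expanded or scanned on its own, with no bit-level resynchronization. A parallel search built on this would still have to handle occurrences that straddle two chunks, for example by overlapping the chunks slightly.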

  • Research Products

    (14 results)

All Publications (14 results)

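  • [Publications] 宮崎 哲司: "Design of a pattern matching machine for compressed Japanese text" Proceedings of the 51st National Convention of the Information Processing Society of Japan. 4. 239-240 (1995)

    • Description
      From "Final Research Report Summary (Japanese)"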
  • [Publications] Shuichi Fukamachi: "Lossy data compression for string pattern matching" IEICE Technical Report. 95. 41-48 (1995)

    • Description
      From "Final Research Report Summary (Japanese)"
  • [Publications] Michiyo Yamaguchi: "Finding minimal generalization over regular patterns with alphabet indexing" Proc. the 7th Workshop on Genome Informatics. 51-60 (1996)

    • Description
      From "Final Research Report Summary (Japanese)"
  • [Publications] Takeshi Shinohara: "Inductive inference of unbounded unions of pattern languages from positive data" Proc. the 7th International Workshop on Algorithmic Learning Theory (Lecture Notes in Artificial Intelligence). 1160. 256-271 (1996)

    • Description
      From "Final Research Report Summary (Japanese)"
  • [Publications] Naoyuki Harada: "A class of elementary formal systems that has an efficient parsing algorithm" The 7th European-Japanese Conference on Information Modelling and Knowledge Bases. 89-101 (1997)

    • Description
      From "Final Research Report Summary (Japanese)"
  • [Publications] Hiroki Arimura: "Learning unions of tree patterns using queries" Theoretical Computer Science (Netherlands). 185. 47-62 (1997)

    • Description
      From "Final Research Report Summary (Japanese)"
  • [Publications] K.P.Jantke, T.Shinohara, T.Zeugmann (Eds.): "Algorithmic Learning Theory" Springer-Verlag, 319 (1995)

    • Description
      From "Final Research Report Summary (Japanese)"
  • [Publications] H.Arimura, T.Shinohara: "Logical generalization for learning with background knowledge" ICLP '95 Post-Conference Workshop on Inductive Logic Programming. IA-TR-95-03. 1-11 (1995)

    • Description
      From "Final Research Report Summary (English)"
  • [Publications] T.Shinohara, H.Arimura: "Inductive inference of unbounded unions of pattern languages from positive data" Proc. the 7th International Workshop on Algorithmic Learning Theory (Lecture Notes in Artificial Intelligence 1160). 256-271 (1996)

    • Description
      From "Final Research Report Summary (English)"
  • [Publications] M.Yamaguchi, S.Shimozono, T.Shinohara: "Finding minimal generalization over regular patterns with alphabet indexing" Proc. the 7th Workshop on Genome Informatics. 51-60 (1996)

    • Description
      From "Final Research Report Summary (English)"
  • [Publications] S.Matsumoto, A.Shinohara, H.Arimura, T.Shinohara: "Learning subsequence languages" Information Modelling and Knowledge Bases, VIII,IOS Press. 335-344 (1997)

    • Description
      From "Final Research Report Summary (English)"
  • [Publications] N.Harada, S.Arikawa, H.Ishizaka: "A class of elementary formal systems that has an efficient parsing algorithm" Information Modelling and Knowledge Bases, VIII, IOS Press. 89-101 (1997)

    • Description
      From "Final Research Report Summary (English)"
  • [Publications] H.Arimura, H.Ishizaka, T.Shinohara: "Learning unions of tree patterns using queries" Theoretical Computer Science (Netherlands). 47-62 (1997)

    • Description
      From "Final Research Report Summary (English)"
  • [Publications] K.P.Jantke, T.Shinohara, T.Zeugmann (Eds.): "Algorithmic Learning Theory" (Lecture Notes in Artificial Intelligence 997), Springer-Verlag, 319 (1995)

    • Description
      From "Final Research Report Summary (English)"


Published: 1999-03-16  
