Project/Area Number |
07558159
|
Research Category |
Grant-in-Aid for Scientific Research (A)
|
Allocation Type | Single-year Grants |
Section | Developmental Research |
Research Field |
Computer Science
|
Research Institution | Kyushu Institute of Technology |
Principal Investigator |
SHINOHARA Takeshi Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Department of Artificial Intelligence, Professor (60154225)
|
Co-Investigator(Kenkyū-buntansha) |
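FUKAMACHI Shuichi Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Department of Artificial Intelligence, Research Associate (30274559)
SHIMOZONO Shinichi Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Department of Artificial Intelligence, Associate Professor (70243988)
ISHIZAKA Hiroki Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Department of Artificial Intelligence, Associate Professor (70260726)
SUGIMOTO Noriko Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Educational Affairs Staff (80271120)
ARIMURA Hiroki Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Associate Professor (20222763)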
|
Project Period (FY) |
1995 – 1997
|
Project Status |
Completed (Fiscal Year 1997)
|
Budget Amount |
¥3,700,000 (Direct Cost: ¥3,700,000)
Fiscal Year 1997: ¥1,300,000 (Direct Cost: ¥1,300,000)
Fiscal Year 1996: ¥2,400,000 (Direct Cost: ¥2,400,000)
|
Keywords | Information Retrieval / Sequential Pattern Matching / Data Compression / Text Database |
Research Abstract |
The objective of this research is to establish a method for speeding up sequential pattern matching by means of data compression and to demonstrate its usefulness for text databases. We designed a pattern matching machine that searches data compressed with Huffman codes directly, without decoding it. In experiments with this algorithm, although the effect depends on the characteristics of the data, the text size and the search response time for English text were reduced to 60% and 70% of their original values, respectively. We also designed a similar technique for a newer compression scheme called Byte-Pair Encoding (BPE for short). This technique compresses English text to around 50% of its original size and reduces search time to 60%. BPE is basically a fixed-length code, so text compressed with BPE can be distributed efficiently across processors in a parallel environment.
|
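The abstract does not spell out the algorithms, so the following is only a minimal sketch of the general idea of searching BPE-compressed text without decoding it. It assumes a textbook pair-replacement compressor and a KMP-style automaton whose transitions are precomputed per compressed symbol. The function names (`bpe_compress`, `bpe_expand`, `kmp_automaton`, `count_in_compressed`) and the `max_rules` parameter are hypothetical, and new symbols are numbered from 256 upward for simplicity rather than packed into unused byte values as a fixed-length byte code would be.

```python
from collections import Counter

def bpe_compress(data: bytes, max_rules: int = 64):
    """Repeatedly replace the most frequent adjacent pair with a new symbol."""
    seq = list(data)        # symbols 0..255 are literal bytes
    rules = {}              # new symbol -> (left symbol, right symbol)
    next_sym = 256          # simplification: symbols are ints, not bytes
    for _ in range(max_rules):
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break
        rules[next_sym] = pair
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(next_sym)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
        next_sym += 1
    return seq, rules

def bpe_expand(seq, rules) -> bytes:
    """Decode a symbol sequence back to bytes (used only to check the sketch)."""
    out, stack = bytearray(), list(reversed(seq))
    while stack:
        s = stack.pop()
        if s < 256:
            out.append(s)
        else:
            left, right = rules[s]
            stack.append(right)
            stack.append(left)
    return bytes(out)

def kmp_automaton(pattern: bytes):
    """DFA over bytes: delta[state][byte] -> state; state len(pattern) is a match."""
    m = len(pattern)        # assumes a non-empty pattern
    delta = [[0] * 256 for _ in range(m + 1)]
    delta[0][pattern[0]] = 1
    x = 0                   # state reached by the longest proper border so far
    for q in range(1, m + 1):
        delta[q] = list(delta[x])
        if q < m:
            delta[q][pattern[q]] = q + 1
            x = delta[x][pattern[q]]
    return delta

def count_in_compressed(seq, rules, pattern: bytes) -> int:
    """Count (possibly overlapping) occurrences by scanning compressed symbols only."""
    m = len(pattern)
    delta = kmp_automaton(pattern)
    jump, hits = {}, {}     # per symbol: next state and matches found, by start state
    for c in range(256):
        jump[c] = [delta[q][c] for q in range(m + 1)]
        hits[c] = [1 if t == m else 0 for t in jump[c]]
    for s in sorted(rules): # a rule only refers to symbols created before it
        a, b = rules[s]
        jump[s] = [jump[b][jump[a][q]] for q in range(m + 1)]
        hits[s] = [hits[a][q] + hits[b][jump[a][q]] for q in range(m + 1)]
    state = total = 0
    for s in seq:           # one table lookup per compressed symbol
        total += hits[s][state]
        state = jump[s][state]
    return total

if __name__ == "__main__":
    text = b"abracadabra abracadabra, abracadabra!"
    seq, rules = bpe_compress(text)
    print(len(text), len(seq), count_in_compressed(seq, rules, b"abra"))
```

In this sketch the scan performs one table lookup per compressed symbol, which is one plausible reading of why search time shrinks along with the text; the timings reported in the abstract come from the project's own implementation, not from this code.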