• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

A fast n-gram full text search independent of document size and it application to a huge text base

Research Project

Project/Area Number 10480082
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field 情報システム学(含情報図書館学)
Research InstitutionThe University of Tokushima

Principal Investigator

AOE Junichi  The University of Tokushima, Information Science, Professor, 工学部, 教授 (90108853)

Co-Investigator(Kenkyū-buntansha) SHISHIBORI Masami  The University of Tokushima, Information Science, Associate Professor, 工学部, 助教授 (50274262)
SATO Takashi  Osaka Kyoiku University, Information Science, Associate Professor, 教育学部, 助教授 (20124117)
KITA Kenji  The University of Tokushima, Information Science, Professor, 工学部, 教授 (10243734)
Project Period (FY) 1998 – 2001
Project Status Completed (Fiscal Year 2001)
Budget Amount *help
¥11,500,000 (Direct Cost: ¥11,500,000)
Fiscal Year 2001: ¥2,200,000 (Direct Cost: ¥2,200,000)
Fiscal Year 2000: ¥2,400,000 (Direct Cost: ¥2,400,000)
Fiscal Year 1999: ¥3,200,000 (Direct Cost: ¥3,200,000)
Fiscal Year 1998: ¥3,700,000 (Direct Cost: ¥3,700,000)
KeywordsText database / Retrieval method / Keyword Search / test retrieval / 文書データ / 全文検索
Research Abstract

Research results are evaluated as follows:
1. Improvement of dynamic n-gram storage structures
The ratio of improving approaches by Heisei 11, 12 and 13 became 25 %.
2. Compression evaluation for postings
Experimental results in the final year reaches 85 % compression to the previous approaches.
3. Retrieval Efficiency in partial Matching
The speed is improved about 30 % for the old in des tables.
4. Evaluation of practical text databases
It is verified that n-gram full text search presented in this search is independent of the size of documents.

Report

(5 results)
  • 2001 Annual Research Report   Final Research Report Summary
  • 2000 Annual Research Report
  • 1999 Annual Research Report
  • 1998 Annual Research Report
  • Research Products

    (33 results)

All Other

All Publications (33 results)

  • [Publications] M.Jung: "A Dynamic Construction Algorithm for the Compact tree"Information Processing & Management. 38. 221-236 (2002)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] S.Lee, J.Aoe: "Extraction of Field coherent passages"Information Processing & Management. 38. 173-207 (2002)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] EL-Sayed AHam: "Words Tendency Depending on Time-Series Variation"Information Processing & Management. 38. 157-171 (2002)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] 森田和宏: "ダブル配列における動的更新の効率化アルゴリズム"情報処理学会論文誌. 42. 2229-2238 (2001)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] Y.Yamakawa: "A Method for Improving Full Text Search Using Signature Files"Computer Mathematics. 77. 73-88 (2001)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] Jung,M., Aoe,J.: "A dynamic Construction Algorithm for the compact tree"Information Processing & Management. 38, No. 2. 221-236 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] Lee,S., Aoe,J.: "Extraction of Field Coherent Passages"Information Processing & Management. 38, No. 2. 173-207 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] Atlam,El-Sayed, Aoe,J.: "Words Tendancy Repending on Time-Series Variation"Information Processing & Management. 38, No. 2. 157-171 (2002)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] Morita,Kazuhiro, Aoe,J.: "Dynamic Updating Method of Double-Array Structures"Information Processing. 77, No. 5. 73-88 (2001)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] Yamakawa,Y., Aoe,J.: "A Method for Improving Full Text Search Using Signatures"Computer Mathematics. 42. 73-88 (2001)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] M.Jung: "A Dynamic Construction Algorithm forthe Compact tree"Information Processing & Management. 38. 221-236 (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] S, Lee: "Extraction of Field coherent passages"Information Processing Management. 38. 173-207 (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] EL-Sayed Atlam: "Words Tendency Depending on Time-series Variation"Information Processing & Management. 38. 157-171 (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] 森田和宏: "ダブル配列における動的更新の効率化アルゴリズム"情報処理学会論文誌. 42. 2229-2238 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Y.Yamakawa: "A Method for Improving Full Text Search Using Signature Files"Computer Mathematics. 77. 73-88 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] K.Morita: "Fast insertion methods of a double-array structure"Software Practice & Experience. 31・1. 43-65 (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] M.koyama: "A fast and compact technique of implementing transition tables for finite state automata"International Journal of Information Sciences. 129. 141-154 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] S.Mizobuchi: "An Efficient Representation for Implementing Finite State Machines Based on the Double-Array"International Journal of Information Sciences. 129. 119-139 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] M.Shishibori: "A Fast Correction Method for Erroneous Sentences Using the LR Parsing"IEICE Transactions on Information and Systems. E83-D・9. 1797-1804 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 獅々堀正幹: "多属性項目の履歴情報に基づくメイル文書のフィルタリング手法"情報処理学会論文誌. 141・8. 2299-2308 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] M.Fuketa: "A Document Classification Method by using Field Association Words"International Journal of Information Sciences. 126. 57-70 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 辻 孝子: "複合語の分野連想語の効率的決定法"自然言語処理. 7・2. 111-125 (2000)

    • Related Report
      1999 Annual Research Report
  • [Publications] E-S.Atlam: "Similarity Measurement Using Negative Weight Function"Information Processing & Management. (印刷中). (2000)

    • Related Report
      1999 Annual Research Report
  • [Publications] K.Morita: "A Link Trie Structure of Staring Multi-Attribute"International Journal of Computer Mathematics. 118・2. 145-157 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] 溝渕 昭二: "日本語時間表現の一解釈法"情報処理学会論文誌. 40・9. 3408-3419 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] M.Fuketa: "Efficient Controlling of Parsing-Stack Operation"International Journal of Information Sciences. 118・1. 145-157 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] N.Shishibori: "Two Improved Access Methods on Compact Binary Trees"Information Processing & Management. (印刷中). (2000)

    • Related Report
      1999 Annual Research Report
  • [Publications] M.Fuketa: "A Fast Method of Determining Weighted Compound Keywards from Text Databases" International Journal of Information Processing & Management. 34・4. 431-442 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] M.Koyama: "A Fast Retrieving Algorithm of Hierarchical Relationships" International Journal of Information Processing & Management. 34・6. 761-773 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] 森田和宏: "トライ構造を用いた共起情報の効率的検索アルゴリズム" 情報処理学会論文誌. 39・9. 2563-2571 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] K.Ando: "An Extended Pattern Matching Machine for Document Processing" Computer Processing of Oriental Languages. 11・3. 223-248 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] M.Fuketa: "A Fast Algorithm of Retrieving Common Sentences" International Journal of Information Sciences. 109・4. 265-279 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] H.Mochizuki: "A Substring Search Algorithm in Extensible Hashing" International Journal of Information Sciences. 108・4. 13-30 (1998)

    • Related Report
      1998 Annual Research Report

URL: 

Published: 1998-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi