A Partial match technique for multi-attribute keys and its text retrieval application

Research Project

Project/Area Number	07558273
Research Category	Grant-in-Aid for Scientific Research (A)
Allocation Type	Single-year Grants
Section	Developmental Research
Research Field	Information Systems (including Information and Library Science)
Research Institution	The University of Tokushima
Principal Investigator	AOE Junichi, The University of Tokushima, Faculty of Engineering, Information Science, Professor (90108853)
Co-Investigator(Kenkyū-buntansha)	ONO Norihiko, The University of Tokushima, Faculty of Engineering, Information Science, Professor (60194594); SATO Takashi, Osaka-kyoiku University, Faculty of Education, Information Science, Associate Professor (20124117)
Project Period (FY)	1995 – 1997
Project Status	Completed (Fiscal Year 1997)
Budget Amount	¥3,100,000 (Direct Cost: ¥3,100,000); Fiscal Year 1997: ¥1,600,000 (Direct Cost: ¥1,600,000); Fiscal Year 1996: ¥1,500,000 (Direct Cost: ¥1,500,000)
Keywords	partial match / keyword search / multi-attribute keys / text database / information retrieval / multi-attribute search / document processing
Research Abstract	Extracting keywords efficiently is an important task in text retrieval systems. Japanese text contains many compound words made up of several kinds of characters (Katakana, Kanji, etc.), and it has no delimiters between words, so extracting keywords from such text takes a lot of time. This research presents a technique for detecting keywords within compound keywords by introducing a set of rules that represent multi-attribute conditions for keyword construction. A string pattern matching machine for a finite number of patterns is applied to match the rules and to store keyword candidates together with information about both long term and short term words. The approach is evaluated by theoretical analysis. Simulation results for 34 Japanese text files show that the presented algorithm runs at 19.4 ms/KB and that the ratio of expected keywords extracted increases compared with traditional approaches.
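The abstract does not give the rule format or the machine's construction, so the following is only a minimal sketch of the general idea, assuming the "string pattern matching machine for a finite number of patterns" is an Aho-Corasick-style automaton. The names `PatternMatcher`, `script_of`, and `extract_candidates`, the sample dictionary, and the use of character script as the attribute are all hypothetical and are not taken from the project.

```python
from collections import deque


def script_of(ch):
    """Classify one character by script (an illustrative attribute only)."""
    code = ord(ch)
    if 0x30A0 <= code <= 0x30FF:
        return "katakana"
    if 0x3040 <= code <= 0x309F:
        return "hiragana"
    if 0x4E00 <= code <= 0x9FFF:
        return "kanji"
    return "other"


class PatternMatcher:
    """Aho-Corasick-style machine for a finite set of patterns."""

    def __init__(self, patterns):
        self.goto = [{}]   # goto[state][symbol] -> next state
        self.fail = [0]    # failure links
        self.out = [[]]    # patterns recognised on entering a state
        for pattern in patterns:
            self._add(pattern)
        self._build_failure_links()

    def _add(self, pattern):
        state = 0
        for sym in pattern:
            if sym not in self.goto[state]:
                self.goto.append({})
                self.fail.append(0)
                self.out.append([])
                self.goto[state][sym] = len(self.goto) - 1
            state = self.goto[state][sym]
        self.out[state].append(pattern)

    def _build_failure_links(self):
        # Breadth-first, so shallower states are finished before deeper ones.
        queue = deque(self.goto[0].values())
        while queue:
            state = queue.popleft()
            for sym, nxt in self.goto[state].items():
                queue.append(nxt)
                f = self.fail[state]
                while f and sym not in self.goto[f]:
                    f = self.fail[f]
                self.fail[nxt] = self.goto[f].get(sym, 0)
                self.out[nxt] = self.out[nxt] + self.out[self.fail[nxt]]

    def find(self, text):
        """Return (start_index, pattern) for every occurrence, in one pass."""
        state, hits = 0, []
        for i, sym in enumerate(text):
            while state and sym not in self.goto[state]:
                state = self.fail[state]
            state = self.goto[state].get(sym, 0)
            for pattern in self.out[state]:
                hits.append((i - len(pattern) + 1, pattern))
        return hits


def extract_candidates(matcher, text):
    """Attach the set of scripts used by each hit as a simple attribute."""
    return [(start, word, sorted({script_of(c) for c in word}))
            for start, word in matcher.find(text)]
```

With a hypothetical dictionary such as `PatternMatcher(["情報", "検索", "情報検索", "システム"])`, calling `extract_candidates(matcher, "情報検索システム")` scans the undelimited compound once and reports both the short candidates and the longer one that contains them, each tagged with its script. A real rule set would filter these candidates further.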

Report

(4 results)

1997 Annual Research Report / Final Research Report Summary
1996 Annual Research Report
1995 Annual Research Report

Research Products
(27 results)

All Publications (27 results)

[Publications] M.Shishibori: "Design of a Compact Data Structure for the Patricia Trie" IEICE Trans. on Information and Systems. (in press). (1998)
- Description
  From the "Final Research Report Summary (Japanese)"
- Related Report
  1997 Final Research Report Summary
[Publications] H.Mochizuki: "A Substring Search Algorithm in Extendible Hashing" International Journal of Information Sciences. (in press). (1998)
- Description
  From the "Final Research Report Summary (Japanese)"
- Related Report
  1997 Final Research Report Summary
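[Publications] 有田健: "An Improved Method for Full-Text Retrieval Using Feature Vectors" Transactions of Information Processing Society of Japan. (in press). (1998)
- Description
  From the "Final Research Report Summary (Japanese)"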
- Related Report
  1997 Final Research Report Summary
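[Publications] 小山雅史: "An Efficient Algorithm for Determining Concept Hierarchies in Case Structure Analysis" Transactions of Information Processing Society of Japan. Vol.39, No.3. (1998)
- Description
  From the "Final Research Report Summary (Japanese)"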
- Related Report
  1997 Final Research Report Summary
[Publications] M.Fuketa: "An Efficient Algorithm for Retrieving Example Sentences" International Journal of Information Sciences. (in press). (1998)
- Description
  From the "Final Research Report Summary (Japanese)"
- Related Report
  1997 Final Research Report Summary
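[Publications] 泓田正雄: "An Efficient Algorithm for Retrieving Example Sentences from Large-Scale Document Data" Transactions of Information Processing Society of Japan. Vol.38, No.10. 2004-2013 (1997)
- Description
  From the "Final Research Report Summary (Japanese)"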
- Related Report
  1997 Final Research Report Summary
[Publications] J.Aoe, K.Morimoto, M.Shishibori and K-H.Park: "A Trie Compaction Algorithm for a Large Set of Keys" IEEE Transactions on Knowledge and Data Engineering. Vol.8, No.3. 476-491 (1996)
- Description
  From the "Final Research Report Summary (English)"
- Related Report
  1997 Final Research Report Summary
[Publications] M.Shishibori and J.Aoe: "Fast Allocation of Diagrams without Backtracking Processes" International Journal of Information Sciences. Vol.92, No.1-4. 65-85 (1996)
- Description
  From the "Final Research Report Summary (English)"
- Related Report
  1997 Final Research Report Summary
[Publications] T.Arita, M.Shishibori and J.Aoe: "An Efficient Algorithm for Full Text Retrieval for Multiple Keywords" International Journal of Information Sciences. Vol.104. 345-362 (1998)
- Description
  From the "Final Research Report Summary (English)"
- Related Report
  1997 Final Research Report Summary
[Publications] Masao Fuketa and Jun-ichi Aoe: "A Fast Algorithm of Retrieving Common Sentences." International Journal of Information Sciences. Vol.104 (in press). (1998)
- Description
  From the "Final Research Report Summary (English)"
- Related Report
  1997 Final Research Report Summary
[Publications] Masao Fuketa, Shoji Mizobuchi, Masami Shishibori and Jun-ichi Aoe: "An Efficient Algorithm for Retrieving Example Sentences." International Journal of Computer Mathematics. Vol.66, No.3-4 (in press). (1998)
- Description
  From the "Final Research Report Summary (English)"
- Related Report
  1997 Final Research Report Summary
[Publications] Masao Fuketa, Shoji Mizobuchi, and Jun-ichi Aoe: "A Fast Method of Determining Weighted Indexes from Text Databases" Information Processing and Management: An International Journal. (in press). (1998)
- Description
  From the "Final Research Report Summary (English)"
- Related Report
  1997 Final Research Report Summary
[Publications] H.Mochizuki, M.Koyama, M.Shishibori and J.Aoe: "A Substring Search Algorithm in Extendible Hashing" International Journal of Information Sciences. (in press). (1998)
- Description
  From the "Final Research Report Summary (English)"
- Related Report
  1997 Final Research Report Summary
[Publications] M.Shishibori, M.Okada, T.Sumitomo and J.Aoe: "Design of a Compact Data Structure for the Patricia Trie" IEICE Transactions on Information and Systems. (in press). (1998)
- Description
  From the "Final Research Report Summary (English)"
- Related Report
  1997 Final Research Report Summary
[Publications] M.Shishibori: "Design of a Compact Data Structure for the Patricia Trie" IEICE Trans. on Information and Systems. (in press). (1998)
- Related Report
  1997 Annual Research Report
[Publications] H.Mochizuki: "A Substring Search Algorithm in Extendible Hashing" International Journal of Information Sciences. (in press). (1998)
- Related Report
  1997 Annual Research Report
[Publications] 有田健: "An Improved Method for Full-Text Retrieval Using Feature Vectors" Transactions of Information Processing Society of Japan. (in press). (1998)
- Related Report
  1997 Annual Research Report
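[Publications] 小山雅史: "An Efficient Algorithm for Determining Concept Hierarchies in Case Structure Analysis" Transactions of Information Processing Society of Japan. Vol.39, No.3 (in press). (1998)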
- Related Report
  1997 Annual Research Report
[Publications] M.Fuketa: "An Efficient Algorithm for Retrieving Example Sentences" International Journal of Information Sciences. (in press). (1998)
- Related Report
  1997 Annual Research Report
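[Publications] 泓田正雄: "An Efficient Algorithm for Retrieving Example Sentences from Large-Scale Document Data" Transactions of Information Processing Society of Japan. Vol.38, No.10. 2004-2013 (1997)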
- Related Report
  1997 Annual Research Report
[Publications] J.AOE: "A Trie Compaction Algorithm for a Large Set of Keys" IEEE Transactions on Knowledge and Data Eng. (1996)
- Related Report
  1996 Annual Research Report
[Publications] H.Iriguchi: "A Fast Retrieval Technique for Large Graph Structures" International J. of Computer Mathematics. (1996)
- Related Report
  1996 Annual Research Report
[Publications] M.Shishibori: "An Order Searching Algorithm of Extensible Hashing" International J. of Computer Mathematics. (1996)
- Related Report
  1996 Annual Research Report
[Publications] J.AOE: "A Trie Compaction Algorithm for a Large Set of Keys" IEEE Transactions on Knowledge and Data Engineering. (to appear). (1996)
- Related Report
  1995 Annual Research Report
[Publications] H.Iriguchi: "A Fast Retrieval Technique for Large Graph Structures" International Journal of Computer Mathematics. (to appear). (1996)
- Related Report
  1995 Annual Research Report
[Publications] M.Shishibori: "An Order Searching Algorithm of Extensible Hashing" International Journal of Computer Mathematics. (to appear). (1996)
- Related Report
  1995 Annual Research Report
[Publications] K-H.Park: "An Automatic Selection Method of Key Search Algorithms" IEICE Transactions on Information and Systems. E78-D. 383-393 (1995)
- Related Report
  1995 Annual Research Report
