• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Information Extraction and Retrieval from Lange Text Data

Research Project

Project/Area Number 08458081
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionKyushu Institute of Technology

Principal Investigator

NOMURA Hirosato  Kyushu Institute of Technology, Department of Artificial Intelligence, Professor, 情報工学部, 教授 (30208392)

Co-Investigator(Kenkyū-buntansha) NAGAI Hidetoshi  Kyushu Institute of Technology, Department of Artificial Intelligence, Assistant, 情報工学部, 助手 (60237485)
NAKAMURA Teigo  Kyushu Institute of Technology, Department of Artificial Intelligence, Lecturer, 情報工学部, 講師 (40198221)
Project Period (FY) 1996 – 1998
Project Status Completed (Fiscal Year 1998)
Budget Amount *help
¥7,600,000 (Direct Cost: ¥7,600,000)
Fiscal Year 1998: ¥1,700,000 (Direct Cost: ¥1,700,000)
Fiscal Year 1997: ¥2,000,000 (Direct Cost: ¥2,000,000)
Fiscal Year 1996: ¥3,900,000 (Direct Cost: ¥3,900,000)
KeywordsIntelligent Information Access / Information Extraction / Information Retrieval / Information Summarization / 自然言語処理 / テキスト処理 / 大量テキスト / 対話処理 / ファジィ理論
Research Abstract

This research concerns Information Extraction, Information Retrieval, and Information Summarization form a large text data set. The approach is based on a pattern-match processing which utilizes surface characteristics in linguistic representations. This method does not require any heavy linguistic processing and any deep analysis of semantic information while it results in high quality and high speed information processing for Information Extraction, Information Retrieval, and Information Sunmarization.
First of all, we elaborated on a Dialogue System for Information Retrieval. It is possible that a user's request is vague and unclear. We investigated a method for making clear the user's request by providing a cooperative and friendly navigation agent and by incorporating a processing strategy which applies a fuzzy calculation for disambiguation.
Second of all, we studied Information Extraction from News Articles. We investigated several useful strategies for designing templates for pattern-matching. We actually developed a large set of templates from 2000 news articles concerning new products.
Last of all, we analyzed linguistic characteristics of sentence endings and then proposed a semantic model for sentence types. By applying this investigation, we proposed a strategy for producing a text summarization by eliminating unimportant sentences and then combining remaining sentences as an article.
All of the experimental systems developed by ourselves are ready for demonstrations on the Web at out HomePage on the internet.

Report

(4 results)
  • 1998 Annual Research Report   Final Research Report Summary
  • 1997 Annual Research Report
  • 1996 Annual Research Report
  • Research Products

    (10 results)

All Other

All Publications (10 results)

  • [Publications] 高尾宜之、永井秀利、中村貞吾、野村浩郷: "複数製品の紹介記事からの製品情報抽出-製品記述パターンの分析-" 情報処理学会研究報告、自然言語処理研究会. 99・2. 117-124 (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] Y.Takao, H.Nagai, T.Nakamura, H.Nomura: "Information Extraction from Newspaper Articles of Multiple Products" Proc.of Natural Language Processsing Interest Group, Information Processing Society of Japan. Vol.99, No.2. 117-124 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1998 Final Research Report Summary
  • [Publications] 高尾宜之、永井秀利、中村貞吾、野村浩郷: "複数製品の紹介記事からの製品情報抽出ー製品記述パターンの分析ー" 情報処理学会研究報告 自然言語処理研究会. 99・2. 117-124 (1999)

    • Related Report
      1998 Annual Research Report
  • [Publications] 井出裕二: "単一項目テンプレートを用いた新聞記事からの製品情報抽出" 平成9年度電気関係学会九州支部連合大会論文集. 292-292 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 井出裕二: "単一項目テンプレートによる新聞記事からの製品情報抽出" 情報処理学会研究報告 自然言語処理研究会. 97・109 97-NL-122. 63-70 (1997)

    • Related Report
      1997 Annual Research Report
  • [Publications] 中村貞吾: "文タイプと文間関係に基づく要約処理" 言語処理学会第4回年次大会ワークショップ「テキスト要約の現状と将来」論文集. 50-55 (1998)

    • Related Report
      1997 Annual Research Report
  • [Publications] 井出裕二: "構造化テンプレートを用いた新聞記事からの製品情報抽出" 情報処理学会研究報告 自然言語処理研究会. 97・29 97-NL-118. 7-14 (1997)

    • Related Report
      1996 Annual Research Report
  • [Publications] 藤吉誠: "情報抽出処理のためのテンプレート作成" 電気関係学会九州支部連合大会講演論文集. No.1332. 694-694 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 井出裕二: "テンプレートを用いた新聞記事からの製品情報抽出システム" 情報処理学会研究報告 自然言語処理研究会. 96・87 96-NL-115. 83-90 (1996)

    • Related Report
      1996 Annual Research Report
  • [Publications] 野村浩郷: "電子化テキストコーパスの課題と展望" 情報処理学会「大規模テキストコーパスの作成と共有の問題点」シンポジウム. 1-6 (1996)

    • Related Report
      1996 Annual Research Report

URL: 

Published: 1996-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi