
Studies on Summarization of Multimedia Contents Based on Relational Structure between Text, Tables and Images

Research Project

Project/Area Number 13680452
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Intelligent informatics
Research Institution Kyushu Institute of Technology

Principal Investigator

ENDO Tsutomu  Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Professor (10112294)

Co-Investigator (Kenkyū-buntansha) SHIMADA Kazutaka  Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Research Associate (50346863)
TOKUHISA Masato  Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Research Associate (10274557)
Project Period (FY) 2001 – 2003
Project Status Completed (Fiscal Year 2003)
Budget Amount
¥2,400,000 (Direct Cost: ¥2,400,000)
Fiscal Year 2003: ¥600,000 (Direct Cost: ¥600,000)
Fiscal Year 2002: ¥600,000 (Direct Cost: ¥600,000)
Fiscal Year 2001: ¥1,200,000 (Direct Cost: ¥1,200,000)
Keywords multimedia document / WWW / summarization / information integration / information retrieval / information extraction / Kansei information / document generation
Research Abstract

This research aims to develop a system that summarizes product (PC) information retrieved from Web sites, based on the relational structure between text, tables and images, and presents products that suit a user's request.
1. Extraction of product specifications from HTML documents.
We proposed a method for extracting specifications from HTML documents using TSVMs (Transductive Support Vector Machines). The elements of a feature vector are keywords with normalized TF-DF weighting. We achieved 95% recall with 99% precision.
2. Characteristic-data extraction and support system for PC selection.
The specifications written in HTML are converted into a normal form called a table structure. Quantitative attributes are extracted by comparing them with the mean or mode of all sample data, and qualitative ones are extracted using manually provided knowledge. The recommended PCs are determined dynamically from the extracted data according to a user's request and relevance feedback. Moreover, a radar chart and Japanese sentences are generated from the specifications.
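As an illustration of how quantitative attributes might be flagged by comparison with the sample mean, the following is a minimal sketch, not the project's implementation. The attribute names, the sample values, the use of the standard deviation, and the `threshold` parameter are all assumptions introduced for this example; the abstract states only that values are compared with the mean or mode of all sample data.

```python
from statistics import mean, pstdev


def characteristic_attributes(product, samples, threshold=1.0):
    """Return the attributes of `product` that deviate notably from the sample mean.

    `threshold` (in standard deviations) is a hypothetical cut-off,
    not a value taken from the project.
    """
    result = {}
    for attr, value in product.items():
        values = [s[attr] for s in samples]
        mu = mean(values)
        sigma = pstdev(values)
        if sigma == 0:
            # Every product shares this value, so it is not characteristic.
            continue
        z = (value - mu) / sigma
        if abs(z) >= threshold:
            result[attr] = "above average" if z > 0 else "below average"
    return result


# Hypothetical specification data for four PCs.
samples = [
    {"cpu_ghz": 1.0, "memory_mb": 256, "weight_kg": 3.0},
    {"cpu_ghz": 1.2, "memory_mb": 256, "weight_kg": 3.2},
    {"cpu_ghz": 1.1, "memory_mb": 256, "weight_kg": 2.8},
    {"cpu_ghz": 2.0, "memory_mb": 256, "weight_kg": 1.2},
]

print(characteristic_attributes(samples[3], samples))
```

In this sketch the fourth PC is reported as having an above-average CPU clock and a below-average weight, while memory is skipped because all samples share the same value.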
3. Classification of images and feature extraction.
We proposed a method for classifying the contents of images using weighted keywords extracted from their neighboring sentences. We achieved 79% accuracy with TF-IDF weighting. We also developed a system that eliminates the background from a PC image and classifies the color of the PC using C4.5.
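For readers unfamiliar with TF-IDF weighting, the sketch below shows one common formulation applied to keywords taken from sentences near an image. It is an assumption-laden example rather than the project's method: the tokenized sentences are invented, the function name `tf_idf` is hypothetical, and the exact TF and IDF variants used in the project are not given in the abstract.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Compute TF-IDF weights for a list of token lists.

    Returns one {term: weight} dict per document, using relative term
    frequency and idf = log(N / df). This is one common variant, chosen
    here for illustration only.
    """
    n = len(documents)
    df = Counter()
    for tokens in documents:
        df.update(set(tokens))  # count each term once per document
    weights = []
    for tokens in documents:
        tf = Counter(tokens)
        weights.append(
            {t: (c / len(tokens)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return weights


# Hypothetical keywords from the sentences neighboring three images.
neighbors = [
    ["notebook", "pc", "front", "view"],
    ["pc", "keyboard", "layout"],
    ["pc", "campaign", "banner"],
]

w = tf_idf(neighbors)
print(w[0])
```

A term such as "pc" that appears next to every image receives a weight of zero, so it contributes nothing to distinguishing image classes, whereas a term unique to one image's neighborhood receives a positive weight.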

Report

(4 results)
  • 2003 Annual Research Report   Final Research Report Summary
  • 2002 Annual Research Report
  • 2001 Annual Research Report
  • Research Products

    (30 results)


All Publications (30 results)

  • [Publications] Kazutaka Shimada, Tetsuro Ito, Tsutomu Endo: "Classification of Images Using Their Neighboring Sentences" Proceedings of PACLING2001 (Pacific Association for Computational Linguistics 2001). 250-256 (2001)

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Atsushi Fukumoto, Tsutomu Endo, Kazutaka Shimada: "Information Extraction from Specifications on the World Wide Web" Proceedings of PACLING2001 (Pacific Association for Computational Linguistics 2001). 109-116 (2001)

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Atsushi Fukumoto, Tsutomu Endo: "Information Extraction from Personal Computer Specifications on the Web Using a User's Request" IEICE Transactions on Information and Systems. Vol.E86-D, No.8. 1386-1395 (2003)

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Tetsuro Ito, Tsutomu Endo: "Multiform Summarization from Product Specifications" Proceedings of PACLING2003 (Pacific Association for Computational Linguistics 2003). 83-92 (2003)

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Koji Hayashi, Tsutomu Endo: "Keyword and Weighting for Product Specifications Extraction" Proceedings of PACLING2003 (Pacific Association for Computational Linguistics 2003). 285-293 (2003)

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Tsutomu Endo, et al.: "Information Modeling and Knowledge Bases XV" IOS Press. 333 (2004)

    • Description
      From "Final Research Report Summary (Japanese)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Tetsuro Ito, Tsutomu Endo: "Classification of Images Using Their Neighboring Sentences" Proceedings of PACLING2001 (Pacific Association for Computational Linguistics 2001). 250-256 (2001)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Atsushi Fukumoto, Tsutomu Endo, Kazutaka Shimada: "Information Extraction from Specifications on the World Wide Web" Proceedings of PACLING2001 (Pacific Association for Computational Linguistics 2001). 109-116 (2001)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Atsushi Fukumoto, Tsutomu Endo: "Information Extraction from Personal Computer Specifications on the Web Using a User's Request" IEICE Transactions on Information and Systems. Vol.E86-D, No.8. 1386-1395 (2003)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Tetsuro Ito, Tsutomu Endo: "Multiform Summarization from Product Specifications" Proceedings of PACLING2003 (Pacific Association for Computational Linguistics 2003). 83-92 (2003)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Koji Hayashi, Tsutomu Endo: "Keyword and Weighting for Product Specifications Extraction" Proceedings of PACLING2003 (Pacific Association for Computational Linguistics 2003). 285-293 (2003)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Tsutomu Endo: "Product Specifications Summarization and Product Ranking System using User's Request" Information Modeling and Knowledge Bases, IOS Press. 333 (2004)

    • Description
      From "Final Research Report Summary (English)"
    • Related Report
      2003 Final Research Report Summary
  • [Publications] Kazutaka Shimada, Atsushi Fukumoto, Tsutomu Endo: "Information Extraction from Personal Computer Specifications on the Web Using a User's Request" IEICE Transactions on Information and Systems. Vol.E86-D, No.8. 1386-1395 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Kazutaka Shimada, Tetsuro Ito, Tsutomu Endo: "Multiform Summarization from Product Specifications" Proceedings of PACLING2003 (Pacific Association for Computational Linguistics 2003). 83-92 (2003)

    • Related Report
      2003 Annual Research Report
  • [Publications] Kazutaka Shimada, Koji Hayashi, Tsutomu Endo: "Keyword and Weighting for Product Specifications Extraction" Proceedings of PACLING2003 (Pacific Association for Computational Linguistics 2003). 285-293 (2003)

    • Related Report
      2003 Annual Research Report
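  • [Publications] Tsutomu Endo: "Context Information Processing for Dialogue-Supported Problem Solving" Research Report of the Japanese Society for Information and Systems in Education. Vol.18 No.3. 23-28 (2003)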

    • Related Report
      2003 Annual Research Report
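  • [Publications] 関恒仁, Kazutaka Shimada, Tsutomu Endo: "Attribute Clustering in Product Specification Tables on the Web" Proceedings of the 11th IEICE Kyushu Section Student Branch Conference. 113 (2003)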

    • Related Report
      2003 Annual Research Report
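  • [Publications] Koji Hayashi, Kazutaka Shimada, Tsutomu Endo: "Classification and Extraction of Product Specification Tables from the WWW Using Machine Learning" Proceedings of the 10th Annual Meeting of the Association for Natural Language Processing. 733-736 (2004)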

    • Related Report
      2003 Annual Research Report
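  • [Publications] Koji Hayashi, Kazutaka Shimada, Tsutomu Endo: "Keyword Acquisition and Weighting for Extracting Specification Tables from the WWW" IEICE Technical Report TL2002-48〜53 [Thought and Language]. Vol.102 No.688. 13-18 (2003)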

    • Related Report
      2002 Annual Research Report
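  • [Publications] Koji Hayashi, Kazutaka Shimada, Tsutomu Endo: "Extraction of Product Specification Tables from the WWW" Proceedings of the 9th Annual Meeting of the Association for Natural Language Processing. 377-380 (2003)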

    • Related Report
      2002 Annual Research Report
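  • [Publications] 森松俊允, Atsushi Fukumoto, Kazutaka Shimada, Tsutomu Endo: "Construction of a Product Selection Support System Using Product Specification Tables" Proceedings of the 55th Joint Conference of Electrical and Electronics Engineers in Kyushu. 502 (2002)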

    • Related Report
      2002 Annual Research Report
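  • [Publications] Atsushi Fukumoto, Kazutaka Shimada, Tsutomu Endo: "Generation of Table Structures from Product Specification Tables" Proceedings of the 55th Joint Conference of Electrical and Electronics Engineers in Kyushu. 503 (2002)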

    • Related Report
      2002 Annual Research Report
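  • [Publications] Koji Hayashi, Tsutomu Endo, Kazutaka Shimada: "Extraction of Product Specification Tables from Web Pages" 10th IEICE Kyushu Section Student Branch Conference. 126 (2002)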

    • Related Report
      2002 Annual Research Report
  • [Publications] 片山智央, Kazutaka Shimada, Tsutomu Endo: "Extraction of Product Regions from Images on the Web" 10th IEICE Kyushu Section Student Branch Conference. 141 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Kazutaka Shimada, Tetsuro Ito, Tsutomu Endo: "Classification of Images Using Their Neighboring Sentences" Proceedings of PACLING2001 (Pacific Association for Computational Linguistics 2001). 250-256 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Atsushi Fukumoto, Tsutomu Endo, Kazutaka Shimada: "Information Extraction from Specifications on the World Wide Web" Proceedings of PACLING2001 (Pacific Association for Computational Linguistics 2001). 109-116 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 賀川経夫, Tsutomu Endo: "Generation of Pen Operations in Multimodal Dialogue Using a Pen" IEICE Technical Report [TL2001-1〜8]. Vol.101 No.61. 9-16 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Kazutaka Shimada, Tetsuro Ito, Tsutomu Endo: "Dynamic Feature Extraction Using Input Question Sentences" IEICE Technical Report [TL2001-1〜8]. Vol.101 No.61. 43-50 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Kazutaka Shimada, Tetsuro Ito, Tsutomu Endo: "Feature Extraction from Tables: Application to Other Types of Data" IEICE Technical Report [TL2001-29〜34]. Vol.101 No.485. 27-34 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 賀川経夫, Tsutomu Endo: "A Study on the Analysis of Pen Operations Accompanying Utterances Targeting Text" IEICE Technical Report [TL2001-29〜34]. Vol.101 No.485. 35-42 (2001)

    • Related Report
      2001 Annual Research Report


Published: 2001-04-01   Modified: 2016-04-21  
