• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Recognition of Presentation by Integration of Visual and Linguistic Information

Research Project

Project/Area Number 06452396
Research Category

Grant-in-Aid for General Scientific Research (B)

Allocation TypeSingle-year Grants
Research Field Intelligent informatics
Research InstitutionUniversity of Tsukuba

Principal Investigator

OHTA Yuichi  Univ.of Tsukuba, Inst.of Inf.Sci., Professor, 電子・情報工学系, 教授 (50115804)

Co-Investigator(Kenkyū-buntansha) NAKAMURA Yuichi  Univ.of Tsukuba, Inst.of Inf.Sci., Assistant Professor, 電子・情報工学系, 講師 (40227947)
Project Period (FY) 1994 – 1995
Project Status Completed (Fiscal Year 1995)
Budget Amount *help
¥5,500,000 (Direct Cost: ¥5,500,000)
Fiscal Year 1995: ¥1,700,000 (Direct Cost: ¥1,700,000)
Fiscal Year 1994: ¥3,800,000 (Direct Cost: ¥3,800,000)
KeywordsVideo Indexing / Gesture Dialog Relation / Human Behavior Understanding / Image Understanding / Media Integration / Natural Language Processing / Human Interface / Presentation / 画像認識 / プレゼンテーションの理解 / 動画像処理 / ジェスチャー認識 / 自然言語とパターン情報の統合
Research Abstract

In the understanding of human behaviors, there is a lot of ambiguities which depend on the situation. It is because few strict laws or rules are applicable throughout a wide variety of situations. We have examined the relationship between the gestures and the context of situation which typically represented in the spoken dialog. We proposed a novel framework to understand the behaviro in oral presentations.
1. Human behavior understanding in oral presentation : We developed a method to extract visual keys from presentation images. We also developed a method to extract linguistic keys from spoken words in the presentation. A novel framework was developed to integrate the both keys to resolve the intention of the presenter.
2. Temporal structure analysis of video by image and sound processing : We have developed a method to estimate the temporal structure of a video sequence considering the contents and the intention of the author. it uses the visual and sound keys. Television commercials are used as the target presentation images.
3. Knowledge extraction from diagram and text : We developed a new framework for knowledge extraction from written texts and diagrams and utilization of the obtained knowledge for the automatic organization of flexible hyper-media.

Report

(3 results)
  • 1995 Annual Research Report   Final Research Report Summary
  • 1994 Annual Research Report
  • Research Products

    (23 results)

All Other

All Publications (23 results)

  • [Publications] 中村裕一: "プレゼンテーション映像における話者の行動理解" 信学技報 パターン認識・理解. 95-143. 51-56 (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] 上堀幸代: "画像特徴を音響特徴を利用したCM映像の自動的構造化手法" 信学技報 パターン認識・理解. 95-159. 9-12 (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] 向川康博: "Synthesis of Arbitrarily Iriented Face Views from Two Omages" Asian Conference on Computer Vision, Singapore. 3. 718-722 (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] 角 保志: "Detection of Face Orientation and Facial Components Using Distributed Appearance Model" Int. Workshop on Automatic Face-and Gesture Recognition. 254-259 (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] 中村裕一: "Knowledge Extraction from Diagram and Text for Media Integration" Proc. Int. Conference on Multimedia Computing and Systems. (to be published). (1995)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] 中村裕一: "認識と生成を双方向に行なうための多重解像度表現 --ウェーブレット極値による形状生成/編集--" 信学技報 パターン認識・理解. 95-172. 39-46 (1996)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] Yuichi Nakamura, Masashi Nishitani, Yuichi Ohta: "Human Behavior Understanding in Oral Presentation" IEICE Technical Report SIG-PRU. Vol.95-143. 51-56 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] Yukiyo Uehori, Mitsuhiro Murata, Yuichi Nakamura, Yuichi Ohta: "Temporal Structure Analysis of Television Commercial by Image and Sound Processing" IEICE Technical Report SIG-PRU. Vol.95-159. 9-12 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] Yasuhiro Mukaigawa, Yuichi Nakamura, Yuichi Ohta: "Synthesis of Arbitrarily Oriented Face Views from Two Images" Asian Conf.On Computer Vision. Vol.3. 718-722 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] Yasushi Sumi, Yuichi Ohta: "Detection of Face Orientation and Facial Components Using Distributed Appearance Model" Int.Workshop Automatic Face-and Gesture Recognition. 254-259 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] Yuichi Nakamura, Miwa Takahashi, Masayuki Onda, Yuichi Ohta: "Knowledge Extraction from Diagram and Text for Media Integration" IEEE Multimedia Computing and Systems. (to be published). (1996)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] Yuichi Nakamura, Yuichi Ohta: "Multiresolutional Pattern Description for Bi-directional Analysis" IEICE Technical Report SIG-PRU. Vol.95-172. 39-46 (1995)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      1995 Final Research Report Summary
  • [Publications] 中村 裕一: "プレゼンテーション映像における話者の行動理解" 信学技報 パターン認識・理解. 95-143. 51-56 (1995)

    • Related Report
      1995 Annual Research Report
  • [Publications] 上堀 幸代: "画像特徴を音響特徴を利用したCM映像の自動的構造化手法" 信学技報 パターン認識・理解. 95-159. 9-12 (1995)

    • Related Report
      1995 Annual Research Report
  • [Publications] 向川 康博: "Synthesis of Arbitrarily Oriented Face Views from Two Images" Asian Conference on Computer Vision,Singapore. 3. 718-722 (1995)

    • Related Report
      1995 Annual Research Report
  • [Publications] 角 保志: "Detection of Face Orientation and Facial Components Using Distributed Appearance Model" Int. Workshop on Automatic Face-and Gesture Recognition. 254-259 (1995)

    • Related Report
      1995 Annual Research Report
  • [Publications] 中村 裕一: "Knowledge Extraction from Diagram and Text for Media Integration" Proc. Int. Conference on Multimedia Computing and Systems. (to be published). (1996)

    • Related Report
      1995 Annual Research Report
  • [Publications] 中村 裕一: "認識と生成を双方向に行なうための多重解像度表現-ウェーブレット極値による形状生成/編集-" 信学技報 パターン認識・理解. 95-172. 39-46 (1995)

    • Related Report
      1995 Annual Research Report
  • [Publications] Satoh,K.,and Ohta,Y.: "Passive Depth Acquisition for 3D Image Displays" IEICE Trans.on Information and Systems. E77-D. 949-957 (1994)

    • Related Report
      1994 Annual Research Report
  • [Publications] 角 保志,大田 友一: "分散型2次元モデルに基づく顔画像の解析" 電子情報通信学会論文誌. 77D-II. 2342-2352 (1994)

    • Related Report
      1994 Annual Research Report
  • [Publications] 村田 充弘,中村 裕一,大田 友一: "画像におけるカット変わりの自動検出" 情報処理学会第50回(平成7年前期)全国大会講演予稿集. (掲載予定). 6D-7 (1995)

    • Related Report
      1994 Annual Research Report
  • [Publications] 村田 充弘,中村 裕一,大田 友一: "画像処理を用いた映像の時間的構造化" 情報処理学会第50回(平成7年前期)全国大会講演予稿集. (掲載予定). 6D-8 (1995)

    • Related Report
      1994 Annual Research Report
  • [Publications] 西谷 正志,中村 裕一,大田 友一: "プレゼンテーション映像における発話内容を用いた話者の動作理解" 情報処理学会第50回(平成7年前期)全国大会講演予稿集. (掲載予定). 7D-4 (1995)

    • Related Report
      1994 Annual Research Report

URL: 

Published: 1994-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi