Recognition of Presentation by Integration of Visual and Linguistic Information

Research Project

Project/Area Number	06452396
Research Category	Grant-in-Aid for General Scientific Research (B)
Allocation Type	Single-year Grants
Research Field	Intelligent informatics
Research Institution	University of Tsukuba
Principal Investigator	OHTA Yuichi, Univ. of Tsukuba, Institute of Information Sciences and Electronics, Professor (50115804)
Co-Investigator(Kenkyū-buntansha)	NAKAMURA Yuichi, Univ. of Tsukuba, Institute of Information Sciences and Electronics, Assistant Professor (40227947)
Project Period (FY)	1994 – 1995
Project Status	Completed (Fiscal Year 1995)
Budget Amount	¥5,500,000 (Direct Cost: ¥5,500,000); Fiscal Year 1995: ¥1,700,000 (Direct Cost: ¥1,700,000); Fiscal Year 1994: ¥3,800,000 (Direct Cost: ¥3,800,000)
Keywords	Video Indexing / Gesture Dialog Relation / Human Behavior Understanding / Image Understanding / Media Integration / Natural Language Processing / Human Interface / Presentation / Image Recognition / Presentation Understanding / Video Processing / Gesture Recognition / Integration of Natural Language and Pattern Information
Research Abstract	In understanding human behavior, there are many ambiguities that depend on the situation, because few strict laws or rules apply across a wide variety of situations. We examined the relationship between gestures and the context of the situation, which is typically represented in the spoken dialog, and proposed a novel framework for understanding behavior in oral presentations. 1. Human behavior understanding in oral presentation: We developed a method to extract visual keys from presentation images and a method to extract linguistic keys from the words spoken in the presentation. A novel framework was developed to integrate both kinds of keys to resolve the intention of the presenter. 2. Temporal structure analysis of video by image and sound processing: We developed a method to estimate the temporal structure of a video sequence, considering its contents and the intention of the author. It uses visual and sound keys. Television commercials were used as the target presentation images. 3. Knowledge extraction from diagram and text: We developed a new framework for extracting knowledge from written texts and diagrams and for using the obtained knowledge to automatically organize flexible hypermedia.

Report

(3 results)

1995 Annual Research Report
1995 Final Research Report Summary
1994 Annual Research Report

Research Products
(23 results)

All Publications (23 results)

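[Publications] Yuichi Nakamura: "Understanding Speaker Behavior in Presentation Video" (in Japanese) IEICE Technical Report, Pattern Recognition and Understanding. 95-143. 51-56 (1995)
- Description
  From "Final Research Report Summary (Japanese)"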
- Related Report
  1995 Final Research Report Summary
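[Publications] Yukiyo Uehori: "An Automatic Structuring Method for TV Commercial Video Using Image Features and Acoustic Features" (in Japanese) IEICE Technical Report, Pattern Recognition and Understanding. 95-159. 9-12 (1995)
- Description
  From "Final Research Report Summary (Japanese)"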
- Related Report
  1995 Final Research Report Summary
[Publications] Yasuhiro Mukaigawa: "Synthesis of Arbitrarily Oriented Face Views from Two Images" Asian Conference on Computer Vision, Singapore. 3. 718-722 (1995)
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  1995 Final Research Report Summary
[Publications] Yasushi Sumi: "Detection of Face Orientation and Facial Components Using Distributed Appearance Model" Int. Workshop on Automatic Face- and Gesture-Recognition. 254-259 (1995)
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  1995 Final Research Report Summary
[Publications] Yuichi Nakamura: "Knowledge Extraction from Diagram and Text for Media Integration" Proc. Int. Conference on Multimedia Computing and Systems. (to be published). (1995)
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  1995 Final Research Report Summary
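[Publications] Yuichi Nakamura: "Multiresolution Representation for Bidirectional Recognition and Generation: Shape Generation and Editing Using Wavelet Extrema" (in Japanese) IEICE Technical Report, Pattern Recognition and Understanding. 95-172. 39-46 (1996)
- Description
  From "Final Research Report Summary (Japanese)"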
- Related Report
  1995 Final Research Report Summary
[Publications] Yuichi Nakamura, Masashi Nishitani, Yuichi Ohta: "Human Behavior Understanding in Oral Presentation" IEICE Technical Report SIG-PRU. Vol.95-143. 51-56 (1995)
- Description
  From "Final Research Report Summary (English)"
- Related Report
  1995 Final Research Report Summary
[Publications] Yukiyo Uehori, Mitsuhiro Murata, Yuichi Nakamura, Yuichi Ohta: "Temporal Structure Analysis of Television Commercial by Image and Sound Processing" IEICE Technical Report SIG-PRU. Vol.95-159. 9-12 (1995)
- Description
  From "Final Research Report Summary (English)"
- Related Report
  1995 Final Research Report Summary
[Publications] Yasuhiro Mukaigawa, Yuichi Nakamura, Yuichi Ohta: "Synthesis of Arbitrarily Oriented Face Views from Two Images" Asian Conf. on Computer Vision. Vol.3. 718-722 (1995)
- Description
  From "Final Research Report Summary (English)"
- Related Report
  1995 Final Research Report Summary
[Publications] Yasushi Sumi, Yuichi Ohta: "Detection of Face Orientation and Facial Components Using Distributed Appearance Model" Int. Workshop on Automatic Face- and Gesture-Recognition. 254-259 (1995)
- Description
  From "Final Research Report Summary (English)"
- Related Report
  1995 Final Research Report Summary
[Publications] Yuichi Nakamura, Miwa Takahashi, Masayuki Onda, Yuichi Ohta: "Knowledge Extraction from Diagram and Text for Media Integration" IEEE Multimedia Computing and Systems. (to be published). (1996)
- Description
  From "Final Research Report Summary (English)"
- Related Report
  1995 Final Research Report Summary
[Publications] Yuichi Nakamura, Yuichi Ohta: "Multiresolutional Pattern Description for Bi-directional Analysis" IEICE Technical Report SIG-PRU. Vol.95-172. 39-46 (1995)
- Description
  From "Final Research Report Summary (English)"
- Related Report
  1995 Final Research Report Summary
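[Publications] Yuichi Nakamura: "Understanding Speaker Behavior in Presentation Video" (in Japanese) IEICE Technical Report, Pattern Recognition and Understanding. 95-143. 51-56 (1995)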
- Related Report
  1995 Annual Research Report
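[Publications] Yukiyo Uehori: "An Automatic Structuring Method for TV Commercial Video Using Image Features and Acoustic Features" (in Japanese) IEICE Technical Report, Pattern Recognition and Understanding. 95-159. 9-12 (1995)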
- Related Report
  1995 Annual Research Report
[Publications] Yasuhiro Mukaigawa: "Synthesis of Arbitrarily Oriented Face Views from Two Images" Asian Conference on Computer Vision, Singapore. 3. 718-722 (1995)
- Related Report
  1995 Annual Research Report
[Publications] Yasushi Sumi: "Detection of Face Orientation and Facial Components Using Distributed Appearance Model" Int. Workshop on Automatic Face- and Gesture-Recognition. 254-259 (1995)
- Related Report
  1995 Annual Research Report
[Publications] Yuichi Nakamura: "Knowledge Extraction from Diagram and Text for Media Integration" Proc. Int. Conference on Multimedia Computing and Systems. (to be published). (1996)
- Related Report
  1995 Annual Research Report
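[Publications] Yuichi Nakamura: "Multiresolution Representation for Bidirectional Recognition and Generation: Shape Generation and Editing Using Wavelet Extrema" (in Japanese) IEICE Technical Report, Pattern Recognition and Understanding. 95-172. 39-46 (1995)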
- Related Report
  1995 Annual Research Report
[Publications] Satoh, K., and Ohta, Y.: "Passive Depth Acquisition for 3D Image Displays" IEICE Trans. on Information and Systems. E77-D. 949-957 (1994)
- Related Report
  1994 Annual Research Report
[Publications] Yasushi Sumi, Yuichi Ohta: "Analysis of Face Images Based on a Distributed 2D Model" (in Japanese) Transactions of the IEICE. 77D-II. 2342-2352 (1994)
- Related Report
  1994 Annual Research Report
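[Publications] Mitsuhiro Murata, Yuichi Nakamura, Yuichi Ohta: "Automatic Detection of Cuts in Images" (in Japanese) Proc. 50th National Convention of the Information Processing Society of Japan (first half of 1995). (to be published). 6D-7 (1995)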
- Related Report
  1994 Annual Research Report
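[Publications] Mitsuhiro Murata, Yuichi Nakamura, Yuichi Ohta: "Temporal Structuring of Video Using Image Processing" (in Japanese) Proc. 50th National Convention of the Information Processing Society of Japan (first half of 1995). (to be published). 6D-8 (1995)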
- Related Report
  1994 Annual Research Report
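[Publications] Masashi Nishitani, Yuichi Nakamura, Yuichi Ohta: "Understanding Speaker Actions in Presentation Video Using Speech Content" (in Japanese) Proc. 50th National Convention of the Information Processing Society of Japan (first half of 1995). (to be published). 7D-4 (1995)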
- Related Report
  1994 Annual Research Report
