2000 Fiscal Year Final Research Report Summary
An Investigation of Cooperative Understanding of Utterances and Gestures Based on Interaction in Semantics Level
Project/Area Number |
10680388
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | General |
Research Field |
Intelligent informatics
|
Research Institution | Kyushu Institute of Technology (2000) Oita University (1998-1999) |
Principal Investigator |
ENDO Tsutomu Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Department of Artificial Intelligence, Professor (10112294)
|
Co-Investigator (Kenkyū-buntansha) |
KAGAWA Tsuneo Oita University, Faculty of Engineering, Department of Computer Science and Intelligent Systems, Research Associate (90253773)
|
Project Period (FY) |
1998 – 2000
|
Keywords | Semantic analysis / multimedia / multimodal / Human interface / Cooperative understanding / Information Integration / Gesture recognition / Natural language understanding |
Research Abstract |
We are developing a problem-solving and knowledge-acquisition system for first-grade mathematics, based on co-reference between drill texts and dialogue with a teacher. This research proposed a method for the cooperative understanding of utterances and gestures.
(1) Contextual information processing. We defined the context of a dialogue, which consists of the surface and case structures of utterances, the intention and attention of the speaker, the situation of the dialogue, and world knowledge. We then presented algorithms for generating utterances from the system and for interpreting responses from the teacher using this contextual information.
(2) Analysis of gestures and utterances. Our point of interest is the movement of the tip of the teacher's pen. We developed a simple input device that detects the three-dimensional coordinates of the pen tip, and presented algorithms to extract features from the moving points. A feature-based approach is used for gesture recognition. We then proposed a method for parsing the word candidates produced by a speech recognition program.
(3) Cooperative understanding of utterances and gestures. We defined a multi-modal semantic representation to describe the meaning of utterances and gestures, and showed how to integrate our algorithms for utterance and gesture analysis. We concluded with an evaluation of the understanding system against the design principles, which provide the basis for integrating multi-modal information during a dialogue.
(4) Generation of gestures in cooperation with utterances. Gestures such as pointing at objects on a drill text or drawing pictures are represented by the movement of a pen and displayed as three-dimensional graphical data. We defined a gesture frame and gesture elements as an intermediate representation, and presented algorithms for generating them from the semantic representation together with the synchronized phrase.
|
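The feature-based gesture analysis of item (2) can be sketched as follows. This is a minimal illustration, not the report's implementation: the function names, the speed threshold, and the three gesture classes are assumptions chosen for the example.

```python
import math

def extract_features(points, dt=0.05):
    """Simple features from a sampled 3-D pen-tip trajectory.

    points: list of (x, y, z) samples taken every dt seconds.
    Returns per-step speeds and the net start-to-end displacement,
    the kind of features a rule-based recognizer can classify.
    """
    speeds = [math.dist(p, q) / dt for p, q in zip(points, points[1:])]
    net = math.dist(points[0], points[-1]) if points else 0.0
    return speeds, net

def classify(points, dt=0.05, pause_speed=5.0):
    """Rough illustrative rules: an almost stationary tip is 'pointing';
    a moving tip that returns near its start is 'circling' (e.g. circling
    a number on the drill text); anything else is a 'stroke'."""
    speeds, net = extract_features(points, dt)
    if not speeds or max(speeds) < pause_speed:
        return "pointing"
    path = sum(s * dt for s in speeds)  # total path length
    if path > 0 and net / path < 0.2:
        return "circling"
    return "stroke"
```

A real recognizer would of course use more features (direction changes, dwell time) and calibrated thresholds; the point is only that a handful of trajectory features already separates coarse gesture classes.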
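The integration step of item (3) can be illustrated with a small sketch in which a deictic case filler ("this") in an utterance frame is bound to the referent of the temporally closest pointing gesture. The frame slots and the time-window heuristic are assumptions for illustration, not the report's actual multi-modal semantic representation.

```python
from dataclasses import dataclass

@dataclass
class Utterance:
    predicate: str
    cases: dict    # case role -> filler; deictic slots hold "this"/"here"
    time: float    # utterance onset (seconds)

@dataclass
class Gesture:
    kind: str      # e.g. "pointing"
    referent: str  # object on the drill text indicated by the pen
    time: float

def integrate(utt, gestures, window=1.0):
    """Build one multi-modal frame: copy the utterance's case structure,
    then fill deictic slots from the gesture closest in time, provided it
    falls within the synchronization window."""
    frame = {"pred": utt.predicate, **utt.cases}
    deictic = [r for r, f in utt.cases.items() if f in ("this", "here")]
    if deictic and gestures:
        g = min(gestures, key=lambda g: abs(g.time - utt.time))
        if abs(g.time - utt.time) <= window:
            for role in deictic:
                frame[role] = g.referent
    return frame
```

If no gesture arrives within the window, the deictic filler is left unresolved for later contextual processing, mirroring the idea that utterance and gesture analyses cooperate rather than one strictly depending on the other.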
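The gesture generation of item (4), expanding an intermediate representation into displayable pen movement, might be sketched as below: a pointing gesture frame is rendered as a sequence of 3-D gesture elements (pen-tip points) that lift off the page, travel, and land on the target. The trajectory shape and parameters are illustrative assumptions.

```python
def pointing_trajectory(start, target, height=2.0, steps=10):
    """Pen-tip path for a pointing gesture: interpolate linearly from
    start to target in x and y while adding a parabolic lift in z that
    peaks mid-path and lands exactly on the target. The returned points
    serve as gesture elements for 3-D graphical display."""
    (x0, y0, z0), (x1, y1, z1) = start, target
    path = []
    for i in range(steps + 1):
        t = i / steps
        x = x0 + (x1 - x0) * t
        y = y0 + (y1 - y0) * t
        z = z0 + (z1 - z0) * t + height * 4 * t * (1 - t)  # lift term is 0 at both ends
        path.append((x, y, z))
    return path
```

Synchronization with the spoken phrase could then amount to scheduling this point sequence over the phrase's duration, one design option consistent with the report's pairing of gesture elements with a synchronized phrase.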
Research Products
(16 results)