2003 Fiscal Year Final Research Report Summary

User's Intention Understanding Using Multi-modal Information for Intelligent Interfaces

Research Project

Project/Area Number	13680471
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	General
Research Field	Intelligent informatics
Research Institution	Fukuoka University
Principal Investigator	TSURUTA Naoyuki Fukuoka Univ., Faculty of Engineering, Associate Professor (60227478)
Co-Investigator(Kenkyū-buntansha)	MAEDA Sakashi Fukuoka Univ., Faculty of Engineering, Research Associate (90330901); MORIMOTO Tsuyoshi Fukuoka Univ., Faculty of Engineering, Professor (10309891)
Project Period (FY)	2001 – 2003
Keywords	User Interface / Multi-modal / Intention Understanding / Dialogue System / Image Recognition / Lip-reading / Spoken Language Understanding
Research Abstract	This research developed a framework for understanding a user's intention from multi-modal information, enabling a natural, robust, and intelligent man-machine interface. In general, intention understanding consists of the following three stages: (A) tracking humans walking around the system; (B) detecting a human approaching the system and confirming his/her intention to use it; (C) understanding the user's intention during man-machine dialogue. Previous research focused on only one of these stages and not on the transitions between them, so a natural dialogue interface had not yet been developed. This research focused on (B), (C), and the transitions between them, using results of previous research for (A), and obtained the following three results. (1) When the system detects an approaching human, it gathers information using a new active-vision method and confirms his/her intention implicitly. This implicit confirmation enables a natural transition from (B) to (C). (2) For stage (C), a combination of vision-based lip-reading and context analysis with traditional spoken language recognition was proposed, which enables high recognition accuracy. Using the proposed methods for (B) and (C), a very robust dialogue system could be developed. (3) The recognition accuracy for dialogues was very high but not perfect. Therefore, a touch-panel device with on-screen menus was additionally introduced, and a new modal switching method was proposed. With this method, the user can communicate with the system through audio-visual dialogue as often as possible, on the premise that the interaction as a whole always succeeds.
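
The stage transitions and the modal switching described in the abstract can be illustrated with a minimal sketch. It is not the project's implementation: the report does not say how the switch between audio-visual dialogue and the touch panel is triggered, so the confidence score, the 0.8 threshold, and all names below (Stage, Modality, next_stage, choose_modality) are hypothetical.

```python
from enum import Enum, auto


class Stage(Enum):
    TRACKING = auto()   # (A) tracking humans walking around the system
    APPROACH = auto()   # (B) detecting an approach and confirming intention
    DIALOGUE = auto()   # (C) understanding intention during dialogue


class Modality(Enum):
    AUDIO_VISUAL = auto()  # speech plus lip-reading
    TOUCH_PANEL = auto()   # on-screen menus used as a fallback


def next_stage(stage, person_approaching, intention_confirmed):
    """Return the next stage given two (assumed) boolean observations."""
    if stage is Stage.TRACKING and person_approaching:
        return Stage.APPROACH
    if stage is Stage.APPROACH:
        if not person_approaching:
            return Stage.TRACKING      # the person walked away
        if intention_confirmed:
            return Stage.DIALOGUE      # implicit confirmation succeeded
    return stage                       # otherwise stay in the current stage


def choose_modality(recognition_confidence, threshold=0.8):
    """Prefer audio-visual dialogue; fall back to the touch panel when the
    (assumed) recognition confidence is below the (assumed) threshold."""
    if recognition_confidence >= threshold:
        return Modality.AUDIO_VISUAL
    return Modality.TOUCH_PANEL
```

In this sketch the touch panel is only a fallback, which mirrors the stated goal of using audio-visual dialogue as often as possible while still letting every interaction complete.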

Research Products
(12 results)

All Publications (12 results)

[Publications] Tarek EL. Tobely: "The Competitive Algorithm of The Hypercolumn Neural Network Toward Real-time Image Recognition" Journal of Neural Network World. 1-03. 15-39 (2003)
- Description
  From "Research Report Summary (Japanese version)"
[Publications] N. Tsuruta: "Self-organizing Feature Maps for HMM Based Lip-reading" Lecture Notes in Computer Science, Springer. 2774. 162-168 (2003)
- Description
  From "Research Report Summary (Japanese version)"
[Publications] S. Takahashi: "Robust Speech Understanding Based on Expected Discourse Plan" Proc. of the EUROSPEECH. 1. 661-664 (2003)
- Description
  From "Research Report Summary (Japanese version)"
[Publications] S. Takahashi: "Dialogue Experiment for Elderly People in Home Health Care System" Proc. of the TSD2003. 1. 418-423 (2003)
- Description
  From "Research Report Summary (Japanese version)"
[Publications] Tarek EL. Tobely: "A Randomized Model of Hypercolumn Neural Network for Gesture Recognition" Journal of Computers, Systems and Signals. 3-1. 14-28 (2002)
- Description
  From "Research Report Summary (Japanese version)"
[Publications] Tarek EL. Tobely: "The Competitive Algorithm of The Hypercolumn Neural Network Toward Real-time Image Recognition" Journal of Neural Network World. 1-03. 15-39 (2003)
- Description
  From "Research Report Summary (English version)"
[Publications] N. Tsuruta: "Self-organizing Feature Maps for HMM Based Lip-reading" Lecture Notes in Computer Science, Springer. 2774. 162-168 (2003)
- Description
  From "Research Report Summary (English version)"
[Publications] S. Takahashi: "Robust Speech Understanding Based on Expected Discourse Plan" Proc. of the EUROSPEECH. 1. 661-664 (2003)
- Description
  From "Research Report Summary (English version)"
[Publications] S. Takahashi: "Dialogue Experiment for Elderly People in Home Health Care System" Proc. of the TSD2003. 1. 418-423 (2003)
- Description
  From "Research Report Summary (English version)"
[Publications] Tarek EL. Tobely: "A Randomized Model of Hypercolumn Neural Network for Gesture Recognition" Journal of Computers, Systems and Signals. 3-1. 14-28 (2002)
- Description
  From "Research Report Summary (English version)"
[Publications] N. Tsuruta: "Randomized Self-organizing Maps for Gesture Recognition" Journal of Japan Society for Fuzzy Theory and Systems. 14-1. 82-87 (2002)
- Description
  From "Research Report Summary (English version)"
[Publications] N. Tsuruta: "A Randomized Hypercolumn Model and Gesture Recognition" Lecture Notes in Computer Science, (Springer(IWANN2001)). 2084. 235-242 (2001)
- Description
  From "Research Report Summary (English version)"
