
2003 Fiscal Year Final Research Report Summary

User's Intention Understanding Using Multi-modal Information for Intelligent Interfaces

Research Project

Project/Area Number 13680471
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Intelligent informatics
Research Institution Fukuoka University

Principal Investigator

TSURUTA Naoyuki  Fukuoka Univ., Faculty of Engineering, Associate Professor (60227478)

Co-Investigator (Kenkyū-buntansha) MAEDA Sakashi  Fukuoka Univ., Faculty of Engineering, Research Associate (90330901)
MORIMOTO Tsuyoshi  Fukuoka Univ., Faculty of Engineering, Professor (10309891)
Project Period (FY) 2001 – 2003
Keywords User Interface / Multi-modal / Intention Understanding / Dialogue System / Image Recognition / Lip-reading / Spoken Language Understanding
Research Abstract

This research developed a framework for understanding a user's intention from multi-modal information, enabling a natural, robust, and intelligent man-machine interface.
In general, intention understanding consists of the following three stages: (A) tracking humans walking around the system; (B) detecting a human approaching the system and confirming that person's intention to use it; (C) understanding the user's intention during man-machine dialogue. Previous research focused on only one of these stages and not on the transitions between them. As a result, a natural dialogue interface had not yet been developed.
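The three stages and the transitions between them can be pictured as a small state machine. The following is only a minimal sketch under stated assumptions: the names `Stage`, `next_stage`, `person_nearby`, and `intent_confirmed` are hypothetical, and the transition rules are an illustrative guess, not the logic used in the project.

```python
from enum import Enum, auto


class Stage(Enum):
    TRACKING = auto()   # (A) tracking humans walking around the system
    APPROACH = auto()   # (B) a human is approaching; confirm intention to use
    DIALOGUE = auto()   # (C) intention understanding during dialogue


def next_stage(stage: Stage, person_nearby: bool, intent_confirmed: bool) -> Stage:
    """Return the next stage given two hypothetical boolean observations."""
    if not person_nearby:
        # Assumption: losing the person always falls back to tracking.
        return Stage.TRACKING
    if stage is Stage.TRACKING:
        return Stage.APPROACH
    if stage is Stage.APPROACH:
        # Assumption: dialogue starts only once intention is confirmed.
        return Stage.DIALOGUE if intent_confirmed else Stage.APPROACH
    return Stage.DIALOGUE
```

In this sketch the transition from (B) to (C) depends only on `intent_confirmed`, which stands in for whatever evidence the system collects about the person's intention.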
This research focused on (B), (C), and the transitions between them, and obtained the following three results; results from previous research were used for (A). (1) When the system detects an approaching human, it gathers information using a new active-vision method and confirms his/her intention implicitly. This implicit confirmation enables a natural transition from (B) to (C). (2) For stage (C), a combination of vision-based lip-reading and context analysis with conventional spoken language recognition was proposed, which enables high recognition accuracy. Using the proposed methods for (B) and (C), a very robust dialogue system could be developed. (3) The recognition accuracy for dialogues was very high but still not perfect. Therefore, a touch-panel device with on-screen menus was additionally introduced, and a new modality-switching method was proposed. With this method, the user can communicate with the system through audio-visual dialogue as often as possible, on the premise that the interaction must always succeed.
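Results (2) and (3) can be illustrated with a short sketch. The abstract does not state how the speech, lip-reading, and context sources are combined or how the switch to the touch panel is decided, so everything here is an assumption: a weighted sum of per-candidate scores and a fixed confidence threshold. The names `fuse_scores`, `select_modality`, the weights, and the threshold are all hypothetical.

```python
def fuse_scores(speech, lip, context, weights=(0.5, 0.3, 0.2)):
    """Weighted sum of per-candidate scores from three sources.

    Each argument maps a candidate utterance to a score in [0, 1].
    The weighting scheme is an illustrative assumption, not the
    method proposed in the project.
    """
    w_speech, w_lip, w_context = weights
    candidates = set(speech) | set(lip) | set(context)
    return {
        c: w_speech * speech.get(c, 0.0)
        + w_lip * lip.get(c, 0.0)
        + w_context * context.get(c, 0.0)
        for c in candidates
    }


def select_modality(fused, threshold=0.6):
    """Keep audio-visual dialogue when confident, else use the touch panel."""
    if not fused:
        return ("touch_panel", None)
    best = max(fused, key=fused.get)
    if fused[best] >= threshold:
        return ("dialogue", best)
    # Low confidence: fall back to menus so the interaction can still succeed.
    return ("touch_panel", None)
```

A caller would run `select_modality(fuse_scores(speech, lip, context))` on each user turn and show the touch-panel menu only when the result is `"touch_panel"`.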

  • Research Products

    (12 results)

All Publications (12 results)

  • [Publications] Tarek EL.Tobely: "The Competitive Algorithm of The Hypercolumn Neural Network Toward Real-time Image Recognition" Journal of Neural Network World. 1-03. 15-39 (2003)

    • Description
      From "Summary of Research Results Report (Japanese)"
  • [Publications] N.Tsuruta: "Self-organizing Feature Maps for HMM Based Lip-reading" Lecture Notes in Computer Science, Springer. 2774. 162-168 (2003)

    • Description
      From "Summary of Research Results Report (Japanese)"
  • [Publications] S.Takahashi: "Robust Speech Understanding Based on Expected Discourse Plan" Proc. of the EUROSPEECH. 1. 661-664 (2003)

    • Description
      From "Summary of Research Results Report (Japanese)"
  • [Publications] S.Takahashi: "Dialogue Experiment for Elderly People in Home Health Care System" Proc. of the TSD2003. 1. 418-423 (2003)

    • Description
      From "Summary of Research Results Report (Japanese)"
  • [Publications] Tarek EL.Tobely: "A Randomized Model of Hypercolumn Neural Network for Gesture Recognition" Journal of Computers, Systems and Signals. 3-1. 14-28 (2002)

    • Description
      From "Summary of Research Results Report (Japanese)"
  • [Publications] Tarek EL. Tobely: "The Competitive Algorithm of The Hypercolumn Neural Network Toward Real-time Image Recognition" Journal of Neural Network World. 1-03. 15-39 (2003)

    • Description
      From "Summary of Research Results Report (English)"
  • [Publications] N.Tsuruta: "Self-organizing Feature Maps for HMM Based Lip-reading" Lecture Notes in Computer Science, Springer. 2774. 162-168 (2003)

    • Description
      From "Summary of Research Results Report (English)"
  • [Publications] S.Takahashi: "Robust Speech Understanding Based on Expected Discourse Plan" Proc. of the EUROSPEECH. 1. 661-664 (2003)

    • Description
      From "Summary of Research Results Report (English)"
  • [Publications] S.Takahashi: "Dialogue Experiment for Elderly People in Home Health Care System" Proc. of the TSD2003. 1. 418-423 (2003)

    • Description
      From "Summary of Research Results Report (English)"
  • [Publications] Tarek EL. Tobely: "A Randomized Model of Hypercolumn Neural Network for Gesture Recognition" Journal of Computers, Systems and Signals. 3-1. 14-28 (2002)

    • Description
      From "Summary of Research Results Report (English)"
  • [Publications] N.Tsuruta: "Randomized Self-organizing Maps for Gesture Recognition" Journal of Japan Society for Fuzzy Theory and Systems. 14-1. 82-87 (2002)

    • Description
      From "Summary of Research Results Report (English)"
  • [Publications] N.Tsuruta: "A Randomized Hypercolumn Model and Gesture Recognition" Lecture Notes in Computer Science, Springer (IWANN2001). 2084. 235-242 (2001)

    • Description
      From "Summary of Research Results Report (English)"

Published: 2005-04-19  
