Project/Area Number |
15300065
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Perception information processing/Intelligent robotics
|
Research Institution | Waseda University |
Principal Investigator |
KOBAYASHI Tetsunori Waseda University, Faculty of Science and Engineering, Professor, 理工学術院, 教授 (30162001)
|
Co-Investigator(Kenkyū-buntansha) |
FUJIE Shinya Waseda University, Faculty of Science and Engineering, Visiting Lecturer(Full-Time), 理工学術院, 客員講師(専任) (00367062)
OGAWA Tetsuji Waseda University, Faculty of Science and Engineering, Research Associate, 理工学術院, 助手 (70386598)
松坂 要佐 日本学術振興会, 特別研究員(PD)
|
Project Period (FY) |
2003 – 2006
|
Project Status |
Completed (Fiscal Year 2006)
|
Budget Amount *help |
¥13,300,000 (Direct Cost: ¥13,300,000)
Fiscal Year 2006: ¥1,900,000 (Direct Cost: ¥1,900,000)
Fiscal Year 2005: ¥3,200,000 (Direct Cost: ¥3,200,000)
Fiscal Year 2004: ¥3,800,000 (Direct Cost: ¥3,800,000)
Fiscal Year 2003: ¥4,400,000 (Direct Cost: ¥4,400,000)
|
Keywords | Spoken dialogue / Para-language / Prosody / Gesture recognition / Conversation Robot |
Research Abstract |
As a tool for investigating fundamental elements of natural spoken language communication, a prototype of spoken dialogue system with understanding and generating functions of linguistic and para-linguistic information was developed. Although many excellent studies on speech recognition and synthesis have been conducted, there exists no practical spoken dialogue system which satisfies us. One of the reasons is that most spoken dialogue systems did not deal with para-linguistic information. The quantitative understanding for para-linguistic information is not sufficient enough to make natural conversation system. In this study, we tried realizing many component technologies and a platform of conversation robot as tools to reveal the quantitative rolls of para-language. In particular, the following outcomes were obtained. 1) the sound localization and separation methods using the four-line directivity microphone mounted on head of robot, 2) the high quality speech synthesis method based on the waveform synthesis and the high quality voice conversion method for expressing para-linguistic information, 3) the method of attitude recognition and back-channel feedback generation based on the prosodic information as para-linguistic information in speech information, 4) the method of head gesture recognition and facial expression recognition as para-linguistic information in visual information, 5) humanoid robot "ROBISUKE" developed as the platform of the spoken dialogue system, and 6) Message Oriented RObot Architecture, MONEA, proposed for the integration of the abovementioned modules. Future work includes the experiment for finding out the necessary requirement for natural conversation quantitatively.
|