2011 Fiscal Year Final Research Report
Cooperative Understanding of Speeches and Images Using Multiple Recognizer and Its Application to Multimodal Dialogue System
Project/Area Number |
21500143
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Intelligent informatics
|
Research Institution | Kyushu Institute of Technology |
Principal Investigator |
ENDO Tsutomu 九州工業大学, 大学院・情報工学研究院, 教授 (10112294)
|
Co-Investigator(Kenkyū-buntansha) |
SHIMADA Kazutaka 九州工業大学, 大学院情報工学研究院, 助教 (50346863)
|
Project Period (FY) |
2009 – 2011
|
Keywords | 自然言語処理 / マルチモーダルインタフェース / 音声理解 / ジェスチャ認識 |
Research Abstract |
We proposed a wide variety of methods to integrate several ap-proaches and features for multimodal dialogue systems. We developed a Web based image retrieval system using linguistic and image features first. We also realized a multiple speech recognizer with hierarchical relations. For hand posture recognition, we combined online and offline machine learning techniques. We introduced context features and top-view images to person identification.
|
Research Products
(9 results)