Cooperative Understanding of Speeches and Images Using Multiple Recognizer and Its Application to Multimodal Dialogue System
Project/Area Number |
21500143
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Intelligent informatics
|
Research Institution | Kyushu Institute of Technology |
Principal Investigator |
ENDO Tsutomu 九州工業大学, 大学院・情報工学研究院, 教授 (10112294)
|
Co-Investigator(Kenkyū-buntansha) |
SHIMADA Kazutaka 九州工業大学, 大学院情報工学研究院, 助教 (50346863)
|
Project Period (FY) |
2009 – 2011
|
Project Status |
Completed (Fiscal Year 2011)
|
Budget Amount *help |
¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000)
Fiscal Year 2011: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000)
Fiscal Year 2010: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000)
Fiscal Year 2009: ¥1,300,000 (Direct Cost: ¥1,000,000、Indirect Cost: ¥300,000)
|
Keywords | 自然言語処理 / マルチモーダルインタフェース / 音声理解 / ジェスチャ認識 / マルチモーダル / 文脈処理 / 複合認識器 / 人物識別 / 頭上画像 / マルチモーダルインターフェース / 情報統合 / 複数認識器 |
Research Abstract |
We proposed a wide variety of methods to integrate several ap-proaches and features for multimodal dialogue systems. We developed a Web based image retrieval system using linguistic and image features first. We also realized a multiple speech recognizer with hierarchical relations. For hand posture recognition, we combined online and offline machine learning techniques. We introduced context features and top-view images to person identification.
|
Report
(4 results)
Research Products
(29 results)