2011 Fiscal Year Final Research Report

Cooperative Understanding of Speeches and Images Using Multiple Recognizer and Its Application to Multimodal Dialogue System

Research Project

Project/Area Number	21500143
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	General
Research Field	Intelligent informatics
Research Institution	Kyushu Institute of Technology
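Principal Investigator	ENDO Tsutomu Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Professor (10112294)
Co-Investigator(Kenkyū-buntansha)	SHIMADA Kazutaka Kyushu Institute of Technology, Faculty of Computer Science and Systems Engineering, Assistant Professor (50346863)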
Project Period (FY)	2009 – 2011
Keywords	Natural language processing / Multimodal interface / Speech understanding / Gesture recognition
Research Abstract	We proposed several methods that integrate multiple approaches and features for multimodal dialogue systems. First, we developed a Web-based image retrieval system that uses both linguistic and image features. We also realized a multiple speech recognizer whose component recognizers are organized in hierarchical relations. For hand posture recognition, we combined online and offline machine learning techniques. For person identification, we introduced context features and top-view images.

Research Products
(9 results)

Journal Article (1 result, of which Peer Reviewed: 1 result), Presentation (7 results), Remarks (1 result)

[Journal Article] Recurrent Neural Network Classifier for Three Layer Conceptual Network and Performance (2010)
- Author(s)
  Md. Khalilur Rhaman and Tsutomu Endo
- Journal Title
  Journal of Computers
  Volume: Vol.5, No.1 Pages: 40-48
- Peer Reviewed
[Presentation] Detecting Lively Moments in Multi-party Conversation Using Linguistic and Non-linguistic Information (2012)
- Author(s)
  Takahiko Yokoyama, Kazutaka Shimada and Tsutomu Endo
- Organizer
  The 18th Annual Meeting of the Association for Natural Language Processing(NLP2012)
- Place of Presentation
  Hiroshima City University
- Year and Date
  2012-03-14
[Presentation] A Combined Method Based on SVM and Online Learning with HOG for Hand Shape Recognition (2011)
- Author(s)
  Kazutaka Shimada, Ryosuke Muto and Tsutomu Endo
- Organizer
  The 2nd International Workshop on Advanced Computational Intelligence and Intelligent Informatics(IWACIII2011)
- Place of Presentation
  Soochow University (China)
- Year and Date
  2011-11-21
[Presentation] A Person Identification Method Using a Top-view Head Image from an Overhead Camera (2011)
- Author(s)
  Ryota Nakatani, Daichi Kouno, Kazutaka Shimada and Tsutomu Endo
- Organizer
  The 2nd International Workshop on Advanced Computational Intelligence and Intelligent Informatics(IWACIII2011)
- Place of Presentation
  Soochow University (China)
- Year and Date
  2011-11-21
[Presentation] A person identification method using facial, clothing and time feature (2011)
- Author(s)
  Kazuaki Komatsu, Kazutaka Shimada and Tsutomu Endo
- Organizer
  The 2nd International Workshop on Advanced Computational Intelligence and Intelligent Informatics(IWACIII2011)
- Place of Presentation
  Soochow University (China)
- Year and Date
  2011-11-20
[Presentation] A Hierarchical Multiple Recognizer for Robust Speech Understanding (2010)
- Author(s)
  Takahiko Yokoyama, Kazutaka Shimada and Tsutomu Endo
- Organizer
  The Pacific Rim International Conference on Artificial Intelligence(PRICAI 2010)
- Place of Presentation
  Novotel Hotel, Daegu (South Korea)
- Year and Date
  2010-08-31
[Presentation] Web image retrieval for abstract queries using text and image information (2009)
- Author(s)
  Kazutaka Shimada, Suguru Ishikawa and Tsutomu Endo
- Organizer
  The Fifth Asia Information Retrieval Symposium(AIRS 2009)
- Place of Presentation
  Hokkaido University
- Year and Date
  2009-10-22
[Presentation] Effective construction and expansion of a sentiment corpus using an existing corpus and evaluative criteria estimation (2009)
- Author(s)
  Ryosuke Tadano, Kazutaka Shimada and Tsutomu Endo
- Organizer
  The 11th Conference of the Pacific Association for Computational Linguistics(PACLING2009)
- Place of Presentation
  Hokkaido University
- Year and Date
  2009-09-03
[Remarks]
- URL
  http://www.pluto.ai.kyutech.ac.jp/plt/endo-lab/index.html
