2010 Fiscal Year Annual Research Report

ユーザの発語行為解釈に基づくロボット音声対話インタフェース

Publicly Offered Research

Project Area	Cyber Infrastructure for the Information-explosion Era
Project/Area Number	21013029
Research Institution	Nagoya University
Principal Investigator	駒谷和範名古屋大学, 工学研究科, 准教授 (40362579)
Keywords	音声対話システム / 音声認識 / ロボット対話 / バージイン / 発話タイミング / 発語行為 / 音源分離 / 独立成分分析
Research Abstract	実環境下での頑健なロボット音声対話の実現を目指して,発話行為レベルの情報に着目して解釈を行う音声対話システムを構築した.つまり,音声認識結果として得られる発話内容以外に,発話タイミングや発話間の沈黙を併用してユーザの発話を解釈する.これにより,周辺雑音の多い状況でも頑健にユーザの意図を推定可能なロボット音声対話の実現を目指す.本年度は具体的には下記2点に取り組んだ. (1)ユーザや列挙項目に応じた最適な解釈重みの決定音声認識結果と発話タイミングから得られる確率を足し合わせる際の重みを,ユーザや列挙内容に応じて変化させることで,指示対象の同定精度を向上させた.これは,内容語を用いた発話を好むユーザと,タイミングによる対象指示を好むユーザが存在するという分析結果に基づいており,それぞれに応じて事前情報を与えることに相当する. (2)対話状況に応じた列挙型対話への切り替えレストラン検索を行う音声対話システムに,タイミングを用いた解釈法を実装した.この際に,音源分離部の信号雑音比や同一発話の繰り返し回数などを用いて,対話の進行が困難な状況を検出する.これにより,正しい認識結果を得るのが難しい場合はタイミングを用いた対話に誘導し,音声認識結果が信頼できそうな場合にはそれを用いて解釈を行うといった適応的な対話を実現した.この手法について,高い音声認識性能を得るのが困難な環境下で評価実験を行い,タスク達成率が向上することを示した.

Research Products
(12 results)

All 2011 2010

All Journal Article (4 results) (of which Peer Reviewed: 4 results) Presentation (8 results)

[Journal Article] A multi-expert model for dialogue and behavior control of conversational robots and agents2011
- Author(s)
  Mikio Nakano, 他9名
- Journal Title
  
  Knowledge-Based Systems
  
  Volume: Vol.24, Issue 2 Pages: 248-256
- Peer Reviewed
[Journal Article] 複数の言語モデルと言語理解モデルによる音声理解の高精度化2010
- Author(s)
  勝丸真樹, 中野幹生, 駒谷和範, 他4名
- Journal Title
  
  電子情報通信学会論文誌
  
  Volume: Vol.J93-D, No.6 Pages: 879-888
- Peer Reviewed
[Journal Article] Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing2010
- Author(s)
  Kyoko Matsuyama, Kazunori Komatani, 他3名
- Journal Title
  
  Trend in Applied Intelligent Systems
  
  Volume: Vol.6097/2010 Pages: 585-594
- Peer Reviewed
[Journal Article] Selecting Help Messages by using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems2010
- Author(s)
  Kazunori Komatani, 他4名
- Journal Title
  
  IEICE Transaction on Information and Systems
  
  Volume: Vol.E93-D, No.12 Pages: 3359-3367
- Peer Reviewed
[Presentation] 発語行為レベルの情報を用いた音声対話システムの構築とデータ分析2011
- Author(s)
  松山匡子, 駒谷和範, 武田龍, 尾形哲也, 奥乃博
- Organizer
  人工知能学会言語・音声理解と対話処理研究会(SIG-SLUD)
- Place of Presentation
  (要旨集)
- Year and Date
  2011-03-25
[Presentation] 誤認識頻発状況下で選択肢列挙を行う音声対話システムとその評価2011
- Author(s)
  松山匡子, 駒谷和範, 武田龍, 尾形哲也, 奥乃博
- Organizer
  情報処理学会第73回全国大会
- Place of Presentation
  東京工業大学(東京都)
- Year and Date
  2011-03-03
[Presentation] 対話システム研究の進め方2010
- Author(s)
  中野幹生, 荒木雅弘, 駒谷和範, 他6名
- Organizer
  人工知能学会言語・音声理解と対話処理研究会(SIG-SLUD)
- Place of Presentation
  早稲田大学(東京都)
- Year and Date
  2010-10-28
[Presentation] 発語行為レベルの情報を活用した音声対話システムの研究2010
- Author(s)
  駒谷和範, 松山匡子, 奥乃博
- Organizer
  人工知能学会言語・音声理解と対話処理研究会(SIG-SLUD)
- Place of Presentation
  早稲田大学(東京都)
- Year and Date
  2010-10-28
[Presentation] 拡張性の高いマルチドメイン対話システムのための2段ドメイン選択法2010
- Author(s)
  佐藤隼, 中野幹生, 松山匡子, 駒谷和範, 船越孝太郎, 奥乃博
- Organizer
  人工知能学会言語・音声理解と対話処理研究会(SIG-SLUD)
- Place of Presentation
  早稲田大学(東京都)
- Year and Date
  2010-10-28
[Presentation] Analyzing User Utterances in Barge-in-able Spoken Dialogue System for Improving Identification Accuracy2010
- Author(s)
  Kyoko Matsuyama, Kazunori Komatani, 他4名
- Organizer
  Interspeech 2010
- Place of Presentation
  幕張メッセ(千葉県)
- Year and Date
  2010-09-30
[Presentation] Automatic Allocation of Training Data for Rapid Prototyping of Speech Understanding based on Multiple Model Combination2010
- Author(s)
  Kazunori Komatani, 他5名
- Organizer
  The 23rd International Conference on Computational Linguistics (COLING-2010)
- Place of Presentation
  北京, 中国
- Year and Date
  2010-08-27
[Presentation] バージイン許容音声対話システムにおけるユーザ発話の分析と指示対象同定への応用2010
- Author(s)
  松山匡子, 駒谷和範, 武田龍, 尾形哲也, 奥乃博
- Organizer
  情報処理学会音声言語情報処理研究会(SIG-SLP)
- Place of Presentation
  秋保温泉(宮城県)
- Year and Date
  2010-07-24

2010 Fiscal Year Annual Research Report

ユーザの発語行為解釈に基づくロボット音声対話インタフェース

Principal Investigator

駒谷 和範 名古屋大学, 工学研究科, 准教授 (40362579)

Research Products

[Journal Article] A multi-expert model for dialogue and behavior control of conversational robots and agents2011

Author(s)

Journal Title

[Journal Article] 複数の言語モデルと言語理解モデルによる音声理解の高精度化2010

Author(s)

Journal Title

[Journal Article] Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing2010

Author(s)

Journal Title

[Journal Article] Selecting Help Messages by using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems2010

Author(s)

Journal Title

[Presentation] 発語行為レベルの情報を用いた音声対話システムの構築とデータ分析2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 誤認識頻発状況下で選択肢列挙を行う音声対話システムとその評価2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 対話システム研究の進め方2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 発語行為レベルの情報を活用した音声対話システムの研究2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 拡張性の高いマルチドメイン対話システムのための2段ドメイン選択法2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Analyzing User Utterances in Barge-in-able Spoken Dialogue System for Improving Identification Accuracy2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Automatic Allocation of Training Data for Rapid Prototyping of Speech Understanding based on Multiple Model Combination2010

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] バージイン許容音声対話システムにおけるユーザ発話の分析と指示対象同定への応用2010

Author(s)

Organizer

Place of Presentation

Year and Date

駒谷和範名古屋大学, 工学研究科, 准教授 (40362579)