Robot Spoken Dialogue Interface Based on Interpretation of Users' Locutionary Acts

Publicly Offered Research

Project Area	Cyber Infrastructure for the Information-explosion Era
Project/Area Number	21013029
Research Category	Grant-in-Aid for Scientific Research on Priority Areas
Allocation Type	Single-year Grants
Review Section	Science and Engineering
Research Institution	Nagoya University (2010) Kyoto University (2009)
Principal Investigator	駒谷 和範 (Kazunori Komatani), Nagoya University, Graduate School of Engineering, Associate Professor (40362579)
Project Period (FY)	2009 – 2010
Project Status	Completed (Fiscal Year 2010)
Budget Amount	¥6,200,000 (Direct Cost: ¥6,200,000) Fiscal Year 2010: ¥3,100,000 (Direct Cost: ¥3,100,000) Fiscal Year 2009: ¥3,100,000 (Direct Cost: ¥3,100,000)
Keywords	spoken dialogue system / speech recognition / robot dialogue / barge-in / utterance timing / locutionary act / sound source separation / independent component analysis
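Research Abstract	Aiming at robust robot spoken dialogue in real environments, we built a spoken dialogue system that interprets user utterances using locutionary-act-level information. That is, in addition to the utterance content obtained as the speech recognition result, the system uses utterance timing and the silences between utterances to interpret what the user said. The goal is robot spoken dialogue that can robustly estimate the user's intention even when ambient noise is heavy. This fiscal year we addressed the following two points.
(1) Determining optimal interpretation weights according to the user and the enumerated items. We improved referent identification accuracy by varying the weights used when summing the probabilities obtained from the speech recognition result and from the utterance timing, according to the user and the enumerated content. This is based on our analysis showing that some users prefer utterances with content words while others prefer to indicate the target by timing, and it corresponds to supplying prior information suited to each.
(2) Switching to enumeration-style dialogue according to the dialogue situation. We implemented the timing-based interpretation method in a spoken dialogue system for restaurant search. The system detects situations in which the dialogue is hard to advance, using features such as the signal-to-noise ratio at the sound source separation stage and the number of repetitions of the same utterance. This realizes adaptive dialogue: when correct recognition results are hard to obtain, the system steers the user toward timing-based dialogue, and when the speech recognition results appear reliable, it interprets the utterance using them. We evaluated this method in an environment where high speech recognition performance is hard to obtain and showed that the task completion rate improves.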
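The two ideas in the abstract can be sketched roughly as follows. This is a minimal illustration, not the project's implementation: the function names, the single interpolation weight `asr_weight`, the rule-based detector, and the thresholds `min_snr_db` and `max_repeats` are all assumptions introduced here. The abstract states only that the two probabilities are summed with weights that depend on the user and the enumerated content, and that the signal-to-noise ratio and repetition count are among the cues used to detect a stalled dialogue.

```python
from typing import Dict, Tuple


def identify_referent(
    p_asr: Dict[str, float],
    p_timing: Dict[str, float],
    asr_weight: float,
) -> Tuple[str, Dict[str, float]]:
    """Pick the enumerated item with the highest weighted sum of probabilities.

    p_asr:      per-item probability from the speech recognition result
    p_timing:   per-item probability from the barge-in timing
    asr_weight: hypothetical weight in [0, 1]; a higher value suits users who
                name items with content words, a lower value suits users who
                point at items by timing
    """
    if not 0.0 <= asr_weight <= 1.0:
        raise ValueError("asr_weight must lie in [0, 1]")
    items = sorted(set(p_asr) | set(p_timing))
    if not items:
        raise ValueError("no candidate items")
    scores = {
        item: asr_weight * p_asr.get(item, 0.0)
        + (1.0 - asr_weight) * p_timing.get(item, 0.0)
        for item in items
    }
    best = max(items, key=lambda item: scores[item])
    return best, scores


def should_switch_to_enumeration(
    snr_db: float,
    repeat_count: int,
    min_snr_db: float = 5.0,   # assumed threshold, not from the report
    max_repeats: int = 2,      # assumed threshold, not from the report
) -> bool:
    """Simple rule: switch when the signal is noisy or the user keeps repeating."""
    return snr_db < min_snr_db or repeat_count >= max_repeats


if __name__ == "__main__":
    # Made-up probabilities for three enumerated restaurants.
    p_asr = {"Restaurant A": 0.5, "Restaurant B": 0.3, "Restaurant C": 0.2}
    p_timing = {"Restaurant A": 0.1, "Restaurant B": 0.7, "Restaurant C": 0.2}
    print(identify_referent(p_asr, p_timing, asr_weight=0.8)[0])
    print(identify_referent(p_asr, p_timing, asr_weight=0.2)[0])
    print(should_switch_to_enumeration(snr_db=2.0, repeat_count=0))
```

With these made-up numbers, a weight of 0.8 selects Restaurant A (the recognizer's favorite) and a weight of 0.2 selects Restaurant B (the item favored by timing), which shows how a per-user weight can change the identified referent.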

Report

(2 results)

2010 Annual Research Report
2009 Annual Research Report

Research Products
(23 results)

Journal Article (5 results, of which Peer Reviewed: 5 results) / Presentation (18 results)

[Journal Article] A multi-expert model for dialogue and behavior control of conversational robots and agents (2011)
- Author(s)
  Mikio Nakano, and 9 others
- Journal Title
  
  Knowledge-Based Systems
  
  Volume: Vol.24, Issue 2 Pages: 248-256
- Related Report
  2010 Annual Research Report
- Peer Reviewed
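[Journal Article] Improving Speech Understanding Accuracy Using Multiple Language Models and Multiple Language Understanding Models (original title: 複数の言語モデルと言語理解モデルによる音声理解の高精度化) (2010)
- Author(s)
  勝丸真樹, 中野幹生, 駒谷和範, and 4 others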
- Journal Title
  
  電子情報通信学会論文誌 (IEICE Transactions, Japanese edition)
  
  Volume: Vol.J93-D, No.6 Pages: 879-888
- NAID
  110007618361
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing (2010)
- Author(s)
  Kyoko Matsuyama, Kazunori Komatani, and 3 others
- Journal Title
  
  Trends in Applied Intelligent Systems
  
  Volume: Vol.6097/2010 Pages: 585-594
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Selecting Help Messages by using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems (2010)
- Author(s)
  Kazunori Komatani, and 4 others
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: Vol.E93-D, No.12 Pages: 3359-3367
- NAID
  10027989211
- Related Report
  2010 Annual Research Report
- Peer Reviewed
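[Journal Article] Improving Speech Understanding Accuracy Using Multiple Language Models and Multiple Language Understanding Models (original title: 複数の言語モデルと言語理解モデルによる音声理解の高精度化) (2010)
- Author(s)
  勝丸真樹, 中野幹生, 駒谷和範, and 4 others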
- Journal Title
  
  電子情報通信学会論文誌 (IEICE Transactions, Japanese edition), Vol.J93-D
- NAID
  110007618361
- Related Report
  2009 Annual Research Report
- Peer Reviewed
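[Presentation] Construction of a Spoken Dialogue System Using Locutionary-Act-Level Information and Analysis of Its Data (original title: 発語行為レベルの情報を用いた音声対話システムの構築とデータ分析) (2011)
- Author(s)
  松山匡子, 駒谷和範, 武田龍, 尾形哲也, 奥乃博
- Organizer
  JSAI Special Interest Group on Spoken Language Understanding and Dialogue Processing (SIG-SLUD)
- Place of Presentation
  (Abstract collection)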
- Year and Date
  2011-03-25
- Related Report
  2010 Annual Research Report
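[Presentation] A Spoken Dialogue System That Enumerates Options Under Frequent Misrecognition, and Its Evaluation (original title: 誤認識頻発状況下で選択肢列挙を行う音声対話システムとその評価) (2011)
- Author(s)
  松山匡子, 駒谷和範, 武田龍, 尾形哲也, 奥乃博
- Organizer
  The 73rd National Convention of IPSJ
- Place of Presentation
  Tokyo Institute of Technology (Tokyo)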
- Year and Date
  2011-03-03
- Related Report
  2010 Annual Research Report
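[Presentation] How to Conduct Dialogue System Research (original title: 対話システム研究の進め方) (2010)
- Author(s)
  中野幹生, 荒木雅弘, 駒谷和範, and 6 others
- Organizer
  JSAI Special Interest Group on Spoken Language Understanding and Dialogue Processing (SIG-SLUD)
- Place of Presentation
  Waseda University (Tokyo)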
- Year and Date
  2010-10-28
- Related Report
  2010 Annual Research Report
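[Presentation] Research on Spoken Dialogue Systems That Exploit Locutionary-Act-Level Information (original title: 発語行為レベルの情報を活用した音声対話システムの研究) (2010)
- Author(s)
  駒谷和範, 松山匡子, 奥乃博
- Organizer
  JSAI Special Interest Group on Spoken Language Understanding and Dialogue Processing (SIG-SLUD)
- Place of Presentation
  Waseda University (Tokyo)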
- Year and Date
  2010-10-28
- Related Report
  2010 Annual Research Report
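[Presentation] A Two-Stage Domain Selection Method for Highly Extensible Multi-Domain Dialogue Systems (original title: 拡張性の高いマルチドメイン対話システムのための2段ドメイン選択法) (2010)
- Author(s)
  佐藤隼, 中野幹生, 松山匡子, 駒谷和範, 船越孝太郎, 奥乃博
- Organizer
  JSAI Special Interest Group on Spoken Language Understanding and Dialogue Processing (SIG-SLUD)
- Place of Presentation
  Waseda University (Tokyo)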
- Year and Date
  2010-10-28
- Related Report
  2010 Annual Research Report
[Presentation] Analyzing User Utterances in Barge-in-able Spoken Dialogue System for Improving Identification Accuracy (2010)
- Author(s)
  Kyoko Matsuyama, Kazunori Komatani, and 4 others
- Organizer
  Interspeech 2010
- Place of Presentation
  Makuhari Messe (Chiba)
- Year and Date
  2010-09-30
- Related Report
  2010 Annual Research Report
[Presentation] Automatic Allocation of Training Data for Rapid Prototyping of Speech Understanding based on Multiple Model Combination (2010)
- Author(s)
  Kazunori Komatani, and 5 others
- Organizer
  The 23rd International Conference on Computational Linguistics (COLING-2010)
- Place of Presentation
  Beijing, China
- Year and Date
  2010-08-27
- Related Report
  2010 Annual Research Report
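[Presentation] Analysis of User Utterances in a Barge-in-able Spoken Dialogue System and Its Application to Referent Identification (original title: バージイン許容音声対話システムにおけるユーザ発話の分析と指示対象同定への応用) (2010)
- Author(s)
  松山匡子, 駒谷和範, 武田龍, 尾形哲也, 奥乃博
- Organizer
  IPSJ Special Interest Group on Spoken Language Processing (SIG-SLP)
- Place of Presentation
  Akiu Onsen (Miyagi)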
- Year and Date
  2010-07-24
- Related Report
  2010 Annual Research Report
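[Presentation] Applying a Speech Understanding Method Based on Multiple Language Models and Language Understanding Models to Rapid Prototyping (original title: 複数の言語モデルと言語理解モデルによる音声理解手法のラピッドプロトタイピングへの適用) (2010)
- Author(s)
  勝丸真樹, 駒谷和範, and 5 others
- Organizer
  The 72nd National Convention of IPSJ, 3U-2
- Place of Presentation
  The University of Tokyo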
- Year and Date
  2010-03-09
- Related Report
  2009 Annual Research Report
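[Presentation] Extending the Range of Acceptable Utterances Using LSM in Barge-in-able Spoken Dialogue (original title: バージイン許容音声対話におけるLSMによる許容発話範囲の拡張) (2010)
- Author(s)
  松山匡子, 駒谷和範, 高橋徹, 尾形哲也, 奥乃博
- Organizer
  The 72nd National Convention of IPSJ, 2ZN-2
- Place of Presentation
  The University of Tokyo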
- Year and Date
  2010-03-08
- Related Report
  2009 Annual Research Report
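[Presentation] Speech Understanding for Rapid Prototyping Using Multiple Language Models and Language Understanding Models (original title: 複数の言語モデルと言語理解モデルによるラピッドプロトタイピング向け音声理解) (2010)
- Author(s)
  勝丸真樹, 中野幹生, 駒谷和範, and 4 others
- Organizer
  IPSJ Special Interest Group on Spoken Language Processing (SIG-SLP), 2010-SLP-80-5
- Place of Presentation
  Suma Onsen Kotobukiro, Kobe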
- Year and Date
  2010-02-12
- Related Report
  2009 Annual Research Report
[Presentation] Ranking Help Message Candidates Based on Robust Grammar Verification Results and Utterance History in Spoken Dialogue Systems (2009)
- Author(s)
  Kazunori Komatani, and 4 others
- Organizer
  10th Annual SIGDIAL Meeting on Discourse and Dialogue
- Place of Presentation
  London, UK
- Year and Date
  2009-09-12
- Related Report
  2009 Annual Research Report
[Presentation] Improving Speech Understanding Accuracy with Limited Training Data Using Multiple Language Models and Multiple Understanding Models (2009)
- Author(s)
  Masaki Katsumaru, Mikio Nakano, Kazunori Komatani, and 3 others
- Organizer
  Interspeech 2009
- Place of Presentation
  Brighton, UK
- Year and Date
  2009-09-10
- Related Report
  2009 Annual Research Report
[Presentation] Enabling A User To Specify An Item At Any Time During System Enumeration (2009)
- Author(s)
  Kyoko Matsuyama, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  Interspeech 2009
- Place of Presentation
  Brighton, UK
- Year and Date
  2009-09-08
- Related Report
  2009 Annual Research Report
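[Presentation] Ranking Help Message Candidates Based on Grammar Verification Results and Utterance History in Spoken Dialogue Systems (original title: 音声対話システムにおける文法検証結果と発話履歴に基づくヘルプメッセージ候補のランキング) (2009)
- Author(s)
  駒谷和範, 池田智志, 福林雄一朗, 尾形哲也, 奥乃博
- Organizer
  IPSJ Special Interest Group on Spoken Language Processing (SIG-SLP), 2009-SLP-77-12
- Place of Presentation
  Iizaka Hotel Juraku, Iizaka Onsen, Fukushima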
- Year and Date
  2009-07-18
- Related Report
  2009 Annual Research Report
[Presentation] Adjusting Occurrence Probabilities of Automatically-Generated Abbreviated Words in Spoken Dialogue Systems (2009)
- Author(s)
  Masaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
- Organizer
  IEA/AIE-2009, LNAI5579
- Place of Presentation
  Tainan, Taiwan
- Year and Date
  2009-06-25
- Related Report
  2009 Annual Research Report
[Presentation] A Speech Understanding Framework that Uses Multiple Language Models and Multiple Understanding Models (2009)
- Author(s)
  Masaki Katsumaru, Mikio Nakano, Kazunori Komatani, and 3 others
- Organizer
  NAACL-HLT, Short Papers
- Place of Presentation
  Boulder, CO, USA
- Year and Date
  2009-06-01
- Related Report
  2009 Annual Research Report
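[Presentation] Referent Identification Incorporating a Barge-in Utterance Timing Model (original title: バージイン発話タイミングモデルを導入した指示対象同定) (2009)
- Author(s)
  松山匡子, 駒谷和範, 武田龍, 尾形哲也, 奥乃博
- Organizer
  IPSJ Special Interest Group on Spoken Language Processing (SIG-SLP), 2009-SLP-76-14
- Place of Presentation
  Tokyo Institute of Technology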
- Year and Date
  2009-05-22
- Related Report
  2009 Annual Research Report
