A turn-taking system linked with dialogue understanding and utterance generation

Research Project

Project/Area Number	20K19821
Research Category	Grant-in-Aid for Early-Career Scientists
Allocation Type	Multi-year Fund
Review Section	Basic Section 61010:Perceptual information processing-related
Research Institution	Kyoto University
Principal Investigator	Inoue Koji Kyoto University, Graduate School of Informatics, Assistant Professor (10838684)
Project Period (FY)	2020-04-01 – 2023-03-31
Project Status	Completed (Fiscal Year 2022)
Budget Amount	¥3,250,000 (Direct Cost: ¥2,500,000, Indirect Cost: ¥750,000); Fiscal Year 2022: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000); Fiscal Year 2021: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000); Fiscal Year 2020: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
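Keywords	spoken dialogue system / conversational robot / turn-taking / taking the right to speak / speaker change / dialogue / intent understanding / multi-party dialogue / laughter / language understanding / response generation / dialogue understanding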
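Outline of Research at the Start	In spoken dialogue systems, turn-taking prediction, which detects the end of the user's utterance, is an essential function for achieving smooth dialogue. Conventional turn-taking prediction models have used only information from the preceding user utterance, whereas human conversation arguably takes more multifaceted information into account. This project therefore proposes a system that predicts turn-taking in conjunction with the dialogue understanding and utterance generation modules, which have operated independently in conventional spoken dialogue systems. It also works toward an integrated model in which the models of these multiple modules are trained jointly. In addition, the proposed prediction model will be deployed in a spoken dialogue system, and its usefulness will be confirmed through experiments with human participants.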
Outline of Final Research Achievements	A new turn-taking model, which predicts when a spoken dialogue system should take the right to speak, was developed. To mirror human turn-taking, a dialogue dataset was annotated so that the 'intent' and 'content' of each utterance could be discerned. A two-step turn-taking prediction model was then built: it first determines whether the 'intent' or 'content' is intelligible and then decides whether to take the turn. In addition, to extend the functionality of spoken dialogue systems, the generation of shared laughter was realized. A system composed of three modules (laughter detection, shared laughter prediction, and laughter type selection) was proposed, and its effectiveness was demonstrated.
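Academic Significance and Societal Importance of the Research Achievements	Spoken dialogue systems are deployed in conversational robots and smart speakers. However, interactions with these systems still have to be called mechanical, and turn-taking is one of the causes. In current systems, unnaturally long pauses and interruptions often occur when the system takes the right to speak, which reduces the smoothness of the dialogue. In human-human dialogue, by contrast, smooth turn-taking is achieved without any particular conscious effort. This research has presented one constructive approach toward elucidating the mechanism of human-human turn-taking.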
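
The two-step turn-taking decision and the three-module shared laughter system summarized in the final achievements can be pictured with a minimal sketch. This is not the project's implementation: the record does not describe the models, features, or thresholds. Every name here (UtteranceState, intent_score, content_score, silence_ms, choose_laugh), the fixed thresholds, the use of pause length in step two, and the laughter labels "social" and "mirthful" are assumptions made purely for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class UtteranceState:
    """Hypothetical per-utterance inputs; none of these fields come from the record."""
    intent_score: float   # assumed confidence in [0, 1] that the intent is understood
    content_score: float  # assumed confidence in [0, 1] that the content is understood
    silence_ms: int       # assumed pause length after the user's utterance


def is_intelligible(state: UtteranceState, threshold: float = 0.5) -> bool:
    """Step 1: is the 'intent' or the 'content' of the utterance intelligible?"""
    return state.intent_score >= threshold or state.content_score >= threshold


def should_take_turn(state: UtteranceState,
                     threshold: float = 0.5,
                     min_silence_ms: int = 200) -> bool:
    """Step 2: decide whether to take the turn, conditioned on step 1.

    Assumption: the system holds back when neither intent nor content is
    intelligible, and otherwise takes the turn after a long enough pause.
    """
    if not is_intelligible(state, threshold):
        return False
    return state.silence_ms >= min_silence_ms


def choose_laugh(user_laugh_prob: float,
                 shared_prob: float,
                 mirth_score: float,
                 threshold: float = 0.5) -> Optional[str]:
    """Three stages in sequence: detection, shared laughter prediction, type selection."""
    if user_laugh_prob < threshold:   # module 1: no user laughter detected
        return None
    if shared_prob < threshold:       # module 2: do not laugh along
        return None
    # module 3: pick a laughter type (labels are illustrative assumptions)
    return "mirthful" if mirth_score >= threshold else "social"


if __name__ == "__main__":
    print(should_take_turn(UtteranceState(0.9, 0.1, 300)))
    print(choose_laugh(0.9, 0.9, 0.2))
```

In a real system each score would come from a trained model rather than being passed in directly; the sketch only shows how the stages gate one another.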

Report

(4 results)

2022 Annual Research Report / Final Research Report (PDF)
2021 Research-status Report
2020 Research-status Report

Research Products
(13 results)

Journal Article (4 results) (of which Int'l Joint Research: 1 result, Peer Reviewed: 3 results, Open Access: 4 results) / Presentation (9 results) (of which Int'l Joint Research: 3 results, Invited: 1 result)

[Journal Article] Can a robot laugh with you?: Shared laughter generation for empathetic spoken dialogue (2022)
- Author(s)
  Inoue Koji, Lala Divesh, Kawahara Tatsuya
- Journal Title
  Frontiers in Robotics and AI
  Volume: 9 Pages: 1-11
- DOI
  10.3389/frobt.2022.933261
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] An Attentive Listening System for Autonomous Android ERICA: Comparative Evaluation with Human Attentive Listeners (2021)
- Author(s)
  Koji Inoue, Divesh Lala, Kenta Yamamoto, Shizuka Nakamura, Katsuya Takanashi, Tatsuya Kawahara
- Journal Title
  Transactions of the Japanese Society for Artificial Intelligence
  Volume: 36 Issue: 5 Pages: H-L51_1-12
- DOI
  10.1527/tjsai.36-5_H-L51
- NAID
  130008082579
- ISSN
  1346-0714, 1346-8030
- Year and Date
  2021-09-01
- Related Report
  2021 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Studies on spoken dialogue with an Android (2020)
- Author(s)
  Koji Inoue, Tatsuya Kawahara
- Journal Title
  The Journal of the Acoustical Society of Japan
  Volume: 76 Issue: 4 Pages: 236-243
- DOI
  10.20697/jasj.76.4_236
- NAID
  130007920190
- ISSN
  0369-4232, 2432-2040
- Year and Date
  2020-04-01
- Related Report
  2020 Research-status Report
- Open Access
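[Journal Article] Implementation and evaluation of a job interview dialogue system that asks follow-up questions on an autonomous android (in Japanese) (2020)
- Author(s)
  Koji Inoue, Kohei Hara, Divesh Lala, Kenta Yamamoto, Shizuka Nakamura, Katsuya Takanashi, Tatsuya Kawahara
- Journal Title
  Transactions of the Japanese Society for Artificial Intelligence
  Volume: 35 Issue: 5 Pages: 1-10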
- DOI
  10.1527/tjsai.35-5_d-k43
- NAID
  130007895047
- Related Report
  2020 Research-status Report
- Peer Reviewed / Open Access
[Presentation] A multi-party attentive listening robot which stimulates involvement from side participants (2021)
- Author(s)
  Koji Inoue, Hiromi Sakamoto, Kenta Yamamoto, Divesh Lala, Tatsuya Kawahara
- Organizer
  SIGdial Meeting on Discourse and Dialogue (SIGDIAL)
- Related Report
  2021 Research-status Report
- Int'l Joint Research
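[Presentation] Shared laughter generation for spoken dialogue systems that express empathy (in Japanese) (2021)
- Author(s)
  Koji Inoue, Divesh Lala, Tatsuya Kawahara
- Organizer
  JSAI Special Interest Group on Spoken Language Understanding and Dialogue Processing (SIG-SLUD)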
- Related Report
  2021 Research-status Report
[Presentation] The spoken dialogue system of the android ERICA: Taking on the multimodal Turing test (in Japanese) (2021)
- Author(s)
  Koji Inoue
- Organizer
  Ongaku Symposium 2021 (音学シンポジウム2021)
- Related Report
  2021 Research-status Report
- Invited
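[Presentation] Real-time detection of backchannels and laughter for human-robot interaction (in Japanese) (2021)
- Author(s)
  Koji Inoue, Divesh Lala, Tatsuya Kawahara
- Organizer
  Acoustical Society of Japan 2021 Spring Meeting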
- Related Report
  2020 Research-status Report
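[Presentation] Prediction of shared laughter based on acoustic features in human-robot interaction (in Japanese) (2021)
- Author(s)
  Koji Inoue, Divesh Lala, Tatsuya Kawahara
- Organizer
  Acoustical Society of Japan 2021 Spring Meeting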
- Related Report
  2020 Research-status Report
[Presentation] Job interviewer android with elaborate follow-up question generation (2020)
- Author(s)
  Koji Inoue, Kohei Hara, Divesh Lala, Kenta Yamamoto, Shizuka Nakamura, Katsuya Takanashi, Tatsuya Kawahara
- Organizer
  International Conference on Multimodal Interaction (ICMI)
- Related Report
  2020 Research-status Report
- Int'l Joint Research
[Presentation] An attentive listening system with android ERICA: Comparison of autonomous and WOZ interactions (2020)
- Author(s)
  Koji Inoue, Divesh Lala, Kenta Yamamoto, Shizuka Nakamura, Katsuya Takanashi, Tatsuya Kawahara
- Organizer
  SIGdial Meeting on Discourse and Dialogue (SIGDIAL)
- Related Report
  2020 Research-status Report
- Int'l Joint Research
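[Presentation] Comparative evaluation against WOZ of the attentive listening dialogue system of the android ERICA (in Japanese) (2020)
- Author(s)
  Koji Inoue, Divesh Lala, Kenta Yamamoto, Shizuka Nakamura, Katsuya Takanashi, Tatsuya Kawahara
- Organizer
  JSAI Special Interest Group on Spoken Language Understanding and Dialogue Processing (SIG-SLUD)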
- Related Report
  2020 Research-status Report
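[Presentation] Evaluation of the attentive listening dialogue system of the autonomous android ERICA through comparison with WOZ (in Japanese) (2020)
- Author(s)
  Koji Inoue, Divesh Lala, Kenta Yamamoto, Shizuka Nakamura, Katsuya Takanashi, Tatsuya Kawahara
- Organizer
  Acoustical Society of Japan 2020 Autumn Meeting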
- Related Report
  2020 Research-status Report
