Temporal structural modeling of conversational interaction and efficient information transfer using speech media.

Research Project

Project/Area Number	18H04128
Research Category	Grant-in-Aid for Scientific Research (A)
Allocation Type	Single-year Grants
Section	General
Review Section	Medium-sized Section 62: Applied informatics and related fields
Research Institution	Waseda University
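Principal Investigator	Kobayashi Tetsunori, Waseda University, Faculty of Science and Engineering, Professor (30162001)
Co-Investigator(Kenkyū-buntansha)	Fujie Shinya, Chiba Institute of Technology, Faculty of Advanced Engineering, Professor (00367062); Mori Hiroki, Utsunomiya University, Faculty of Engineering, Associate Professor (10302184); Tokuda Keiichi, Nagoya Institute of Technology, Graduate School of Engineering, Professor (20217483)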
Project Period (FY)	2018-04-01 – 2021-03-31
Project Status	Completed (Fiscal Year 2021)
Budget Amount	¥44,720,000 (Direct Cost: ¥34,400,000, Indirect Cost: ¥10,320,000)
  Fiscal Year 2020: ¥12,870,000 (Direct Cost: ¥9,900,000, Indirect Cost: ¥2,970,000)
  Fiscal Year 2019: ¥12,870,000 (Direct Cost: ¥9,900,000, Indirect Cost: ¥2,970,000)
  Fiscal Year 2018: ¥18,980,000 (Direct Cost: ¥14,600,000, Indirect Cost: ¥4,380,000)
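Keywords	spoken conversation system / low-latency speech recognition / expressive speech synthesis / paralinguistic understanding / utterance timing estimation / conversation system / conversational rhythm / conversational speech synthesis / information access / utterance timing control / analysis of conversation activation factors / information behavior / information transfer / scenario-driven / conversation activation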
Outline of Final Research Achievements	To convey large amounts of information efficiently through speech media, it is important to incorporate conversational elements into information delivery and to maintain the rhythm of the interaction. We modeled the constraints on the temporal structure of conversational interaction that underlie rhythmic conversation and incorporated the model into our information delivery system. While delivering a summarized document, the system monitors the user's responses at all times and, in response to them, restores and presents information that was omitted during summarization. These features enabled efficient document delivery through spoken conversation. In addition, as key component technologies, we developed low-latency speech recognition, expressive speech synthesis, and paralinguistic understanding to enhance the system's performance.
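Academic Significance and Societal Importance of the Research Achievements	Speech is a medium that places little burden on the user, and information access technology based on it has long been desired; however, conventional systems handled only short texts of a few sentences at most. To convey a large amount of information smoothly through speech media, a system must accept questions as appropriate while delivering information and answer them, all in good rhythm, yet there had been no research on conversational rhythm until now. This research laid, for the first time, the foundation for highly convenient information transfer via speech media. It also brought a new perspective, the close integration of information provision (Push) and acquisition (Pull), to information behavior research, which had conventionally centered on information retrieval and question answering (Pull).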

Report

(4 results)

Research Products
(102 results)

All Journal Article (15 results) (of which Peer Reviewed: 14 results, Open Access: 9 results) Presentation (86 results) (of which Int'l Joint Research: 16 results, Invited: 4 results) Patent(Industrial Property Rights) (1 results)

[Journal Article] 対話システムはどのように話すべきか: 実際の会話データに基づく話し言葉の合成2022
- Author(s)
  森大毅
- Journal Title
  
  日本音響学会誌
  
  Volume: 78 Pages: 283-288
- Related Report
  2020 Annual Research Report
[Journal Article] Comparison of machine learning algorithms and acoustic features in emotion recognition from spontaneous speech2022
- Author(s)
  Takahisa Iizuka, Hiroki Mori
- Journal Title
  
  Acoustical Science and Technology
  
  Volume: 43
- Related Report
  2020 Annual Research Report
- Peer Reviewed
[Journal Article] PeriodNet: A Non-Autoregressive Raw Waveform Generative Model With a Structure Separating Periodic and Aperiodic Components2021
- Author(s)
  Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Journal Title
  
  IEEE Access
  
  Volume: 9 Pages: 137599-137612
- DOI
  10.1109/access.2021.3118033
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Effectiveness of Speech Mode Adaptation for Improving Dialogue Speech Synthesis2019
- Author(s)
  Kazuki Kaya and Hiroki Mori
- Journal Title
  
  IEICE Transactions on Information and Systems
  
  Volume: E102.D Issue: 10 Pages: 2064-2066
- DOI
  10.1587/transinf.2019EDL8024
- NAID
  130007722181
- ISSN
  0916-8532, 1745-1361
- Year and Date
  2019-10-01
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Experimental Investigation on Expression of Additional Nuances by Controlling Sentence-final Intonation for Improving Expressiveness in Conversational Speech Synthesis2019
- Author(s)
  岩田和彦, 小林哲則
- Journal Title
  
  電子情報通信学会論文誌D 情報・システム
  
  Volume: J102-D Issue: 6 Pages: 442-453
- DOI
  10.14923/transinfj.2018JDP7055
- ISSN
  1880-4535, 1881-0225
- Year and Date
  2019-06-01
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Speech Synthesis for Conversational News Contents Delivery2019
- Author(s)
  高津弘明, 福岡維新, 藤江真也, 岩田和彦, 小林哲則
- Journal Title
  
  Transactions of the Japanese Society for Artificial Intelligence
  
  Volume: 34 Issue: 2 Pages: B-I65_1-15
- DOI
  10.1527/tjsai.B-I65
- NAID
  130007606815
- ISSN
  1346-0714, 1346-8030
- Year and Date
  2019-03-01
- Related Report
  2018 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Speaker adversarial training of DPGMM-based feature extractor for zero-resource languages2019
- Author(s)
  Yosuke Higuchi, Naohiro Tawara, Tetsunori Kobayashi, Tetsuji Ogawa
- Journal Title
  
  Proc. Interspeech 2019
  
  Volume: - Pages: 266-270
- DOI
  10.21437/interspeech.2019-2052
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Towards Answer-unaware Conversational Question Generation2019
- Author(s)
  Mao Nakanishi, Tetsunori Kobayashi, Yoshihiko Hayashi
- Journal Title
  
  Proc. 2nd Workshop on Machine Reading for Question Answering (MRQA2019)
  
  Volume: -
- DOI
  10.18653/v1/d19-5809
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Recognition of Intentions of Users’ Short Responses for Conversational News Delivery System2019
- Author(s)
  Hiroaki Takatsu, Katsuya Yokoyama, Yoichi Matsuyama, Hiroshi Honda, Shinya Fujie, Tetsunori Kobayashi
- Journal Title
  
  Proc. Interspeech 2019
  
  Volume: - Pages: 1193-1197
- DOI
  10.21437/interspeech.2019-2121
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Multi-channel speech enhancement using time-domain convolutional denoising autoencoder2019
- Author(s)
  Naohiro Tawara, Tetsunori Kobayashi, Tetsuji Ogawa
- Journal Title
  
  Proc. Interspeech 2019
  
  Volume: - Pages: 86-90
- DOI
  10.21437/interspeech.2019-3197
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Deep neural network based real-time speech vocoder with periodic and aperiodic inputs2019
- Author(s)
  Keiichiro Oura, Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda
- Journal Title
  
  10th ISCA Speech Synthesis Workshop (SSW10)
  
  Volume: - Pages: 13-18
- DOI
  10.21437/ssw.2019-32
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis2019
- Author(s)
  Takato Fujimoto, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Journal Title
  
  10th ISCA Speech Synthesis Workshop (SSW10)
  
  Volume: - Pages: 166-171
- DOI
  10.21437/ssw.2019-30
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Speaker-dependent WaveNet-based delay-free ADPCM speech coding2019
- Author(s)
  Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Journal Title
  
  2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
  
  Volume: - Pages: 7145-7149
- DOI
  10.1109/icassp.2019.8682264
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Singing voice synthesis based on generative adversarial networks2019
- Author(s)
  Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Journal Title
  
  2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
  
  Volume: - Pages: 6955-6959
- Related Report
  2019 Annual Research Report
- Peer Reviewed
[Journal Article] Conversational and social laughter synthesis with WaveNet2019
- Author(s)
  Hiroki Mori, Tomohiro Nagata, and Yoshiko Arimoto
- Journal Title
  
  Proc. Interspeech 2019
  
  Volume: - Pages: 520-523
- DOI
  10.21437/interspeech.2019-2131
- Related Report
  2019 Annual Research Report
- Peer Reviewed / Open Access
[Presentation] 粒度の異なるサブワード単位に基づく階層的条件付きEnd-to-End音声認識2022
- Author(s)
  樋口陽祐, 軽部敬太, 小川哲司, 小林哲則
- Organizer
  日本音響学会2022年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 声質・声の高さ・話速を変更可能なニューラルボコーダ構成法の検討2022
- Author(s)
  佐々木一匡, 吉村建慶, 高木信二, 橋本佳, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2022年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] HSMM構造化アテンションに基づく音声合成のためのメモリ削減手法2022
- Author(s)
  藤本崇人, 橋本佳, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2022年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 非周期性指標を考慮したニューラルボコーダの学習2022
- Author(s)
  法野行哉, 高木信二, 橋本佳, 中村和寛, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2022年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] Autoregressive variational autoencoder with a hidden semi-Markov model-based structured attention for speech synthesis2022
- Author(s)
  Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda
- Organizer
  2021 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] 複数の自然対話音声コーパスの併用によるend-to-end対話音声合成の高品質化2022
- Author(s)
  西野広直，森大毅
- Organizer
  日本音響学会2022年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] An investigation of enhancing CTC model for triggered attention-based streaming ASR2021
- Author(s)
  Huaibo Zhao, Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2021 (APSIPA2021)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Timing Generating Networks: Neural Network Based Precise Turn-Taking Timing Prediction in Multiparty Conversation2021
- Author(s)
  Shinya Fujie, Hayato Katayama, Jin Sakuma, and Tetsunori Kobayashi
- Organizer
  Interspeech 2021
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Improved Mask-CTC for non-autoregressive end-to-end ASR2021
- Author(s)
  Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi
- Organizer
  2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2021)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] End-to-End音声認識のための粒度の異なるサブワード単位に基づく階層的な条件付け2021
- Author(s)
  樋口陽祐, 軽部敬太, 小川哲司, 小林哲則
- Organizer
  情報処理学会研究報告 (SLP)
- Related Report
  2020 Annual Research Report
[Presentation] Self-Attention を用いた多人数会話向け発話タイミング推定2021
- Author(s)
  佐久間仁, 藤江真也, 小林哲則
- Organizer
  人工知能学会第93回言語・音声理解と対話処理研究会
- Related Report
  2020 Annual Research Report
[Presentation] Triggered attention型ストリーミング音声認識におけるMask-CTCを用いた事前学習2021
- Author(s)
  趙懐博, 樋口陽祐, 小林哲則, 小川哲司
- Organizer
  情報処理学会研究報告 (SLP)
- Related Report
  2020 Annual Research Report
[Presentation] Personalized Extractive Summarization for a News Dialogue System2021
- Author(s)
  Hiroaki Takatsu, Mayu Okuda, Yoichi Matsuyama, Hiroshi Honda, Shinya Fujie, and Tetsunori Kobayashi
- Organizer
  The 8th IEEE Spoken Language Technology Workshop (SLT2021)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] Noise-robust attention learning for end-to-end speech recognition2021
- Author(s)
  Yosuke Higuchi, Naohiro Tawara, Atsunori Ogawa, Tomoharu Iwata, Tetsunori Kobayashi, Tetsuji Ogawa
- Organizer
  The 2020 28th European Signal Processing Conference (EUSIPCO2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] TGNN による発話期待度のモデル化に基づく発話タイミング推定2021
- Author(s)
  佐久間仁, 片山颯人, 藤江真也, 小林哲則
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components2021
- Author(s)
  Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Organizer
  2021 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] 因子分析に基づくHSMMを利用した構造化アテンション音声合成2021
- Author(s)
  高木信二, 牛田光一, 橋本佳, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2021年秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 隠れセミマルコフモデルによる構造化アテンションを用いた自己回帰型VAEに基づくsequence-to-sequence音声合成2021
- Author(s)
  藤本崇人, 橋本佳, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2021年秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 学習時と合成時の一貫性を考慮したVAEに基づく自己回帰型sequence-to-sequence音声合成2021
- Author(s)
  藤本崇人, 橋本佳, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 隠れセミマルコフモデルに基づく構造化アテンションを用いたSequence-to-Sequence音声合成2021
- Author(s)
  角谷健太, 吉村建慶, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 周期・非周期成分の分離に基づくニューラルボコーダによる音声波形のモデル化の検討2021
- Author(s)
  法野行哉, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 勾配ブースティング決定木を用いた音声合成手法の検討2021
- Author(s)
  岩田康平, 高木信二, 橋本佳, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 自発音声に基づく合成音声で対話するシステムがユーザに与える影響の調査2021
- Author(s)
  飯塚喬久，森大毅，西野広直
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 韻律を考慮したend-to-end方式に基づく自発音声合成2021
- Author(s)
  西野広直，森大毅
- Organizer
  日本音響学会2021年春季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 自発音声コーパスを用いて合成した音声で話すエージェントが会話相手の行動に与える影響2021
- Author(s)
  飯塚喬久，森大毅
- Organizer
  日本音響学会2021年秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 感情次元の操作を目的とした声質変換手法の提案2021
- Author(s)
  向田圭汰, 森大毅
- Organizer
  電子情報通信学会音声研究会
- Related Report
  2020 Annual Research Report
[Presentation] Mask CTC: Non-autoregressive end-to-end ASR with CTC and mask predict2020
- Author(s)
  Yosuke Higuchi, Shinji Watanabe, Nanxin Chen, Tetsuji Ogawa, Tetsunori Kobayashi
- Organizer
  The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH2020)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] CTCとマスク推定に基づく推論速度の速いEnd-to-End音声認識2020
- Author(s)
  樋口陽祐, 稲熊寛文, 渡部晋治, 小川哲司, 小林哲則
- Organizer
  電子情報通信学会技術研究報告 (SP)
- Related Report
  2020 Annual Research Report
[Presentation] Timing Generating Networks: 会話の文脈を考慮したターンテイキングのタイミング推定2020
- Author(s)
  片山颯人, 藤江真也, 佐久間仁, 松山洋一, 小林哲則
- Organizer
  人工知能学会第90回言語・音声理解と対話処理研究会
- Related Report
  2020 Annual Research Report
[Presentation] Mask CTC: CTCとマスク推定に基づいた非自己回帰的なEnd-to-End音声認識2020
- Author(s)
  樋口陽祐, 渡部晋治, Chen Nanxin, 小川哲司, 小林哲則
- Organizer
  日本音響学会2020年秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 音声対話システムにおける発話期待度の逐次推定に基づくターンテイキングタイミングの予測2020
- Author(s)
  藤江真也, 片山颯人, 小林哲則
- Organizer
  人工知能学会全国大会（第34回）
- Related Report
  2020 Annual Research Report
[Presentation] 会話によるニュース記事伝達のための抽出型要約のパーソナライズ2020
- Author(s)
  高津弘明, 奥田真由, 松山洋一, 本田裕, 藤江真也, 小林哲則
- Organizer
  人工知能学会全国大会（第34回）
- Related Report
  2020 Annual Research Report
[Presentation] 会話によるニュース記事伝達のためのユーザの興味と記事要約戦略の関係性分析2020
- Author(s)
  奥田真由, 高津弘明, 松山洋一, 本田裕, 藤江真也, 小林哲則
- Organizer
  人工知能学会全国大会（第34回）
- Related Report
  2020 Annual Research Report
[Presentation] 音声合成における特徴的な発話スタイルの転移学習2020
- Author(s)
  久野宏彰，高木信二，橋本佳，大浦圭一郎，南角吉彦，徳田恵一
- Organizer
  第18回情報学ワークショップ
- Related Report
  2020 Annual Research Report
[Presentation] 音声合成における敵対的生成ネットワークを用いた複数言語・複数話者モデリングの検討2020
- Author(s)
  大谷眞史，佐藤優介，高木信二，橋本佳，大浦圭一郎，南角吉彦，徳田恵一
- Organizer
  第18回情報学ワークショップ
- Related Report
  2020 Annual Research Report
[Presentation] 大規模音楽データを活用した汎用WaveNetボコーダ構成法の検討2020
- Author(s)
  佐々木一匡，吉村建慶，橋本佳，大浦圭一郎，南角吉彦，徳田恵一
- Organizer
  第18回情報学ワークショップ
- Related Report
  2020 Annual Research Report
[Presentation] 勾配ブースティング決定木を用いた高速な音声合成手法の検討2020
- Author(s)
  岩田康平，高木信二，橋本佳，南角吉彦，徳田恵一
- Organizer
  第18回情報学ワークショップ
- Related Report
  2020 Annual Research Report
[Presentation] Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis2020
- Author(s)
  Yukiya Hono, Kazuna Tsuboi, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Organizer
  Interspeech 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] 感情音声合成のためのDirichlet VAE2020
- Author(s)
  藤本崇人, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2020年秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] DNNに基づく音声ボコーダにおける周期・非周期成分のモデル化の検討2020
- Author(s)
  法野行哉, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2020年秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] 音声合成における敵対的生成ネットワークを用いた複数言語・複数話者モデリング2020
- Author(s)
  大谷眞史, 佐藤優介, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2020年秋季研究発表会
- Related Report
  2020 Annual Research Report
[Presentation] Semi-supervised learning based on hierarchical generative models for end-to-end speech synthesis2020
- Author(s)
  Takato Fujimoto, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Organizer
  2020 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Related Report
  2020 Annual Research Report
- Int'l Joint Research
[Presentation] 質問応答の強化学習による抽象型要約の精度向上2020
- Author(s)
  高塚雅人, 小林哲則, 林良彦
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] ニュース対話システムにおける感情音声合成のためのニュース記事の文に対する感情ラベルのアノテーションと識別2020
- Author(s)
  高津弘明,安藤涼太,松山洋一,小林哲則
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] 答えを用いない対話型質問の生成2020
- Author(s)
  中西真央, 小林哲則, 林良彦
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] 感情推定における感情カテゴリに関する先験的知識の利用2020
- Author(s)
  田辺ひかり, 小川哲司, 小林哲則, 林良彦
- Organizer
  言語処理学会第26回年次大会
- Related Report
  2019 Annual Research Report
[Presentation] 音声対話システムのためのターンテイキングのタイミングの評価2020
- Author(s)
  藤江真也，小林哲則
- Organizer
  日本音響学会2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 多人数のための音響・言語情報の重要度を考慮した応答義務推定2020
- Author(s)
  柴田　護，藤江真也
- Organizer
  日本音響学会2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 傾聴対話のための音声対話ロボットの開発と評価2020
- Author(s)
  伊島翔大，関根みくり，藤江真也
- Organizer
  日本音響学会2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 原稿の有無が説明者の発話と被説明者の反応に与える影響の分析2020
- Author(s)
  高松屋友翼，森大毅
- Organizer
  日本音響学会2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] End-to-middle training based action generation for multi-party conversation robot2019
- Author(s)
  Hayato Katayama, Shinya Fujie and Tetsunori Kobayashi
- Organizer
  10th International Workshop on Spoken Dialogue Systems Technology (IWSDS) 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] 多人数会話システムのための話者の状態変化に頑健な注視状態検出2019
- Author(s)
  野川賢二郎,藤江真也,小林哲則
- Organizer
  2019年度人工知能学会全国大会(第33回)
- Related Report
  2019 Annual Research Report
[Presentation] 会話によるニュース記事伝達のためのトリビアの獲得と活用2019
- Author(s)
  高津弘明,松山洋一,本田裕,藤江真也,小林哲則
- Organizer
  2019年度人工知能学会全国大会(第33回)
- Related Report
  2019 Annual Research Report
[Presentation] End-to-end学習を用いたマルチモーダル多人数会話における対話ロボットの行動ターゲット生成2019
- Author(s)
  片山颯人，藤江真也，小林哲則
- Organizer
  2019年度人工知能学会全国大会(第33回)
- Related Report
  2019 Annual Research Report
[Presentation] スマートスピーカにおける多人数会話のための音響・言語情報を用いた応答義務推定2019
- Author(s)
  柴田　護，糸日谷篤人，藤江真也
- Organizer
  日本音響学会2019年秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] Statistical approach to speech synthesis: past, present and future2019
- Author(s)
  Keiichi Tokuda
- Organizer
  Interspeech 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] 統計的音声合成の進展と展望2019
- Author(s)
  徳田恵一
- Organizer
  電子情報通信学会音声研究会
- Related Report
  2019 Annual Research Report
- Invited
[Presentation] 統計的歌声合成技術とその実用化2019
- Author(s)
  大浦圭一郎
- Organizer
  日本AI音楽学会
- Related Report
  2019 Annual Research Report
- Invited
[Presentation] 統計的パラメトリック音声合成技術とその実用化2019
- Author(s)
  大浦圭一郎
- Organizer
  情報処理学会音声言語情報処理研究会
- Related Report
  2019 Annual Research Report
- Invited
[Presentation] 歌声合成におけるニューラルボコーダの比較検討2019
- Author(s)
  和田蒼汰, 法野行哉, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  電子情報通信学会音声研究会
- Related Report
  2019 Annual Research Report
[Presentation] 周期・非周期信号を用いたDNNに基づくリアルタイム音声ボコーダ2019
- Author(s)
  大浦圭一郎, 中村和寛, 橋本佳, 南角吉彦, 徳田恵一
- Organizer
  情報処理学会音声言語情報処理研究会
- Related Report
  2019 Annual Research Report
[Presentation] End-to-End音声合成のための階層化生成モデルに基づく半教師あり学習2019
- Author(s)
  藤本崇人, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2020年春季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 周期・非周期信号を用いた敵対的生成ネットワークに基づくリアルタイム音声ボコーダ2019
- Author(s)
  大浦圭一郎, 高木信二, 中村和寛, 橋本佳, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2019年秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 楽譜時間情報を用いたアテンション機構に基づく歌声合成の検討2019
- Author(s)
  村田舜馬, 藤本崇人, 法野行哉, 高木信二, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会2019年秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 自発音声に対するニューラルF0モデリングの可能性2019
- Author(s)
  永田智洋, 森大毅
- Organizer
  日本音響学会2019年秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] 感情表出系感動詞の鼻音化に関する種々の分析2019
- Author(s)
  高岸勇斗, 森大毅
- Organizer
  日本音響学会2019年秋季研究発表会
- Related Report
  2019 Annual Research Report
[Presentation] システム発話の文脈を考慮した発話意図理解2019
- Author(s)
  高津弘明, 横山勝矢, 本田裕, 藤江真也, 小林哲則
- Organizer
  言語処理学会第25回年次大会
- Related Report
  2018 Annual Research Report
[Presentation] 会話によるニュース記事伝達のための要約2019
- Author(s)
  高津弘明, 本田裕, 藤江真也, 林良彦, 小林哲則
- Organizer
  言語処理学会第25回年次大会
- Related Report
  2018 Annual Research Report
[Presentation] 隠れセミマルコフモデルの構造を用いたDNNに基づく音声合成における計算量削減手法の検討2019
- Author(s)
  島田基樹, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会 2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] 周期・非周期信号から駆動するディープニューラルネットワークに基づく音声ボコーダ2019
- Author(s)
  藤本崇人, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会 2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] DNNに基づく感情音声合成のための敵対的学習の検討2019
- Author(s)
  角谷健太, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会 2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] イベント継続時間モデルを用いた聞き手反応の検出2019
- Author(s)
  森本洋介，森大毅
- Organizer
  日本音響学会 2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] スマートスピーカにおける多人数会話のための応答義務推定2019
- Author(s)
  柴田　護，藤江真也
- Organizer
  日本音響学会 2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] 傾聴対話システムのための高齢者発話の継続／終了識別2019
- Author(s)
  伊島翔大，藤江真也
- Organizer
  日本音響学会 2019年春季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] Investigation of Users’ Short Responses in Actual Conversation System and Automatic Recognition of their Intentions2018
- Author(s)
  K Yokoyama, H Takatsu, H Honda, S Fujie, T Kobayashi
- Organizer
  2018 IEEE Spoken Language Technology Workshop (SLT), 934-940
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] 会話によるニュース記事伝達のための発話意図の分類と認識2018
- Author(s)
  横山勝矢，高津弘明，本田裕，藤江真也，小林哲則
- Organizer
  情報処理学会音声言語情報処理研究会
- Related Report
  2018 Annual Research Report
[Presentation] 会話によるニュース記事伝達のための発話意図分類とデータベースの構築2018
- Author(s)
  横山勝矢, 高津弘明, 本田裕, 藤江真也, 林良彦, 小林哲則
- Organizer
  人工知能学会全国大会
- Related Report
  2018 Annual Research Report
[Presentation] 会話によるニュース記事伝達のための発話意図理解2018
- Author(s)
  高津弘明, 横山勝矢, 本田裕, 藤江真也, 林良彦, 小林哲則
- Organizer
  人工知能学会全国大会
- Related Report
  2018 Annual Research Report
[Presentation] Speech Synthesis Using WaveNet Vocoder Based on Periodic/Aperiodic Decomposition2018
- Author(s)
  Takato Fujimoto, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] Speaker Adaptation for Speech Synthesis Based on Deep Neural Networks Using Hidden Semi-Markov Model Structures2018
- Author(s)
  Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Organizer
  Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2018)
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] The NITech text-to-speech system for the Blizzard Challenge 20182018
- Author(s)
  Kei Sawada, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, and Keiichi Tokuda
- Organizer
  Blizzard Challenge 2018 Workshop
- Related Report
  2018 Annual Research Report
- Int'l Joint Research
[Presentation] 時間構造を考慮したニューラルネットワークに基づく音声合成における話者適応の検討2018
- Author(s)
  中尾健人, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  電子情報通信学会音声研究会
- Related Report
  2018 Annual Research Report
[Presentation] 周期・非周期成分の分離に基づくWaveNetボコーダを用いた音声合成2018
- Author(s)
  藤本崇人, 吉村建慶, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会 2018年秋季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] Blizzard Challenge 2018のためのNITechテキスト音声合成システム2018
- Author(s)
  沢田慶, 吉村建慶, 橋本佳, 大浦圭一郎, 南角吉彦, 徳田恵一
- Organizer
  日本音響学会 2018年秋季研究発表会
- Related Report
  2018 Annual Research Report
[Presentation] 傾聴対話システムのための高齢者音声を用いた発話終了判定2018
- Author(s)
  伊島翔大，藤江真也
- Organizer
  第17回情報科学技術フォーラム，FIT 2018
- Related Report
  2018 Annual Research Report
[Presentation] 音声対話システムのためのユーザの発話権維持状態の逐次推定2018
- Author(s)
  藤江真也, 横山勝矢, 小林哲則
- Organizer
  人工知能学会全国大会
- Related Report
  2018 Annual Research Report
[Patent(Industrial Property Rights)] 情報再生プログラム、情報再生方法、情報処理装置及びデータ構造2020
- Inventor(s)
  高津弘明，小林哲則，藤江真也，松山洋一
- Industrial Property Rights Holder
  高津弘明，小林哲則，藤江真也，松山洋一
- Industrial Property Rights Type
  Patent
- Industrial Property Number
  2020-176641
- Filing Date
  2020
- Related Report
  2020 Annual Research Report
