2017 Fiscal Year Annual Research Report

Research on Statistical Singing Voice Timbre Control Considering Singing Voice Perception

Research Project

Project/Area Number	16J10726
Research Institution	Nagoya University
Principal Investigator	Kazuhiro Kobayashi, Nagoya University, Information Technology Center, Research Fellow (PD)
Project Period (FY)	2016-04-22 – 2018-03-31
Keywords	perceptual information / sprocket / singing voice conversion
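Outline of Annual Research Achievements	The research achievements for this fiscal year are summarized below. [Development and release of "sprocket", a freely licensed base framework for voice conversion and control] We developed and released sprocket as open-source software for statistical voice conversion and control, and wrote a paper describing the software. sprocket has been selected as the baseline system for the Voice Conversion Challenge 2018 and is expected to see wide use. [Writing and submission of a paper on singing voice conversion] We wrote up our results on singing voice conversion based on differential spectrum compensation and submitted the paper to the journal Speech Communication. [Writing of an international conference paper on statistical voice timbre control considering perceptual information] As the core method of this research project, we compiled our results on how to design the control parameters for statistical voice timbre control into an international conference paper. The paper proposes a method that ensures the independence of multiple voice timbre control parameters within the voice timbre control vector space, so that control matches the user's perception more closely. Although these results were evaluated on speech, the framework is also applicable to singing voice timbre control. We plan to apply it to singing voice timbre control and evaluate its performance. [Improving voice conversion and control quality with the WaveNet vocoder] WaveNet is a deep-learning-based technique for generating speech waveforms. In this study, as a framework that applies the WaveNet network architecture, we proposed a WaveNet vocoder that generates speech waveforms using F0, spectral envelope information, and aperiodicity as auxiliary features. The proposed method can generate speech waveforms of higher quality than conventional vocoder frameworks.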
Research Progress Status	Not entered, as FY2017 is the final year of the project.
Strategy for Future Research Activity	Not entered, as FY2017 is the final year of the project.

Research Products
(7 results)

Journal Article (2 results; of which Int'l Joint Research: 2, Peer Reviewed: 2) / Presentation (3 results; of which Int'l Joint Research: 3) / Remarks (2 results)

[Journal Article] Intra-gender statistical singing voice conversion with direct waveform modification using log-spectral differential (2018)
- Author(s)
  K. Kobayashi, T. Toda, S. Nakamura
- Journal Title
  Speech Communication
  Volume: 99 Pages: 211-220
- Peer Reviewed / Int'l Joint Research
[Journal Article] Articulatory controllable speech modification based on statistical inversion and production mappings (2017)
- Author(s)
  P.L. Tobing, K. Kobayashi, T. Toda
- Journal Title
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  Volume: 25 Pages: 2337-2350
- Peer Reviewed / Int'l Joint Research
[Presentation] Speaker-dependent WaveNet vocoder (2017)
- Author(s)
  A. Tamamori, K. Kobayashi, T. Hayashi, K. Takeda, T. Toda
- Organizer
  INTERSPEECH
- Int'l Joint Research
[Presentation] Statistical voice conversion with WaveNet-based waveform generation (2017)
- Author(s)
  K. Kobayashi, T. Hayashi, A. Tamamori, T. Toda
- Organizer
  INTERSPEECH
- Int'l Joint Research
[Presentation] An Investigation of how to design control parameters for statistical voice timbre control (2017)
- Author(s)
  K. Kubo, K. Kobayashi, T. Toda, G. Neubig, S. Sakti, S. Nakamura
- Organizer
  APSIPA
- Int'l Joint Research
[Remarks] Laboratory website
- URL
  https://www.toda.is.i.nagoya-u.ac.jp/publications_FY2017.html
[Remarks] Personal website
- URL
  https://scholar.google.co.jp/citations?user=c-AwXZQAAAAJ&hl=ja
