発声障害者のための高品質かつ柔軟な音声合成技術の確立

Research Project

Project/Area Number	15J10727
Research Category	Grant-in-Aid for JSPS Fellows
Allocation Type	Single-year Grants
Section	国内
Research Field	Perceptual information processing
Research Institution	Nara Institute of Science and Technology
Principal Investigator	田中宏奈良先端科学技術大学院大学, 情報科学研究科, 特別研究員(DC2)
Project Period (FY)	2015-04-24 – 2017-03-31
Project Status	Completed (Fiscal Year 2016)
Budget Amount *help	¥1,900,000 (Direct Cost: ¥1,900,000) Fiscal Year 2016: ¥900,000 (Direct Cost: ¥900,000) Fiscal Year 2015: ¥1,000,000 (Direct Cost: ¥1,000,000)
Keywords	統計的電気音声発声 / 無喉頭音声 / 電気音声 / 電気式人工喉頭 / 統計的音源予測 / 生成モデル / 喉頭摘出者 / 音声合成 / 基本周波数
Outline of Annual Research Achievements	本年度の業績は，査読付き英語論文１本，査読付き国際会議２本，国内会議１本である．本研究課題は，「発声障害者のための高品質かつ柔軟な音声合成技術の確立」である．本年度は，（１）統計的電気音声発声（実時間版）のためのモデル学習および予測手法の改善，（２）統計的電気音声発声（オフライン版）のためのモデル学習および予測手法の改善を行った．（１）に関して，昨年度実装した入力される無喉頭音声（電気音声）から実時間予測される韻律情報（F0 パターン）に応じて電気式人工喉頭を直接制御する枠組み（実時間統計的電気音声発声）において，先読み予測および学習する特徴量を工夫することにより，韻律情報の予測精度を改善した．また，主観評価実験においても，従来の電気音声と比較して，大幅な自然性の改善を確認した．なお，以上の内容をまとめた論文を，電子情報通信学会の英語論文誌に投稿し，採択された．（２）に関して，オフライン版の予測精度は実時間版の予測精度の上限値となるため，オフライン版の予測精度改善は必要である．本年度は，入力される電気音声からF0 パターンを予測するためのオフライン版の統計モデルを新たに提案した．従来の統計モデルを用いて予測されるF0パターンは入力された電気音声に対して最尤であるが，時として人が発声し得ない不自然なF0パターンとなってしまう．この問題に対して，F0パターンの物理的な生成過程の制約を導入することで，電気音声に対応する自然な（人が生成し得る）F0パターンを統計的に予測する手法を提案した．評価実験により制約を組み込んだ予測処理を可能とすることで，F0パターンの予測精度を改善することを確認した．
Research Progress Status	28年度が最終年度であるため、記入しない。
Strategy for Future Research Activity	28年度が最終年度であるため、記入しない。

Report

(2 results)

2016 Annual Research Report
2015 Annual Research Report

Research Products
(11 results)

All 2017 2016 2015 Other

All Journal Article (1 results) (of which Int'l Joint Research: 1 results, Peer Reviewed: 1 results, Open Access: 1 results, Acknowledgement Compliant: 1 results) Presentation (9 results) (of which Int'l Joint Research: 4 results) Remarks (1 results)

[Journal Article] A Vibration Control Method of an Electrolarynx based on Statistical F0 Pattern Prediction2017
- Author(s)
  Kou Tanaka, Tomoki Toda, and Satoshi Nakamura
- Journal Title
  
  IEICE Transactions Information and Systems
  
  Volume: E100-D
- Related Report
  2016 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research / Acknowledgement Compliant
[Presentation] F0パターン生成過程の確率モデルに基づく電気音声に対するフレーズ・アクセント指令推定2017
- Author(s)
  田中宏，亀岡弘和，戸田智基，中村哲
- Organizer
  日本音響学会春季研究発表会
- Place of Presentation
  明治大学生田キャンパス（神奈川県川崎市）
- Year and Date
  2017-03-09
- Related Report
  2016 Annual Research Report
[Presentation] Evaluation of Electrolarynx Controlled by Real-time Statistical F0 Prediction2016
- Author(s)
  Kou Tanaka, Tomoki Toda, Sakriani Sakti, and Satoshi Nakamura
- Organizer
  5th Joint Meeting of the ASA and the ASJ
- Place of Presentation
  Hawaii, USA
- Year and Date
  2016-11-28
- Related Report
  2016 Annual Research Report
- Int'l Joint Research
[Presentation] Real-time vibration control of an electrolarynx based on statistical F0 contour prediction2016
- Author(s)
  Kou Tanaka, Tomoki Toda, Graham Neubig and Satoshi Nakamura
- Organizer
  Proc. EUSIPCO
- Place of Presentation
  Budapest, Hungary
- Year and Date
  2016-08-29
- Related Report
  2016 Annual Research Report
- Int'l Joint Research
[Presentation] 電気音声強調のための統計的F0予測におけるProduct-of-ExpertsによるF0パターン生成過程モデルの導入2016
- Author(s)
  田中宏，亀岡弘和，戸田智基，中村哲
- Organizer
  SP
- Place of Presentation
  別府国際コンベンションセンター B-ConPlaza (大分県、別府市)
- Year and Date
  2016-03-28
- Related Report
  2015 Annual Research Report
[Presentation] Statistical F0 prediction for electrolaryngeal speech enhancement considering generative process of F0 contours within product of experts framework2016
- Author(s)
  Kou Tanaka, Hirokazu Kameoka, Tomoki Toda, and Satoshi Nakamura
- Organizer
  ICASSP
- Place of Presentation
  中国、上海
- Year and Date
  2016-03-20
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
[Presentation] F0パターン生成過程を考慮したProduct-of-Expertsに基づく電気音声強調のための統計的F0予測法2016
- Author(s)
  田中宏，亀岡弘和，戸田智基，中村哲
- Organizer
  ASJ
- Place of Presentation
  桐蔭横浜大学 (神奈川県、横浜市)
- Year and Date
  2016-03-09
- Related Report
  2015 Annual Research Report
[Presentation] An enhanced electrolarynx with automatic fundamental frequency control based on statistical prediction2015
- Author(s)
  Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti and Satoshi Nakamura
- Organizer
  ASSETS
- Place of Presentation
  ポルトガル、リスボン
- Year and Date
  2015-10-26
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
[Presentation] 統計的手法を用いた電気式人工喉頭制御における遅延時間と予測精度の調査2015
- Author(s)
  田中宏，戸田智基，ニュービッググラム，サクティサクリアニ，中村哲
- Organizer
  ASJ
- Place of Presentation
  会津大学 (福島県、会津若松市)
- Year and Date
  2015-09-16
- Related Report
  2015 Annual Research Report
[Presentation] リアルタイム音源予測に基づく電気式人工喉頭制御の実装2015
- Author(s)
  田中宏，戸田智基，ニュービッググラム，サクティサクリアニ，中村哲
- Organizer
  SP
- Place of Presentation
  新潟大学駅南キャンパスときめいと (新潟県、新潟市)
- Year and Date
  2015-06-18
- Related Report
  2015 Annual Research Report
[Remarks] 知能コミュニケーション研究室HP
- URL
  http://ahclab.naist.jp
- Related Report
  2016 Annual Research Report

発声障害者のための高品質かつ柔軟な音声合成技術の確立

Principal Investigator

田中 宏 奈良先端科学技術大学院大学, 情報科学研究科, 特別研究員(DC2)

¥1,900,000 (Direct Cost: ¥1,900,000)

Report

Research Products

[Journal Article] A Vibration Control Method of an Electrolarynx based on Statistical F0 Pattern Prediction2017

Author(s)

Journal Title

Related Report

[Presentation] F0パターン生成過程の確率モデルに基づく電気音声に対するフレーズ・アクセント指令推定2017

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Evaluation of Electrolarynx Controlled by Real-time Statistical F0 Prediction2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Real-time vibration control of an electrolarynx based on statistical F0 contour prediction2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 電気音声強調のための統計的F0予測におけるProduct-of-ExpertsによるF0パターン生成過程モデルの導入2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] Statistical F0 prediction for electrolaryngeal speech enhancement considering generative process of F0 contours within product of experts framework2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] F0パターン生成過程を考慮したProduct-of-Expertsに基づく電気音声強調のための統計的F0予測法2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] An enhanced electrolarynx with automatic fundamental frequency control based on statistical prediction2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 統計的手法を用いた電気式人工喉頭制御における遅延時間と予測精度の調査2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] リアルタイム音源予測に基づく電気式人工喉頭制御の実装2015

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Remarks] 知能コミュニケーション研究室HP

URL

Related Report

田中宏奈良先端科学技術大学院大学, 情報科学研究科, 特別研究員(DC2)