FY2023 Annual Research Report

Augmenting Speech Communication through Real-Time, Low-Latency Voice Conversion Using Multimodal Signals

Research Project

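Project/Area Number	22KJ1519
Allocation Type	Multi-year Fund
Research Institution	Nagoya University
Principal Investigator	HUANG WENCHIN Nagoya University, Graduate School of Informatics, JSPS Research Fellow (DC1)
Project Period (FY)	2023-03-08 – 2024-03-31
Keywords	voice conversion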
Outline of Annual Research Achievements	The purpose of this research is to apply voice conversion (VC) to realize an interactive speech production paradigm for real-world applications, with the help of multimodal signals and real-time processing techniques. In the third year, we worked on improving both fundamental VC techniques and real-time processing techniques, concentrating on three aspects. (1) We organized the Singing Voice Conversion Challenge 2023, which aimed to improve and promote singing voice conversion, a special application of VC. We co-organized the challenge with Tencent AI Lab, China and CMU, USA, and held a special session at ASRU 2023, a flagship conference in speech processing. (2) We launched the VoiceMOS Challenge 2023, the second edition of a scientific event that encourages research on the automatic prediction of Mean Opinion Scores (MOS) for synthesized speech. This year the focus was on a real-world, zero-shot setting, and the challenge attracted 10 teams from academia and industry. We co-organized this challenge with NII, Japan and Academia Sinica, Taiwan, and again held a special session at ASRU 2023. (3) We proposed a sequence-to-sequence VC model with a non-autoregressive architecture that can be executed in real time. Compared to previous work, the training pipeline is simplified, and performance remains robust when training data is reduced, which is an important property for VC. The results were presented at ASJ2024, and we plan to submit a journal paper.

Research Products
(4 results)

Conference presentations (4 results; 3 at international conferences)

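[Presentation] AAS-VC: Robustness of temporal alignment learning in non-autoregressive sequence-to-sequence voice conversion (2024)
- Author(s)/Presenter(s)
  Wen-Chin Huang, Kazuhiro Kobayashi, Tomoki Toda
- Conference
  Proceedings of the Acoustical Society of Japan (ASJ) Meeting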
[Presentation] Evaluating methods for ground-truth-free foreign accent conversion (2023)
- Author(s)/Presenter(s)
  Wen-Chin Huang, Tomoki Toda
- Conference
  APSIPA ASC
- International conference
[Presentation] The Singing Voice Conversion Challenge 2023 (2023)
- Author(s)/Presenter(s)
  Wen-Chin Huang, Lester Violeta, Songxiang Liu, Jiatong Shi, Tomoki Toda
- Conference
  ASRU
- International conference
[Presentation] The VoiceMOS Challenge 2023: zero-shot subjective speech quality prediction for multiple domains (2023)
- Author(s)/Presenter(s)
  Erica Cooper, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi
- Conference
  ASRU
- International conference
